Models

12 models

Filtered by provider: DeepSeek

DeepSeek: DeepSeek V3 (Free)

Free input

Free output

DeepSeek-V3 is the latest model from the DeepSeek team, building upon the instruction-following and coding abilities of the previous versions. Pre-trained on nearly 15 trillion tokens, the reported evaluations...

by deepseek·131K context

text

deepseek-ai/deepseek-coder-6.7b-instruct

— input

— output

by deepseek-ai·33K context

text

deepseek-ai/deepseek-v4-flash

— input

— output

by deepseek-ai·33K context

text

deepseek-v4-flash (Free)

Max output: 66K

Free input

Free output

DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with 284B total parameters and 13B activated parameters, supporting a 1M-token context window. It is designed for fast inference and...

by deepseek·1.0M context

text

DeepSeek: DeepSeek V3 0324

$0.200/M input

$0.770/M output

DeepSeek V3, a 685B-parameter mixture-of-experts model, is the latest iteration of the flagship chat model family from the DeepSeek team. It succeeds the [DeepSeek V3](/deepseek/deepseek-chat-v3) model and performs really well...

by deepseek·164K context

text

DeepSeek: DeepSeek V3.1

$0.210/M input

$0.790/M output

DeepSeek-V3.1 is a large hybrid reasoning model (671B parameters, 37B active) that supports both thinking and non-thinking modes via prompt templates. It extends the DeepSeek-V3 base with a two-phase long-context...

by deepseek·164K context

text

DeepSeek: DeepSeek V3.1 Terminus

$0.270/M input

$0.950/M output

DeepSeek-V3.1 Terminus is an update to [DeepSeek V3.1](/deepseek/deepseek-chat-v3.1) that maintains the model's original capabilities while addressing issues reported by users, including language consistency and agent capabilities, further optimizing the model's...

by deepseek·164K context

text

DeepSeek: DeepSeek V3.2

$0.229/M input

$0.343/M output

DeepSeek-V3.2 is a large language model designed to harmonize high computational efficiency with strong reasoning and agentic tool-use performance. It introduces DeepSeek Sparse Attention (DSA), a fine-grained sparse attention mechanism...

by deepseek·131K context

text

DeepSeek: DeepSeek V3.2 Exp

$0.270/M input

$0.410/M output

DeepSeek-V3.2-Exp is an experimental large language model released by DeepSeek as an intermediate step between V3.1 and future architectures. It introduces DeepSeek Sparse Attention (DSA), a fine-grained sparse attention mechanism...

by deepseek·164K context

text

DeepSeek: R1

$0.700/M input

$2.50/M output

DeepSeek R1 is here: Performance on par with [OpenAI o1](/openai/o1), but open-sourced and with fully open reasoning tokens. It's 671B parameters in size, with 37B active in an inference pass....

by deepseek·164K context

text

DeepSeek: R1 0528

$0.500/M input

$2.15/M output

May 28th update to the [original DeepSeek R1](/deepseek/deepseek-r1). Performance on par with [OpenAI o1](/openai/o1), but open-sourced and with fully open reasoning tokens. It's 671B parameters in size, with 37B active...

by deepseek·164K context

text

DeepSeek: R1 Distill Llama 70B

$0.800/M input

$0.800/M output

DeepSeek R1 Distill Llama 70B is a distilled large language model based on [Llama-3.3-70B-Instruct](/meta-llama/llama-3.3-70b-instruct), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). The model combines advanced distillation techniques to achieve high performance across...

by deepseek·128K context

text
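
The per-million-token prices listed above can be turned into a rough per-request estimate. The sketch below is illustrative only: it assumes billing is strictly linear in token count (no caching discounts, minimums, or provider-specific surcharges), and the `PRICES_PER_MILLION` table and `estimate_cost` helper are hypothetical names, not part of any API. The dictionary keys are display labels copied from the listing, not model identifiers.

```python
from decimal import Decimal

# Per-million-token prices in USD, copied from the listing above.
# Keys are display labels for illustration, not API model identifiers.
PRICES_PER_MILLION = {
    "DeepSeek V3.2": (Decimal("0.229"), Decimal("0.343")),
    "DeepSeek R1": (Decimal("0.700"), Decimal("2.50")),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request, assuming linear per-token billing."""
    input_price, output_price = PRICES_PER_MILLION[model]
    million = Decimal(1_000_000)
    return (input_price * input_tokens + output_price * output_tokens) / million


# Example: 10,000 input tokens and 2,000 output tokens on DeepSeek V3.2.
print(estimate_cost("DeepSeek V3.2", 10_000, 2_000))
```

`Decimal` is used instead of `float` so that small per-token amounts do not pick up binary rounding error when summed across many requests.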