deepseek-r1 by DeepSeek

4 tracked versions of the deepseek-r1 family. Timeline ordered newest → oldest. Flagship version marked.

Version timeline

R1 0528 (Open-weight)
Released: unknown
Version: 0528
Context: 164K
Parameters: not listed
License: not listed
May 28th update to the [original DeepSeek R1](/deepseek/deepseek-r1). Performance on par with [OpenAI o1](/openai/o1), but open-sourced and with fully open reasoning tokens. It's 671B parameters in size, with 37B active...
R1 Distill Llama 70B (Open-weight)
Released: unknown
Version: not listed
Context: 131K
Parameters: not listed
License: not listed
DeepSeek R1 Distill Llama 70B is a distilled large language model based on [Llama-3.3-70B-Instruct](/meta-llama/llama-3.3-70b-instruct), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). The model combines advanced distillation techniques to achieve high performance across...
R1 Distill Qwen 32B (Open-weight)
Released: unknown
Version: not listed
Context: 33K
Parameters: not listed
License: not listed
DeepSeek R1 Distill Qwen 32B is a distilled large language model based on [Qwen 2.5 32B](https://huggingface.co/Qwen/Qwen2.5-32B), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). It outperforms OpenAI's o1-mini across various benchmarks, achieving new...
R1 (Flagship, Open-weight)
Released: 2025-01-20
Version: not listed
Context: 64K
Parameters: 671B
License: MIT
DeepSeek R1 is here: Performance on par with [OpenAI o1](/openai/o1), but open-sourced and with fully open reasoning tokens. It's 671B parameters in size, with 37B active in an inference pass....

About DeepSeek

See every provider hosting DeepSeek models at /provider/deepseek.