deepseek-r1 by DeepSeek

4 tracked versions of the deepseek-r1 family. Timeline ordered newest → oldest. Flagship version marked.

Version timeline

  1. R1 0528Open-weight
    unknown
    Version
    0528
    Context
    164K
    Parameters
    License

    May 28th update to the [original DeepSeek R1](/deepseek/deepseek-r1) Performance on par with [OpenAI o1](/openai/o1), but open-sourced and with fully open reasoning tokens. It's 671B parameters in size, with 37B active...

  2. unknown
    Version
    Context
    131K
    Parameters
    License

    DeepSeek R1 Distill Llama 70B is a distilled large language model based on [Llama-3.3-70B-Instruct](/meta-llama/llama-3.3-70b-instruct), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). The model combines advanced distillation techniques to achieve high performance across...

  3. unknown
    Version
    Context
    33K
    Parameters
    License

    DeepSeek R1 Distill Qwen 32B is a distilled large language model based on [Qwen 2.5 32B](https://huggingface.co/Qwen/Qwen2.5-32B), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). It outperforms OpenAI's o1-mini across various benchmarks, achieving new...

  4. R1FlagshipOpen-weight
    2025-01-20
    Version
    Context
    64K
    Parameters
    671B
    License
    MIT

    DeepSeek R1 is here: Performance on par with [OpenAI o1](/openai/o1), but open-sourced and with fully open reasoning tokens. It's 671B parameters in size, with 37B active in an inference pass....

About DeepSeek

See every provider hosting DeepSeek models at /provider/deepseek.