Chinese reasoning models

Models that expose an explicit `<think>` or chain-of-thought trace — DeepSeek-R1, Qwen-QwQ, GLM-Z1. Best for math-heavy, multi-step, or agentic workloads where the reasoning trace is itself valuable. Includes tool-use / function-calling tuned variants.

ModelCreatorContextOpen weightTags
Yi Lightning01.AI16Kchat · reasoning
Qwen 2.5 MaxAlibaba Cloud33Kchat · reasoning · code
Qwen3 72B InstructAlibaba131KYeschat · reasoning · multilingual
Doubao Pro 32KByteDance33Kchat · reasoning · code
DeepSeek Chat V3.1DeepSeek131KYeschat · reasoning · code
DeepSeek R1DeepSeek AI66KYesreasoning · math · coding
DeepSeek V3DeepSeek AI66KYesreasoning · coding · tool_use
Llama 3.1 405B InstructMeta131KYeschat · reasoning · code
Llama 3.3 70B InstructMeta131KYeschat · coding · reasoning
GLM-4-AirZhipu AI131Kchat · reasoning · code

Other capabilities