Chinese long-context models

Models that stay accurate at context lengths of 128K tokens and beyond, such as Kimi, GLM-Long, and Qwen2.5-1M. Useful for long-document Q&A, whole-repo code review, or multi-hour transcripts where chunking would lose the through-line.

| Model | Creator | Context (tokens) | Open weight | Tags |
| --- | --- | --- | --- | --- |
| Qwen3 72B Instruct | Alibaba | 131K | Yes | chat · reasoning · multilingual |
| DeepSeek V3 | DeepSeek AI | 66K | Yes | reasoning · coding · tool_use |
| Llama 3.3 70B Instruct | Meta | 131K | Yes | chat · coding · reasoning |
| MiniMax M2.7 | MiniMax | 197K | Yes | chat · long_context |
| Kimi K2.5 | Moonshot AI | 262K | Yes | chat · reasoning · code |
| Kimi K2.6 | Moonshot AI | 262K | Yes | chat · reasoning · code |
| GLM 5 | Zhipu AI | 203K | Yes | reasoning · coding · tool_calling |
| GLM 5.1 | Zhipu AI | 203K | Yes | chat · reasoning · long_context |
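
To decide whether a document can go to one of the listed models in a single pass instead of being chunked, a rough sketch like the following may help. It uses the context sizes from the listing above and makes several assumptions that are not from this page: "K" is read as 1,000 tokens, token counts are estimated at about 4 characters per token (real tokenizers differ, and Chinese text often tokenizes more densely), and the helper names `estimate_tokens` and `models_that_fit` are hypothetical.

```python
# Context windows copied from the listing above.
# Assumption: "K" means 1,000 tokens (some providers mean 1,024).
CONTEXT_WINDOWS = {
    "Qwen3 72B Instruct": 131_000,
    "DeepSeek V3": 66_000,
    "Llama 3.3 70B Instruct": 131_000,
    "MiniMax M2.7": 197_000,
    "Kimi K2.5": 262_000,
    "Kimi K2.6": 262_000,
    "GLM 5": 203_000,
    "GLM 5.1": 203_000,
}

# Assumption: crude heuristic, not a real tokenizer.
CHARS_PER_TOKEN = 4


def estimate_tokens(text: str) -> int:
    """Estimate the token count by ceiling-dividing the character count."""
    return -(-len(text) // CHARS_PER_TOKEN)


def models_that_fit(text: str, reserve_for_output: int = 4_000) -> list[str]:
    """Return listed models whose window holds the text plus an output budget."""
    needed = estimate_tokens(text) + reserve_for_output
    return [name for name, window in CONTEXT_WINDOWS.items() if window >= needed]


if __name__ == "__main__":
    transcript = "x" * 600_000  # stand-in for a long transcript
    print(estimate_tokens(transcript))
    print(models_that_fit(transcript))
```

For a real decision, the estimate should be replaced with the target model's own tokenizer, since a heuristic count can be off by a wide margin.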

Other capabilities