MiniMaxOpen-weight·1000K context·456B params

MiniMax-Text-01minimax/minimax-text-01

MiniMax-Text-01 is a powerful language model with 456 billion total parameters, of which 45.9 billion are activated per token. The model adopts a hybrid architecture combining Lightning Attention, Softmax Attention and Mixture-of-Experts (MoE). With an extended training context length of 1 million tokens, it can handle up to 4 million tokens during inference. The model features 80 layers with 64 attention heads, hybrid attention mechanism (softmax attention after every 7 lightning attention), 32 experts with top-2 routing strategy, and supports function calling with OpenAI-compatible format.

Pricing across providers

No pricing data yet. Verification in progress — check back soon.

Capabilities

  • text-generation
  • conversational
  • function-calling

Languages: en

Benchmarks

  • MMLU · official · source
    8850.0%
  • HumanEval · official · source
    8690.0%

Code samples

Technical specs

Context
1000K
Max output
Parameters
456B
Release
2025-01-12
Training cutoff
License

Similar models

No similar models in our directory yet.

Frequently asked

How much does MiniMax-Text-01 cost?+
No public pricing listed yet. Check back — we verify provider pricing on a 14-day cadence.
How do I access MiniMax-Text-01 from outside China?+
All hostings listed above support global access. The official API (e.g. api.deepseek.com, dashscope-intl.aliyuncs.com) accepts international credit cards and does not require a Chinese mobile number. For privacy-sensitive workloads, third-party aggregators like Together.ai host the model on US/EU infrastructure.
Is MiniMax-Text-01 open-source? Can I fine-tune it?+
Yes. MiniMax-Text-01 is open-weight under the unspecified license. Weights are available on Hugging Face for local inference, fine-tuning, and commercial use (see license for specific terms).
Is MiniMax-Text-01 OpenAI-compatible?+
Most listed hostings expose an OpenAI-compatible API, so you can point an existing openai SDK client at the Provider's base_url and use the Provider's model name. See the Code Samples above for a copy-pasteable example.
What's the maximum context window for MiniMax-Text-01?+
The model supports up to 1,000,000 tokens of context (input + output). Some hosted versions may impose a smaller limit — check the "Context" column in the pricing matrix for each provider.