MiniMaxOpen-weight·1000K context·456B params
MiniMax-Text-01minimax/minimax-text-01
MiniMax-Text-01 is a powerful language model with 456 billion total parameters, of which 45.9 billion are activated per token. The model adopts a hybrid architecture combining Lightning Attention, Softmax Attention and Mixture-of-Experts (MoE). With an extended training context length of 1 million tokens, it can handle up to 4 million tokens during inference. The model features 80 layers with 64 attention heads, hybrid attention mechanism (softmax attention after every 7 lightning attention), 32 experts with top-2 routing strategy, and supports function calling with OpenAI-compatible format.
Pricing across providers
No pricing data yet. Verification in progress — check back soon.
Capabilities
- text-generation
- conversational
- function-calling
Languages: en
Benchmarks
Code samples
Technical specs
- Context
- 1000K
- Max output
- —
- Parameters
- 456B
- Release
- 2025-01-12
- Training cutoff
- —
- License
- —
Similar models
No similar models in our directory yet.
Frequently asked
How much does MiniMax-Text-01 cost?+−
No public pricing listed yet. Check back — we verify provider pricing on a 14-day cadence.
How do I access MiniMax-Text-01 from outside China?+−
All hostings listed above support global access. The official API (e.g. api.deepseek.com, dashscope-intl.aliyuncs.com) accepts international credit cards and does not require a Chinese mobile number. For privacy-sensitive workloads, third-party aggregators like Together.ai host the model on US/EU infrastructure.
Is MiniMax-Text-01 open-source? Can I fine-tune it?+−
Yes. MiniMax-Text-01 is open-weight under the unspecified license. Weights are available on Hugging Face for local inference, fine-tuning, and commercial use (see license for specific terms).
Is MiniMax-Text-01 OpenAI-compatible?+−
Most listed hostings expose an OpenAI-compatible API, so you can point an existing
openai SDK client at the Provider's base_url and use the Provider's model name. See the Code Samples above for a copy-pasteable example.What's the maximum context window for MiniMax-Text-01?+−
The model supports up to 1,000,000 tokens of context (input + output). Some hosted versions may impose a smaller limit — check the "Context" column in the pricing matrix for each provider.