Alibaba CloudProprietary·262K context·— params
Qwen3 Max Thinkingalibaba/qwen3-max-thinking
Qwen3-Max-Thinking is the flagship reasoning model in the Qwen3 series, designed for high-stakes cognitive tasks that require deep, multi-step reasoning. By significantly scaling model capacity and reinforcement learning compute, it...
Cheapest blended:$1.56 / 1M tokenson Alibaba Cloud DashScope · 1 provider listed
Pricing across providers
Sort by:
| Provider | Input /1M | Output /1M | Blended /1M | Latency p50 | Format | Freshness | Action |
|---|---|---|---|---|---|---|---|
| Alibaba Cloud DashScope qwen3-max-thinking | $0.78 | $3.90 | $1.56 | — | OpenAI-compatible | Needs 27d ago | Try → |
Affiliate disclosure: We may earn a commission from qualified signups. Pricing independence is enforced at the data layer — see our Editorial Independence Policy.
Works with
Point any of these clients at a hosting's base URL — they all speak at least one of this model's endpoint protocols (OPENAI_COMPATIBLE).
Capabilities
Code samples
Example using Alibaba Cloud DashScope — the cheapest hosting for this model as of last verification. Swap base_url and model to use a different provider from the matrix above.
from openai import OpenAI
client = OpenAI(
api_key="YOUR_API_KEY",
base_url="https://dashscope.aliyun.com/compatible-mode/v1",
)
response = client.chat.completions.create(
model="qwen3-max-thinking",
messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)
Technical specs
- Context
- 262K
- Max output
- 33K
- Parameters
- —
- Release
- —
- Training cutoff
- —
- License
- —
Similar models
Compare with
- Qwen3 Max Thinking vs DeepSeek V3Comparison planned — not yet published
- Qwen3 Max Thinking vs DeepSeek V3 0324Comparison planned — not yet published
- Qwen3 Max Thinking vs DeepSeek V3.1Comparison planned — not yet published
Frequently asked
How much does Qwen3 Max Thinking cost?+−
The cheapest public hosting is $1.56 per 1M blended tokens on Alibaba Cloud DashScope. 1 total providers are listed above with per-input / per-output / cached pricing.
How do I access Qwen3 Max Thinking from outside China?+−
All hostings listed above support global access. The official API (e.g. api.deepseek.com, dashscope-intl.aliyuncs.com) accepts international credit cards and does not require a Chinese mobile number. For privacy-sensitive workloads, third-party aggregators like Together.ai host the model on US/EU infrastructure.
Is Qwen3 Max Thinking open-source?+−
No. Qwen3 Max Thinking is a proprietary model from Alibaba Cloud. Access is only via API; weights are not published.
Is Qwen3 Max Thinking OpenAI-compatible?+−
Most listed hostings expose an OpenAI-compatible API, so you can point an existing
openai SDK client at the Provider's base_url and use the Provider's model name. See the Code Samples above for a copy-pasteable example.What's the maximum context window for Qwen3 Max Thinking?+−
The model supports up to 262,144 tokens of context (input + output). Some hosted versions may impose a smaller limit — check the "Context" column in the pricing matrix for each provider.