QwenOpen-weight·33K context·122B params·apache-2.0
Qwen3.5-122B-A10Balibaba/qwen3.5-122b-a10b
Qwen3.5-122B-A10B is a multimodal foundation model featuring unified vision-language capabilities, efficient hybrid architecture with Gated Delta Networks and sparse Mixture-of-Experts for high-throughput inference, and support for 201 languages. Delivers exceptional performance in reasoning, coding, agents, and visual understanding benchmarks.
Pricing across providers
No pricing data yet. Verification in progress — check back soon.
Capabilities
- image-text-to-text
- conversational
- reasoning
- coding
- visual understanding
- agents
Languages: english, chinese, 201 languages and dialects
Code samples
Technical specs
- Context
- 33K
- Max output
- —
- Parameters
- 122B
- Release
- —
- Training cutoff
- —
- License
- apache-2.0
Similar models
Compare with
- Qwen3.5-122B-A10B vs Qwen3.5-122B-A10BComparison planned — not yet published
Frequently asked
How much does Qwen3.5-122B-A10B cost?+−
No public pricing listed yet. Check back — we verify provider pricing on a 14-day cadence.
How do I access Qwen3.5-122B-A10B from outside China?+−
All hostings listed above support global access. The official API (e.g. api.deepseek.com, dashscope-intl.aliyuncs.com) accepts international credit cards and does not require a Chinese mobile number. For privacy-sensitive workloads, third-party aggregators like Together.ai host the model on US/EU infrastructure.
Is Qwen3.5-122B-A10B open-source? Can I fine-tune it?+−
Yes. Qwen3.5-122B-A10B is open-weight under the apache-2.0 license. Weights are available on Hugging Face for local inference, fine-tuning, and commercial use (see license for specific terms).
Is Qwen3.5-122B-A10B OpenAI-compatible?+−
Most listed hostings expose an OpenAI-compatible API, so you can point an existing
openai SDK client at the Provider's base_url and use the Provider's model name. See the Code Samples above for a copy-pasteable example.What's the maximum context window for Qwen3.5-122B-A10B?+−
The model supports up to 32,768 tokens of context (input + output). Some hosted versions may impose a smaller limit — check the "Context" column in the pricing matrix for each provider.