QwenOpen-weight·33K context·122B params·apache-2.0

Qwen3.5-122B-A10Balibaba/qwen3.5-122b-a10b

Qwen3.5-122B-A10B is a multimodal foundation model featuring unified vision-language capabilities, efficient hybrid architecture with Gated Delta Networks and sparse Mixture-of-Experts for high-throughput inference, and support for 201 languages. Delivers exceptional performance in reasoning, coding, agents, and visual understanding benchmarks.

Pricing across providers

No pricing data yet. Verification in progress — check back soon.

Capabilities

  • image-text-to-text
  • conversational
  • reasoning
  • coding
  • visual understanding
  • agents

Languages: english, chinese, 201 languages and dialects

Code samples

Technical specs

Context
33K
Max output
Parameters
122B
Release
Training cutoff
License
apache-2.0

Similar models

Compare with

  • Qwen3.5-122B-A10B vs Qwen3.5-122B-A10B
    Comparison planned — not yet published

Frequently asked

How much does Qwen3.5-122B-A10B cost?+
No public pricing listed yet. Check back — we verify provider pricing on a 14-day cadence.
How do I access Qwen3.5-122B-A10B from outside China?+
All hostings listed above support global access. The official API (e.g. api.deepseek.com, dashscope-intl.aliyuncs.com) accepts international credit cards and does not require a Chinese mobile number. For privacy-sensitive workloads, third-party aggregators like Together.ai host the model on US/EU infrastructure.
Is Qwen3.5-122B-A10B open-source? Can I fine-tune it?+
Yes. Qwen3.5-122B-A10B is open-weight under the apache-2.0 license. Weights are available on Hugging Face for local inference, fine-tuning, and commercial use (see license for specific terms).
Is Qwen3.5-122B-A10B OpenAI-compatible?+
Most listed hostings expose an OpenAI-compatible API, so you can point an existing openai SDK client at the Provider's base_url and use the Provider's model name. See the Code Samples above for a copy-pasteable example.
What's the maximum context window for Qwen3.5-122B-A10B?+
The model supports up to 32,768 tokens of context (input + output). Some hosted versions may impose a smaller limit — check the "Context" column in the pricing matrix for each provider.