qwen3-235b by Alibaba Cloud

3 tracked versions of the qwen3-235b family. Timeline ordered newest → oldest. Flagship version marked.

Version timeline

  1. Qwen3 235B A22BOpen-weight
    unknown
    Version
    Context
    131K
    Parameters
    License

    Qwen3-235B-A22B is a 235B parameter mixture-of-experts (MoE) model developed by Qwen, activating 22B parameters per forward pass. It supports seamless switching between a "thinking" mode for complex reasoning, math, and...

  2. unknown
    Version
    2507
    Context
    262K
    Parameters
    License

    Qwen3-235B-A22B-Instruct-2507 is a multilingual, instruction-tuned mixture-of-experts language model based on the Qwen3-235B architecture, with 22B active parameters per forward pass. It is optimized for general-purpose text generation, including instruction following,...

  3. Version
    2507
    Context
    131K
    Parameters
    License

    Qwen3-235B-A22B-Thinking-2507 is a high-performance, open-weight Mixture-of-Experts (MoE) language model optimized for complex reasoning tasks. It activates 22B of its 235B parameters per forward pass and natively supports up to 262,144...

About Alibaba Cloud

See every provider hosting Alibaba Cloud models at /provider/alibaba.