Alibaba Cloud · Proprietary · 1000K context · parameters not listed

Qwen3.5-Flash (alibaba/qwen3-5-flash-02-23)

The Qwen3.5 native vision-language Flash models are built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. Compared to the...

Cheapest blended: $0.11 / 1M tokens on Alibaba Cloud DashScope · 1 provider listed

Pricing across providers

Provider	Input /1M	Output /1M	Blended /1M	Latency p50	Format	Freshness	Action

1 hosting is hidden: prices older than 60 days are pending re-verification.
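The "Blended" column combines the input and output prices into a single figure. The page does not state its weighting, so the sketch below assumes a 3:1 input-to-output token ratio, and the prices passed in the usage line are hypothetical placeholders rather than DashScope's actual rates.

```python
def blended_price(input_per_m: float, output_per_m: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens.

    The 3:1 default weighting is an assumption, not this page's documented formula.
    """
    total_weight = input_weight + output_weight
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive number")
    return (input_per_m * input_weight + output_per_m * output_weight) / total_weight


def request_cost(input_tokens: int, output_tokens: int,
                 input_per_m: float, output_per_m: float) -> float:
    """Dollar cost of one request, given per-1M-token prices."""
    return (input_tokens * input_per_m + output_tokens * output_per_m) / 1_000_000


if __name__ == "__main__":
    # Hypothetical prices, for illustration only.
    print(f"${blended_price(0.10, 0.40):.3f} per 1M blended tokens")
```

For a real workload, request_cost with your own input and output token counts is usually more informative than a blended figure, since the actual ratio varies by application.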

Affiliate disclosure: We may earn a commission from qualified signups. Pricing independence is enforced at the data layer — see our Editorial Independence Policy.

Works with

Point any of these clients at a hosting's base URL — they all speak at least one of this model's endpoint protocols (OPENAI_COMPATIBLE).

Code samples

Example using Alibaba Cloud DashScope — the cheapest hosting for this model as of last verification. Swap base_url and model to use a different provider from the matrix above.

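from openai import OpenAI

# DashScope's OpenAI-compatible endpoint; replace the placeholder with your own key.
client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://dashscope.aliyuncs.com/compatible-mode/v1",
)

# Send a single-turn chat request and print the model's reply.
response = client.chat.completions.create(
    model="qwen3.5-flash-02-23",
    messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)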
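Because the model is described as a native vision-language model, a request can in principle carry an image alongside the text. The sketch below uses the OpenAI chat format's image_url content part. Whether this hosting accepts that part for this model is an assumption, and build_vision_messages and describe_image are hypothetical helper names.

```python
def build_vision_messages(prompt: str, image_url: str) -> list:
    """Build a single user message holding one text part and one image part."""
    return [
        {
            "role": "user",
            "content": [
                {"type": "text", "text": prompt},
                {"type": "image_url", "image_url": {"url": image_url}},
            ],
        }
    ]


def describe_image(client, prompt: str, image_url: str,
                   model: str = "qwen3.5-flash-02-23") -> str:
    """Send the image and prompt through an already configured OpenAI client.

    `client` is expected to be an openai.OpenAI instance created as in the
    sample above; image support on this endpoint is assumed, not verified.
    """
    response = client.chat.completions.create(
        model=model,
        messages=build_vision_messages(prompt, image_url),
    )
    return response.choices[0].message.content
```

With the client from the sample above, describe_image(client, "What is in this picture?", "https://example.com/cat.png") would return the model's text reply. The URL here is a placeholder.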

Technical specs

Context: 1000K
Max output: 66K
Parameters: —
Release: —
Training cutoff: —
License: —

Compare with

Qwen3.5-Flash vs GLM 4.5V
Comparison planned — not yet published
Qwen3.5-Flash vs GLM 4.6V
Comparison planned — not yet published
Qwen3.5-Flash vs GLM 5V Turbo
Comparison planned — not yet published

Frequently asked

How much does Qwen3.5-Flash cost?

The cheapest public hosting is $0.11 per 1M blended tokens on Alibaba Cloud DashScope. 1 provider is listed above with per-input / per-output / cached pricing.

How do I access Qwen3.5-Flash from outside China?

All hostings listed above support global access. The official international API (dashscope-intl.aliyuncs.com) accepts international credit cards and does not require a Chinese mobile number. For privacy-sensitive workloads, check whether a third-party aggregator such as Together.ai hosts the model on US/EU infrastructure; this page currently lists one provider.
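A minimal sketch of pointing a client at the international host named above. The /compatible-mode/v1 path is carried over from the code sample and is assumed to be the same on the international host. DASHSCOPE_API_KEY and DASHSCOPE_HOST are hypothetical environment variable names chosen for this example.

```python
import os

COMPAT_PATH = "/compatible-mode/v1"        # path from the code sample; assumed identical on this host
INTL_HOST = "dashscope-intl.aliyuncs.com"  # international host named in the answer above


def dashscope_base_url(host: str = INTL_HOST) -> str:
    """Return the OpenAI-compatible base URL for a given DashScope host."""
    return f"https://{host}{COMPAT_PATH}"


def client_settings(env=None) -> dict:
    """Collect OpenAI client keyword arguments from environment variables."""
    env = os.environ if env is None else env
    return {
        "api_key": env.get("DASHSCOPE_API_KEY", ""),
        "base_url": dashscope_base_url(env.get("DASHSCOPE_HOST", INTL_HOST)),
    }
```

The result can be passed straight to the SDK, for example OpenAI(**client_settings()), so switching hosts only requires changing an environment variable.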

Is Qwen3.5-Flash open-source?

No. Qwen3.5-Flash is a proprietary model from Alibaba Cloud. Access is only via API; weights are not published.

Is Qwen3.5-Flash OpenAI-compatible?

Most listed hostings expose an OpenAI-compatible API, so you can point an existing openai SDK client at the provider's base_url and use the provider's model name. See the code sample above for a copy-pasteable example.

What's the maximum context window for Qwen3.5-Flash?

The model supports up to 1,000,000 tokens of context (input + output), with output capped at 66K tokens. Some hosted versions may impose a smaller limit, so check each provider's documentation before relying on the full window.
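Since the window covers input and output together, the room left for the prompt depends on how much output is reserved. The sketch below works through that arithmetic. It treats "66K" as exactly 66,000 tokens, which is an assumption because the page gives only the rounded figure, and max_input_tokens is a hypothetical helper name.

```python
CONTEXT_WINDOW = 1_000_000  # total tokens (input + output), from the specs above
MAX_OUTPUT = 66_000         # "66K" in the specs; the exact value is assumed


def max_input_tokens(requested_output: int,
                     context_window: int = CONTEXT_WINDOW,
                     max_output: int = MAX_OUTPUT) -> int:
    """Tokens left for the prompt after reserving room for the reply."""
    if requested_output < 0:
        raise ValueError("requested_output must be non-negative")
    reserved = min(requested_output, max_output)  # output cannot exceed the cap
    return context_window - reserved


def fits(input_tokens: int, requested_output: int) -> bool:
    """True if the prompt plus the reserved output stays inside the window."""
    return input_tokens <= max_input_tokens(requested_output)
```

A provider with a smaller window can be modelled by passing its own context_window and max_output values.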