Zhipu AIProprietary·131K context· params

GLM-4-Airzhipu/glm-4-air

GLM-4-Air is Zhipu's cost-efficient tier — 10-20x cheaper than GLM-4-Plus with ~80% of the quality on most workloads. Good default for high-volume production traffic.

Cheapest blended:$0.07 / 1M tokenson Zhipu AI · 1 provider listed

Pricing across providers

Sort by:
ProviderInput /1MOutput /1MBlended /1MLatency p50FormatFreshnessAction
Zhipu AI
glm-4-air
$0.07$0.07$0.07OpenAI-compatibleVerified 2d agoTry →

Affiliate disclosure: We may earn a commission from qualified signups. Pricing independence is enforced at the data layer — see our Editorial Independence Policy.

Works with

Point any of these clients at a hosting's base URL — they all speak at least one of this model's endpoint protocols (OPENAI_COMPATIBLE).

Capabilities

  • chat
  • reasoning
  • code

Languages: en, zh

Code samples

Example using Zhipu AI — the cheapest hosting for this model as of last verification. Swap base_url and model to use a different provider from the matrix above.

from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://api.example.com/v1",
)

response = client.chat.completions.create(
    model="glm-4-air",
    messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)

Technical specs

Context
131K
Max output
4K
Parameters
Release
Training cutoff
License

Similar models

Compare with

  • GLM-4-Air vs DeepSeek Chat V3.1
    Comparison planned — not yet published
  • GLM-4-Air vs DeepSeek R1
    Comparison planned — not yet published
  • GLM-4-Air vs DeepSeek V3
    Comparison planned — not yet published

Frequently asked

How much does GLM-4-Air cost?+
The cheapest public hosting is $0.07 per 1M blended tokens on Zhipu AI. 1 total providers are listed above with per-input / per-output / cached pricing.
How do I access GLM-4-Air from outside China?+
All hostings listed above support global access. The official API (e.g. api.deepseek.com, dashscope-intl.aliyuncs.com) accepts international credit cards and does not require a Chinese mobile number. For privacy-sensitive workloads, third-party aggregators like Together.ai host the model on US/EU infrastructure.
Is GLM-4-Air open-source?+
No. GLM-4-Air is a proprietary model from Zhipu AI. Access is only via API; weights are not published.
Is GLM-4-Air OpenAI-compatible?+
Most listed hostings expose an OpenAI-compatible API, so you can point an existing openai SDK client at the Provider's base_url and use the Provider's model name. See the Code Samples above for a copy-pasteable example.
What's the maximum context window for GLM-4-Air?+
The model supports up to 131,072 tokens of context (input + output). Some hosted versions may impose a smaller limit — check the "Context" column in the pricing matrix for each provider.