If you're shipping a chat product with 100K+ daily active users, the LLM API bill stops being a rounding error and starts being a real line item on your P&L. This post walks the sub-$0.20 per 1M token tier of Chinese LLM APIs and tells you where the real bottom-of-market is in 2026 — and where it isn't quite as cheap as the headline sticker suggests.
| Model | Provider | Input / Output (per 1M) | Blended (ADR-001) | Overseas access |
|---|---|---|---|---|
| Doubao Lite 32K | ByteDance Doubao | $0.042 / $0.083 | $0.053 | No |
| GLM-4-Air | Zhipu AI | $0.07 / $0.07 | $0.07 | No |
| Yi Lightning | 01.AI | $0.14 / $0.14 | $0.14 | Yes |
| Doubao Pro 32K | ByteDance Doubao | $0.11 / $0.28 | $0.15 | No |
| DeepSeek V3 | DeepSeek | $0.27 / $1.10 | $0.48 | No |
The blended-price column uses ADR-001's formula — (input × 3 + output) ÷ 4 — so two models with different input/output asymmetries are comparable. The full pricing matrix sorts every tracked Chinese and global model by the same number.
At $0.042 / $0.083 per 1M tokens (roughly ¥0.3 / ¥0.6 in ByteDance's native CNY pricing), Doubao Lite 32K is the cheapest production-grade Chinese chat model tracked in this directory. The blended rate of 5.3 cents per 1M tokens means a chat session with 1K input + 500 output tokens costs approximately $0.000084 — roughly one ten-thousandth of a cent.
What you're actually getting:
Who should use it: high-volume consumer chat products operating inside mainland China where Doubao's internal-use-tested infrastructure (Douyin, Toutiao, Feishu all run on it) gives you confidence at scale.
Who shouldn't: anyone serving primarily US/EU users, anyone who needs strong coding or reasoning performance, anyone still evaluating between providers — Doubao Lite isn't where you want to benchmark model quality.
Zhipu's GLM-4-Air costs $0.07 / $0.07 per 1M tokens — symmetric pricing, so the blended rate is also $0.07. What makes Air interesting relative to Doubao Lite is the quality gap is narrower than the price gap implies:
Who should use it: teams that need a cheap chat tier AND might need 128K context for document Q&A. The extra 33% cost over Doubao Lite buys you meaningful capability headroom.
01.AI's Yi Lightning is the most expensive entry in this buyer's guide at $0.14 / $0.14 per 1M tokens, but it's the only model in the table that ships with an overseas endpoint. If you serve users globally and can't tolerate GFW latency spikes, Yi Lightning effectively wins by default — the cheaper models aren't accessible.
It's also:
Who should use it: any product serving users outside mainland China who wants Chinese-model quality without VPN / compliance headaches.
There are some real reasons to NOT reach for a sub-$0.20 tier:
See also: our comparison pages stack providers on these exact axes, and our LLM API hub groups every tracked chat-model provider by the same criteria.
If you're still unsure, load our pricing matrix with your estimated input:output ratio and let the blended sort do the work.
Last updated: 2026-04-22. All prices verified against provider docs; CNY-denominated prices converted at 7.2 USD/CNY. See our affiliate disclosure for how we monetize outbound links — commission rate never affects ranking.