Qwen 3 and the Chinese Model Wave: DeepSeek, Moonshot, Ernie, Zhipu
Alibaba Qwen 3, DeepSeek R1, Moonshot Kimi K2, Baidu Ernie X1, and Zhipu ChatGLM. Chinese open-weights models reached the global frontier in 2025.
The single most consequential structural shift in the frontier-model landscape during 2025 was the rise of Chinese open-weights models to genuine global frontier capability. DeepSeek R1 landed in January 2025 and produced a market-moving moment — Nvidia’s stock dropped meaningfully on the news, the framing of American AI dominance was openly questioned, and the practical question of whether export controls had succeeded shifted from settled to unsettled overnight. Alibaba’s Qwen 3 release in April 2025 deepened the picture. Moonshot’s Kimi K2, Baidu’s Ernie X1, Zhipu’s ChatGLM iterations, and a long tail of smaller Chinese labs filled out a model wave that has fundamentally reshaped what global open-weights AI looks like in 2026.
This piece is the enterprise reading of where Chinese models sit, what they actually deliver, and how the export-control story has played out.
Qwen 3 and the Alibaba family#
Alibaba released Qwen 3 in April 2025 across a wide range of sizes from 600M parameters at the small end up to 235B parameters in the largest variant, with a mixture-of-experts architecture in the larger sizes. The Qwen 3 235B-A22B model — 235 billion total parameters with 22 billion active — was the headline variant, with benchmark results positioning it as competitive with Claude Sonnet 4 and GPT-5 on many tasks at the time of release. The Apache 2.0 licensing across the family made the weights immediately deployable for commercial enterprise use without the license friction that Llama imposes.

The follow-up Qwen 3 Coder, the QwQ reasoning variants, and the Qwen 3 VL multimodal models filled out the product line. By the end of 2025 the Qwen family was the most-downloaded model family on Hugging Face by a meaningful margin, including downloads of the smaller variants for fine-tuning derivative work. The community ecosystem around Qwen has matured into something genuinely larger than what surrounds any other open-weights model family from any region.
DeepSeek R1 and the January 2025 moment#
DeepSeek R1 was released on January 20, 2025 by High-Flyer-backed DeepSeek AI in Hangzhou. The model was a reasoning-tuned variant built on DeepSeek V3 — a 671 billion parameter mixture-of-experts model with 37 billion active parameters — and the published benchmarks put it as competitive with OpenAI o1 on AIME mathematics, on Codeforces coding, and on the major reasoning benchmarks. The technical paper accompanying the release documented a reinforcement-learning training approach that DeepSeek argued produced comparable reasoning capability at substantially lower training cost than the dominant Western frontier-model approaches.
The market reaction was sharp. Nvidia’s stock dropped roughly 17% on the Monday following the release as the AI-infrastructure bull narrative was openly questioned. The “is the moat real” conversation that had been a niche concern through 2024 became a mainstream procurement question. The cost claims in the DeepSeek paper were partially walked back by subsequent independent analysis — the full training cost including the prior runs to produce V3 was meaningfully larger than the marginal R1 cost the paper had emphasized — but the underlying point that Chinese labs could produce reasoning-class models at meaningful cost discounts had been established and did not go away.
Moonshot Kimi K2, Baidu Ernie X1, Zhipu ChatGLM#
The Chinese model wave was wider than Alibaba and DeepSeek. Moonshot’s Kimi K2 release in July 2025 brought a 1-trillion-parameter mixture-of-experts model with strong agentic-task performance and a long context window. The Moonshot product Kimi has become one of the most popular consumer AI chat products in China and has begun making inroads outside China through the open-weights release. Baidu released Ernie X1 in March 2025 as the reasoning-tuned variant of the Ernie 4.5 family — Baidu’s main internal frontier-model line — with strong Chinese-language performance and competitive benchmarks on the international tests.
Zhipu AI’s ChatGLM iterations through 2025 brought the GLM-4-Plus and the follow-up GLM-Z1 reasoning variants. The Zhipu position in the market has been distinctive — closer to a research-and-enterprise lab than a consumer-products company, with a strong presence in Chinese government and state-owned-enterprise deployments. 01.AI, Tencent’s Hunyuan family, ByteDance’s Doubao, and a long tail of smaller labs round out the broader picture.
The benchmark leadership question#
The honest reading of benchmark leadership in 2026 is that Chinese open-weights models routinely top the open-weights leaderboards on most major benchmarks, with Qwen 3 235B and DeepSeek R1 trading positions on individual benchmarks. The gap with closed frontier models — Claude Opus 4.5, GPT-5 Pro, Gemini 3.0 — exists but is narrower than the Western frontier-model marketing positions admit. On many production-relevant benchmarks the gap is within the margin of error.
The benchmarks where the gap is meaningful are the most demanding reasoning tasks, the longest-context agentic workloads, and the multimodal-with-tool-use compositions where the closed frontier models still hold a real lead. For the bulk of routine production AI work — RAG, classification, extraction, summarization, structured generation, code completion — Qwen 3 and DeepSeek R1 are competitive in absolute quality and meaningfully better on cost.
The export control collateral#
The US export control regime targeting Chinese access to leading-edge GPUs has produced effects that diverge from the original policy intent. The intended effect was to slow Chinese frontier-model training. The actual effect has been more complicated. The restrictions tightened through 2024 and 2025 with the H800 and A800 variants Nvidia originally produced for the Chinese market falling under restriction and the follow-up H20 variant being further constrained. The Chinese labs adapted by using older accumulated GPU capacity more efficiently, by training across larger clusters with weaker individual GPUs, by aggressive algorithmic optimization, and by domestic-silicon alternatives including Huawei Ascend chips.

The DeepSeek R1 training-efficiency claims sit at the center of this story. The narrative that Chinese labs were producing frontier-class models on constrained hardware is partially true, partially marketing, and structurally important regardless of the specifics. The export-control framework was designed under the assumption that compute was the binding constraint on frontier-model development. The 2025 evidence has been that algorithmic improvement and training-method innovation can substantially offset compute disadvantages, which weakens the strategic premise of the restrictions.
Enterprise adoption picture#
The enterprise adoption picture for Chinese models in 2026 is complicated by the geopolitical layer. For enterprises in the West — particularly in defense, finance, and healthcare — deploying Chinese-trained models in production carries political and procurement-policy concerns that are real even when the underlying technical concerns are limited. The open-weights nature of Qwen 3 and DeepSeek R1 mitigates some of this — the weights can be inspected, the models can be run on inspected infrastructure, the inference is local rather than going back to Alibaba or DeepSeek — but the procurement-narrative friction remains.
For enterprises in the rest of the world the picture is meaningfully different. Asian markets including Southeast Asia, the Middle East, Latin America, and Africa have adopted Chinese open-weights models broadly. The Hugging Face download numbers for Qwen 3 and DeepSeek R1 reflect this global adoption pattern. For European customers the picture sits between American and rest-of-world — usable for many workloads, problematic for some specific regulated and sovereignty-sensitive deployments.
Where pdpspectra fits#
Our AI and LLM integration practice routinely evaluates Chinese open-weights models for client engagements where the unit economics matter and where the political and procurement-policy posture allows. Qwen 3 is our typical first recommendation for high-volume routine inference workloads where the cost savings against closed-frontier alternatives compound meaningfully. DeepSeek R1 is the right choice for reasoning-heavy workloads where the cost of running the equivalent closed-frontier reasoning tier is prohibitive.
Related reading: Llama 4 open source reality, Mistral Large 3, and open-source LLMs in production.
Closing#
The Chinese model wave is one of the most consequential developments in the global AI landscape and is meaningfully under-discussed in Western enterprise procurement conversations. Qwen 3, DeepSeek R1, Kimi K2, Ernie X1, and the broader family have reached frontier-class capability at meaningful cost discounts. The export-control story has produced effects that diverge from policy intent. The open-weights nature of the most-capable Chinese models makes them genuinely deployable inside Western enterprise infrastructure where procurement policy allows.
For enterprises building production AI in 2026, ignoring the Chinese open-weights option is a procurement mistake. Talk to our team about your model strategy.