AI Industry17 min read

China's AI Agent Era: How 140 Trillion Daily Tokens, New Regulations, and a DeepSeek Price War Are Reshaping Global AI

June 3, 2026·AI in China

*June 2026 marks a watershed moment for China's AI industry. New agent regulations, 140 trillion daily token consumption, and an aggressive price war led by DeepSeek are converging to reshape the global competitive landscape. (Photo: Unsplash)*

The Policy Pivot: China's Agent Framework

On June 1, 2026, three Chinese ministries jointly issued the "Implementation Opinions on Standardized Application and Innovative Development of AI Agents" — the most comprehensive national policy framework for AI agent governance to date. This was not a reactionary measure; it was a strategic declaration that China views the agentic AI paradigm as both inevitable and worthy of sovereign-level industrial planning.

The policy spans six major domains: technical standards, safety assessment, industry application, data governance, cross-border compliance, and talent development. For the first time, China has established a formal classification system for AI agents based on autonomy level, impact scope, and data sensitivity — a framework that closely resembles the EU's AI Act risk tiers but with distinctly Chinese characteristics.

Policy Domain	Key Requirement	Global Comparison
Agent Classification	Four-tier system (L1-L4) based on autonomy and decision impact	Similar to EU AI Act risk classes, but adds "social stability" as an explicit evaluation criterion
Data Governance	Agents handling >10M user records must use domestic cloud infrastructure	Stricter than EU data residency; comparable to China's existing cybersecurity law
Safety Assessment	Mandatory third-party auditing for L3-L4 agents before deployment	More prescriptive than US self-certification model
Cross-Border	Foreign agent services must register with national AI agency and pass content review	Extends existing app store registration requirements to agent platforms
Industry Application	Priority sectors: finance, healthcare, manufacturing, government	Aligns with China's existing "AI+" industrial policy
Innovation Incentives	Tax credits for agent R&D; reduced computing power costs for approved projects	More direct subsidies than US CHIPS Act approach

The timing is significant. The policy was released on the same day as NVIDIA CEO Jensen Huang's GTC Taipei keynote — a coincidence that Chinese state media explicitly framed as "two futures diverging." While Huang unveiled Vera CPUs and Isaac GROOT robot platforms, Beijing unveiled the rulebook for how those technologies would be regulated if they entered the Chinese market.

What makes this policy different from previous Chinese AI regulations is its pro-innovation posture. Unlike the 2024 draft measures that emphasized control and content moderation, the June 2026 framework explicitly encourages agent development in enterprise and industrial contexts. The message is clear: China wants to win the agentic AI race, but on its own terms.

The Infrastructure Arms Race: 140 Trillion Tokens

On May 30, 2026, the Inner Mongolia Token Exchange went live — China's first national green computing full-stack AI platform. The platform offers a one-stop marketplace for computing power scheduling, model API access, and token transaction settlement. It is the most concrete evidence yet that China views AI infrastructure as a strategic commodity worthy of state-level market architecture.

The timing of the platform's launch, just 48 hours before the agent policy announcement, suggests coordinated industrial planning. The exchange aggregates computing resources from three major telecom operators and multiple tech giants, supporting both domestic and international chip architectures. Its core function is to enable "elastic allocation, efficient scheduling, and intensive supply" of computing power — effectively creating a spot market for AI inference.

Platform Feature	Description	Strategic Implication
Heterogeneous Computing Aggregation	Integrates general-purpose, intelligent, and supercomputing resources from national nodes	Reduces vendor lock-in to any single chip architecture
Model Marketplace	10+ mainstream models pre-integrated, with pay-per-token pricing	Creates a "model supermarket" that commoditizes API access
Green Computing	Renewable energy-powered data centers in Inner Mongolia	Addresses the energy sustainability criticism of AI infrastructure
Token Settlement	Centralized billing and cross-platform token accounting	Enables enterprise procurement at scale with unified invoices
Domestic Chip Priority	First-class support for国产 (domestic) chips	Industrial policy tool for accelerating domestic semiconductor adoption

But the headline figure that dominates industry discussions is 140 trillion daily tokens — the total volume of AI inference requests processed across China's ecosystem. To put this in perspective, this volume is approximately 3.5× the combined daily token consumption of North America and Europe. The figure was disclosed by industry analysts at the 2026 World AI Conference in Tianjin and has since been corroborated by multiple cloud providers.

Region	Estimated Daily Tokens	Primary Use Cases	Growth Rate (YoY)
China	140 trillion	Agent automation, enterprise workflows, consumer chat	+340%
North America	28 trillion	Coding assistants, enterprise search, content generation	+180%
Europe	12 trillion	Regulatory compliance tools, multilingual services	+120%
Rest of World	15 trillion	Mixed	+95%

The 140 trillion figure is not merely a vanity metric. It reflects a fundamental difference in how Chinese users and enterprises deploy AI. While Western markets remain dominated by "chat" interfaces — single-turn or multi-turn conversations with language models — China's ecosystem has rapidly shifted to agentic workflows: AI systems that autonomously complete multi-step tasks, invoke tools, and interact with enterprise software.

This shift is why the new agent policy matters so much. China is not just regulating a technology; it is regulating an economy — an economy now measured in trillions of tokens per day.

The Price War: DeepSeek V4 Pro Cuts 75%

On June 1, 2026, DeepSeek announced a permanent 75% price reduction on V4 Pro API access — the most aggressive pricing move in the history of commercial large language models. The announcement, made via the company's developer portal without press fanfare, immediately reset pricing benchmarks across the global AI industry.

Model	Input Price (per 1M tokens)	Output Price (per 1M tokens)	Context Window	Notes
DeepSeek V4 Pro (new)	$0.10	$0.40	256K	75% price cut, permanent
DeepSeek V4 Pro (old)	$0.40	$1.60	256K	Previous pricing
OpenAI GPT-5.5 Instant	$0.50	$2.00	128K	Released same day (June 1)
Baidu Wenxin 5.1	$0.35	$1.40	200K	Announced June 1; free tier limited
Anthropic Claude Opus 4.8	$1.50	$7.50	200K	Premium positioning unchanged
Google Gemini 3.5 Pro	$0.30	$1.20	1M	Price competitive but not matching DeepSeek
Alibaba Qwen 3-Max	$0.20	$0.80	128K	Domestic Chinese pricing

The new DeepSeek pricing makes high-volume inference cheaper than cloud storage for equivalent data volumes. At $0.10 per million input tokens, a typical enterprise agent consuming 100 million tokens daily would pay just $10 — less than the cost of a single developer hour.

DeepSeek's move is widely interpreted as both a market-share grab and a strategic denial maneuver. By pricing below sustainable margins, DeepSeek makes it economically irrational for competitors to match without comparable cost structures. The company's moat is its proprietary training infrastructure — built on a mix of NVIDIA and domestic chips — which reportedly operates at 40% lower cost per FLOP than Western cloud-dependent labs.

Baidu responded within hours by announcing Wenxin 5.1, its next-generation flagship model. The announcement was notably defensive: Baidu positioned Wenxin 5.1 as "the model that understands Chinese business logic," implicitly conceding that it could not win on price alone. Baidu also announced that its previously free consumer tier would introduce usage caps — signaling that China's era of unlimited free AI inference is ending.

Company	Pricing Strategy	Likely Rationale
DeepSeek	Predatory pricing at 75% discount	Acquire developer ecosystem; lock in enterprise contracts before IPO
Baidu	Moderate pricing with feature differentiation	Preserve margins; leverage enterprise relationships and search integration
OpenAI	Premium pricing unchanged	Maintain brand positioning; focus on enterprise value rather than volume
Alibaba	Aggressive but sustainable	Cloud bundle strategy; use AI to drive compute consumption
ByteDance	Free consumer tier with ads	Data acquisition play; subsidize via advertising revenue

The price war has immediate implications for China's AI startup ecosystem. Founders report that investor diligence has shifted from "What's your model architecture?" to "What's your inference cost at DeepSeek pricing?" The commoditization of base model inference is accelerating the industry's move up the stack — toward agent frameworks, vertical applications, and proprietary data moats.

Global Context: NVIDIA GTC Taipei and the Divergence

The same June 1 morning that Beijing released its agent policy, Jensen Huang took the stage in Taipei for NVIDIA's GTC 2026 keynote. The juxtaposition could not have been more stark: two visions of AI's future, unveiled simultaneously on opposite sides of the Pacific.

Huang's announcements were substantial:

NVIDIA Announcement	Technical Significance	China Market Relevance
Vera CPU	Revolutionary ARM-based data center processor; "fastest ramping product in NVIDIA history"	Will face import restrictions; accelerates China's domestic CPU development urgency
Isaac GROOT Platform	Open humanoid robot development platform with simulation-to-reality pipeline	Directly competitive with Chinese robot companies (Unitree, Fourier, AgiBot)
NVIDIA-Microsoft AI PC	New PC category with on-device AI agent capabilities	Chinese PC makers (Lenovo, Huawei) must choose: adopt or build alternatives
Alpamayo 2	Open reasoning model optimized for robotaxi services	Baidu Apollo and Pony.ai are primary domestic competitors
NemoClaw (Foxconn pilot)	Enterprise agent platform for manufacturing workflows	Competes with Alibaba's Qianwen enterprise agents

Huang's most telling statement came near the end of his keynote: "NVIDIA has become an infrastructure company... not just a GPU company, not just a systems company, but an infrastructure company." This reframing — from component supplier to full-stack AI infrastructure provider — mirrors China's own strategy. Both sides are racing to own the full stack: chips, systems, software, and now agents.

But the divergence is equally clear. NVIDIA's vision is proprietary and vertically integrated: Vera CPUs, CUDA ecosystems, and NVIDIA-built AI PCs. China's vision, as articulated in the June 1 policy, is standardized and interoperable: mandatory open APIs for L3-L4 agents, domestic chip agnosticism, and state-managed marketplaces like the Inner Mongolia Token Exchange.

Dimension	NVIDIA/US Approach	China Approach
Hardware	Proprietary (Vera, Blackwell, CUDA lock-in)	Multi-vendor (NVIDIA + domestic chips); policy favors domestic
Software	Closed ecosystem (CUDA, TensorRT, Omniverse)	Standardized APIs mandated by regulation
Agents	Enterprise-led (Microsoft Copilot, NVIDIA Nemo)	State-industry collaborative framework
Pricing	Premium (high margins sustain R&D)	Commoditized (state-supported spot markets)
Data	Corporate ownership (enterprise data stays within vendor cloud)	National governance (data residency, content review)
Talent	Immigration-dependent (H-1B, O-1 visas)	Domestic pipeline expansion (universities adding AI majors)

The most immediate competitive flashpoint is robotics. Huang's Isaac GROOT platform — unveiled with fanfare and Foxconn partnership announcements — enters a market where Chinese companies have already shipped thousands of humanoid units. Unitree's G1, Fourier's GR-1, and AgiBot's products are already in commercial deployment, while NVIDIA's platform remains in pilot phase. The race is not who announces first; it is who scales first.

The Business Impact: What 140 Trillion Tokens Means

To understand why China's 140 trillion daily tokens matter for global business, it helps to translate the metric into operational reality.

Enterprise Scenario	Daily Token Volume	Monthly Cost (DeepSeek pricing)	Monthly Cost (OpenAI pricing)	Savings with DeepSeek
Small SaaS startup	10M tokens/day	$30	$150	80%
Mid-size fintech	500M tokens/day	$1,500	$7,500	80%
E-commerce platform	2B tokens/day	$6,000	$30,000	80%
National bank	20B tokens/day	$60,000	$300,000	80%
Government smart city	50B tokens/day	$150,000	$750,000	80%

At these volumes, AI inference is no longer a "software cost" — it is an infrastructure utility comparable to electricity or bandwidth. The Inner Mongolia Token Exchange formalizes this reality by creating a regulated marketplace where enterprises can hedge token costs, lock in computing reservations, and trade excess capacity.

For multinational corporations operating in China, the new agent policy introduces both opportunities and compliance burdens. The cross-border registration requirement means that foreign AI agent services — including Microsoft Copilot, Salesforce Agentforce, and ServiceNow — must now obtain explicit approval before deploying in the Chinese market. This creates a natural advantage for domestic platforms like Baidu's Qianfan, Alibaba's Bailian, and ByteDance's Volcano Engine.

Foreign Platform	China Market Status	Compliance Path Forward
Microsoft Copilot	Enterprise deployment ongoing; new registrations paused	Must establish domestic data residency and pass content review
Salesforce Agentforce	Limited pilot programs	Full registration required; likely partner with domestic cloud provider
OpenAI API	No direct access; all traffic via reverse proxies (grey market)	Explicitly non-compliant; enterprise use is legal risk
Anthropic Claude	No direct market access	Same as OpenAI; blocked by default
Google Gemini	Limited enterprise via Cloud China	Must register agent services separately from cloud

The compliance landscape is still evolving. Industry lawyers note that the "agent" definition in the June 1 policy is broad enough to capture everything from automated customer service chatbots to complex ERP automation workflows. The practical interpretation will depend on enforcement guidelines expected in Q3 2026.

The Road Ahead: What's Next for China's Agent Economy

Three developments in the coming months will determine whether June 2026 is remembered as a turning point or merely a headlines month.

1. DeepSeek's IPO Trajectory

The pricing war is widely interpreted as pre-IPO market positioning. DeepSeek's rumored $70 billion valuation — already supported by CATL's cross-industry investment — depends on demonstrating both massive user adoption and pricing power. The 75% price cut sacrifices short-term revenue for long-term ecosystem lock-in, a classic platform strategy. If DeepSeek files in Q3 or Q4 2026, the IPO will be the largest Chinese tech listing since Alibaba's 2014 debut.

2. Agent Standardization Wars

The June 1 policy mandates technical standards, but does not specify which ones. A standards war is already brewing between Baidu's agent protocol (optimized for search integration), Alibaba's (optimized for cloud commerce), and ByteDance's (optimized for content and social graphs). The ministry that controls the final standard — likely the Ministry of Industry and Information Technology (MIIT) — will effectively pick winners in China's agent infrastructure layer.

3. The US Response

Washington has not yet responded to China's agent policy or the Inner Mongolia exchange. Industry observers expect three possible reactions: expanded export controls on AI-relevant chips (already under consideration), a US equivalent to China's national agent framework (unlikely given partisan gridlock), or industry-led standardization via the IEEE or similar bodies. The most probable outcome is stalemate: two parallel ecosystems developing independently, with limited interoperability.

Timeline	Expected Development	Impact Level
June 2026	MIIT releases agent technical standard draft	High — determines which protocols become national standards
July 2026	Inner Mongolia exchange adds futures/derivatives for token pricing	Medium — introduces volatility management for enterprise buyers
Q3 2026	DeepSeek rumored IPO filing	High — validates the "price war as market acquisition" thesis
Q3 2026	First L3-L4 agent safety audit results published	High — sets precedents for what "compliant agent" means
Q4 2026	Baidu Wenxin 5.1 enterprise adoption metrics	Medium — tests whether feature differentiation beats price
2027	China's 15th Five-Year Plan (十五五) AI targets announced	High — multi-year capital allocation signal

Social Media Reactions

The convergence of policy, pricing, and platform announcements on June 1 generated intense discussion across Chinese social media. Here are representative comments, translated with original context preserved.

Platform	Comment (Chinese)	Translation	Engagement
Weibo	"DeepSeek这一刀砍下去，国内大模型厂商估计有一半要倒闭。价格战的尽头是垄断。"	"DeepSeek's price cut will probably bankrupt half of China's LLM companies. The end of a price war is monopoly."	12K likes, 3.2K reposts
Zhihu	"140万亿token是什么概念？相当于全中国人每人每天和AI聊1000句话。这不是泡沫，这是刚需。"	"140 trillion tokens means every Chinese person chats with AI 1,000 times daily. This isn't a bubble; it's rigid demand."	8.5K upvotes
Xiaohongshu	"文心5.1开始收费了，豆包还能免费多久？打工人的AI自由要结束了😭"	"Wenxin 5.1 is charging now. How long before Doubao goes paid too? The era of free AI for office workers is ending 😭"	45K likes
Twitter/X (English)	"China just regulated AI agents while the US still can't agree on what an 'agent' is. Regulatory clarity is a competitive advantage."	—	28K likes
Hacker News	"DeepSeek at $0.10/million tokens makes self-hosting uneconomical for most use cases. The cloud won."	—	342 comments
LinkedIn	"The Inner Mongolia token exchange is the most interesting infrastructure play in AI right now. It's basically a commodity exchange for intelligence."	—	15K reactions

Conclusion: Two Futures, One Race

June 1, 2026 will be remembered as the day China's AI industry formally declared its intent to own the agentic layer of artificial intelligence. The policy framework, the token infrastructure, and the pricing war are not separate events; they are coordinated components of a national strategy to make China the default jurisdiction for agent-based AI development.

The 140 trillion daily tokens are the proof point. They represent not hype but habit — the daily rhythm of hundreds of millions of Chinese users and enterprises delegating tasks to AI systems. The new regulations ensure that this habit develops within state-supervised boundaries. The price war ensures that it develops on domestic platforms. The infrastructure ensures that it develops on domestic hardware.

For global observers, the lesson is clear: the AI race is no longer about who builds the best model. It is about who builds the best *ecosystem* — the regulatory environment, the computing marketplace, the pricing dynamics, and the developer community that together determine where agentic AI matures first.

China's bet is that ecosystem advantages compound faster than model performance advantages. DeepSeek's model may not be the world's best on every benchmark, but at $0.10 per million tokens, deployed through a regulated national marketplace, integrated with domestic chips, and wrapped in a clear compliance framework — it may be the world's most *deployable*.

That is a different kind of leadership. And it is one that the rest of the world will need to respond to, whether by competition, cooperation, or — most likely — a messy combination of both.

*This article was published on June 3, 2026. For related coverage, see our analysis of China's AI Creator Economy, the DeepSeek valuation story, and MiniMax's global expansion.*

By Meeeeed

Editor at AI in China. Tracking Chinese AI companies, funding rounds, and the technologies reshaping global tech. More about me.