AI Technology15 min read

The Silicon Curtain: How Huawei's $12 Billion AI Chip Surge Is Splitting Global AI in Two

June 22, 2026·AI in China
The Silicon Curtain: How Huawei's $12 Billion AI Chip Surge Is Splitting Global AI in Two
High-tech circuit board with glowing blue neural network pathways representing AI chip infrastructure

*The global AI infrastructure landscape is splitting along geopolitical lines. While Western AI runs on NVIDIA silicon, China's tech giants are placing billion-dollar bets on Huawei's Ascend chips — creating two parallel compute ecosystems. Photo: Unsplash*

In late March 2026, a procurement memo circulated through ByteDance's Singapore headquarters that would have been unthinkable eighteen months earlier. The company was placing a volume order for Huawei's Ascend 910C AI chips — not as a backup plan, but as a primary compute strategy. Two months later, Alibaba followed. Then Tencent. Then Baidu. By June, the Financial Times reported what insiders already knew: Huawei's AI chip revenue would grow 60% year-over-year to approximately $12 billion in 2026, with China's four largest technology companies collectively shifting from NVIDIA dependency to domestic silicon.

This is not a supply-chain adjustment. It is a structural bifurcation of the global AI infrastructure stack. For two decades, the world's AI systems have run on a single hardware paradigm: NVIDIA GPUs, programmed through CUDA, manufactured by TSMC. That monoculture is ending. What is emerging is a dual-track world where Western AI runs on NVIDIA/AMD silicon and Chinese AI runs on Huawei Ascend — with incompatible software ecosystems, divergent cost structures, and increasingly separate talent pools.

When ByteDance trains Doubao 2.0 on Huawei chips instead of H100s, the optimization choices ripple through every layer of the stack. When DeepSeek's V4 trains exclusively on Ascend silicon, the resulting model is not merely "a Chinese alternative" to GPT-5.2. It is a product of a different computational regime, optimized for different constraints, potentially capable of different capabilities.

The Silicon Curtain is not coming. It is already here.


The $12 Billion Inflection Point: What the Numbers Reveal

The Financial Times report that broke the $12 billion figure in May 2026 confirmed the velocity of change. Huawei's AI chip revenue had already crossed $7.5 billion in 2025 — a year when most Western analysts still dismissed Ascend as a second-tier alternative. The 60% growth to $12 billion means Huawei is not merely capturing stranded demand. It is winning competitive evaluations against the world's most advanced AI accelerators.

Huawei AI Chip Business Metrics202420252026 (Projected)CAGR
AI chip revenue~$3.2B~$7.5B~$12.0B93%
Ascend 910 series shipments~120K~280K~450K+93%
Major customers (confirmed)Huawei Cloud, gov't+Baidu, some Alibaba+ByteDance, Tencent, all BAT
Software ecosystem (CANN)Basic PyTorch supportProduction-readyCompetitive with CUDA for most workloads

*Sources: Financial Times, May 2026; Bernstein Research; company disclosures; industry estimates.*

The customer list is what matters. In 2024, Huawei Ascend's primary buyers were Huawei Cloud itself and government computing centers — captive demand that did not prove commercial competitiveness. By 2026, ByteDance, Alibaba, Tencent, and Baidu are all placing volume orders. These are not companies that compromise on performance. ByteDance's recommendation algorithm powers TikTok and Douyin — the most computationally intensive AI inference workload on the planet, serving over 100 million daily active users on Doubao alone. If ByteDance is willing to shift inference load to Ascend, the chips have crossed a performance threshold that observers thought was years away.

Table 2: Major Chinese Tech Companies — AI Chip Strategy Shifts (2026)

CompanyPrimary AI Chip (2024)Primary AI Chip (2026)Custom Silicon PlansAnnual AI Capex (2026 est.)
ByteDanceNVIDIA H100/H800Huawei Ascend 910C + custom350K custom inference chips (Samsung)~$8-10B
AlibabaNVIDIA H100/A100Huawei Ascend 910C + T-Head ZhenwuZhenwu M890 (in-house via T-Head)~$15B+
TencentNVIDIA H100Huawei Ascend 910CCo-investing in domestic foundries~$10B
BaiduNVIDIA A100/H100Huawei Ascend 910C + KunlunxinKunlunxin 2nd/3rd gen~$6B
DeepSeekNVIDIA H800Huawei Ascend 910C (V4 training)None disclosedSelf-funded (~$220M revenue)

*Sources: Reuters, March 2026; Financial Times, May 2026; company earnings calls; Bernstein estimates.*

The aggregate capex figure is staggering. China's top five AI labs and cloud providers are collectively spending over $40 billion annually on AI infrastructure — and an increasing share is flowing to domestic suppliers. This approaches the total AI infrastructure spending of Amazon Web Services (~$50B), the world's largest cloud provider. China is building a parallel compute ecosystem at a scale that ensures its viability regardless of future trade policy changes.


How the Ascend 910C Closed the Gap

The Ascend 910B, released in late 2023, was widely estimated to deliver 60-70% of an NVIDIA H100's performance on training workloads, with significant software gaps. The Ascend 910C, shipping in volume since late 2025, has narrowed that gap substantially. Independent benchmarks suggest the 910C now approaches 80-85% of H100 performance on standard transformer training — and in some inference scenarios, it matches or exceeds H100 efficiency.

Table 3: NVIDIA H100 vs. Huawei Ascend 910C — Technical Comparison

SpecificationNVIDIA H100 SXM5Huawei Ascend 910CRatio
FP16 compute (dense)989 TFLOPS~800 TFLOPS0.81x
FP16 compute (sparse)1,979 TFLOPS~1,200 TFLOPS0.61x
Memory capacity80 GB HBM3~64-80 GB HBM2e0.80-1.0x
Memory bandwidth3.35 TB/s~2.4 TB/s0.72x
Interconnect (chip-to-chip)NVLink 900 GB/sHCCS ~600 GB/s0.67x
Process nodeTSMC 4N (5nm)SMIC N+2 (7nm-equiv)
Power consumption700W~400-500W0.57-0.71x
Software ecosystemCUDA (15+ years)CANN / MindSpore (5 years)Maturing rapidly

*Sources: NVIDIA product specifications; Huawei technical disclosures; third-party benchmarking by SemiAnalysis; industry interviews. Note: Ascend 910C figures are estimates; Huawei does not publish full specifications.*

The power efficiency advantage is significant. At 400-500W versus H100's 700W, Ascend 910C offers substantially lower total cost of ownership for inference workloads — the dominant AI compute cost as models move from training to deployment. For power-constrained data centers, this efficiency gain translates directly to more tokens processed per rack.

But the real breakthrough has been in software. Huawei's CANN and MindSpore framework have matured from research projects to production-grade platforms. When DeepSeek trained its V4 model on Ascend chips in early 2026, it was a practical decision enabled by CANN's improving PyTorch compatibility and MindSpore's optimized transformer kernels.

Table 4: Software Ecosystem Maturity — CUDA vs. CANN (2026)

CapabilityNVIDIA CUDAHuawei CANN / MindSporeGap Assessment
PyTorch native supportNativeCompatible (98%+ model coverage)Closing
TensorFlow supportNativeSupportedModerate gap
JAX supportNativeLimitedSignificant gap
FlashAttention optimizationOptimizedImplemented (custom kernels)Closing
Custom kernel developmentMature (CUDA C++)CANN DSL, improvingModerate gap
Distributed training (3D parallelism)Megatron/DeepSpeed matureMindSpore + customClosing
Inference serving (vLLM-equivalent)vLLM, TritonMindIE, CANN-LLMClosing
Developer community sizeMillionsHundreds of thousandsSignificant gap
Third-party library ecosystemExtensiveGrowing (500+ models)Moderate gap

*Sources: Huawei developer conferences; MindSpore GitHub repository; independent benchmarking; developer surveys.*

Two years ago, CANN was a compatibility layer requiring substantial porting effort. Today, most PyTorch models run on Ascend with minimal modification. The remaining gaps — particularly in JAX support and custom kernel development — affect frontier research more than production deployment. For inference workloads that dominate commercial AI, CANN is increasingly competitive.


ByteDance's 350,000-Chip Gambit: Custom Silicon Enters the Race

If Huawei's $12 billion surge represents consolidation, ByteDance's move into custom silicon represents expansion. In February 2026, Reuters reported that ByteDance was developing a custom AI inference chip and negotiating with Samsung Electronics — with engineering samples expected by March 2026 and annual production targets of 100,000 to 350,000 units.

At 350,000 units annually, ByteDance would become one of the world's largest AI chip producers — comparable to AMD's AI accelerator shipments and exceeding Intel's Gaudi volumes. The chips are designed for inference: the token-serving operations that power Doubao's 100 million+ daily active users and TikTok's AI features.

Table 5: ByteDance Custom AI Chip — What We Know

AttributeDetails
Chip typeAI inference ASIC (not general-purpose GPU)
Manufacturing partnerSamsung Electronics (foundry)
Process nodeEstimated 4nm or 5nm (Samsung process)
Annual volume target100,000–350,000 units
Engineering samplesMarch 2026
Primary workloadsDoubao inference, TikTok recommendation, ad ranking
Strategic objectiveReduce NVIDIA dependency; cut inference costs 30-50%

*Sources: Reuters, February 2026; Techovedas analysis; Samsung foundry disclosures.*

The Samsung partnership is strategically significant for two reasons. First, it represents diversification beyond TSMC — whose advanced nodes are increasingly restricted for Chinese companies by US export controls. Samsung's foundry business offers a viable alternative for 4nm and 5nm production. Second, it signals that ByteDance is planning for a future where even NVIDIA's H20 (the cut-down China-export variant) may become unavailable.

ByteDance's strategy mirrors Google's TPU and Amazon's Trainium/Inferentia: build specialized hardware for specific workloads, accept the loss of generality, and capture massive cost savings at scale. If Doubao processes 50 trillion tokens daily, even a 20% reduction in per-token cost translates to hundreds of millions in annual savings.

ByteDance is simultaneously ordering Ascend 910C chips and developing its own inference silicon. This suggests a layered strategy: Ascend for training and general-purpose inference in the near term, custom chips for specialized high-volume inference in the medium term. Huawei benefits from training volume today while facing potential displacement in inference tomorrow.


DeepSeek on Ascend: The Proof Point That Changed Minds

In early 2026, DeepSeek — the Hangzhou startup that had already disrupted global AI economics with its $5.6 million V3 training run — began training its V4 model exclusively on Huawei Ascend 910C chips. Not as a comparison experiment. As the primary training infrastructure for a model competing with GPT-5.2 and Gemini 3.

The significance cannot be overstated. DeepSeek is not a state-owned enterprise with patriotic obligations. It is a privately held startup backed by the High-Flyer hedge fund, with an estimated $220 million in annual revenue and a reputation for ruthless engineering efficiency. When DeepSeek's engineers evaluated training platforms for V4, they chose Ascend because the math worked — not because of politics.

Table 6: DeepSeek Model Training Infrastructure Evolution

ModelTraining HardwareParametersNotable Innovation
DeepSeek-V2NVIDIA H800236B (21B active)MLA attention, efficient MoE
DeepSeek-V3NVIDIA H800671B (37B active)GRPO reinforcement learning
DeepSeek-R1NVIDIA H800671BReasoning-focused, open-source
DeepSeek-V3.2SpecialeMixed (H800 + Ascend)671BAIME 2025: 96.0%
DeepSeek-V4Huawei Ascend 910C~1.6T (est.)Multimodal, long-context

*Sources: DeepSeek technical papers; industry analysis; Huawei partner disclosures. Note: V4 specifications are estimates.*

DeepSeek's V3.2Speciale result on AIME 2025 — 96.0%, surpassing GPT-5-High's 94.6% — was achieved on mixed infrastructure including Ascend chips. This was the moment when the industry realized the "China can't train frontier models without NVIDIA" assumption was obsolete. If DeepSeek could match OpenAI's reasoning performance while training partially on domestic silicon, the hardware constraint was no longer binding on model quality.

The V4 training run, reportedly conducted entirely on Ascend 910C clusters, represents the next step. DeepSeek's architectural innovations — particularly its Mixture-of-Experts design with sparse activation and Multi-Head Latent Attention (MLA) that reduces KV cache memory by 93% — are optimized for memory-bandwidth-constrained hardware. The constraint became the catalyst for creativity.


The Model Ecosystem: What Runs on Chinese Silicon

Hardware without software is merely expensive sand. The ultimate test of China's AI chip independence is whether the country's most capable models can be trained and served entirely on domestic silicon. By mid-2026, the answer is increasingly yes.

Table 7: Major Chinese AI Models — Hardware Affinity (2026)

CompanyModelParametersPrimary Training HardwareCan Run Fully on Ascend?
DeepSeekV3.2Speciale / V4671B–1.6TAscend 910C (V4)Yes (V4)
AlibabaQwen3-Max / Qwen3.51T+Mixed (NVIDIA + T-Head)Partial (T-Head compatible)
ByteDanceDoubao 2.0Not disclosedMixed (NVIDIA + Ascend)Inference: yes; Training: partial
BaiduERNIE 5.02.4TKunlunxin + NVIDIAKunlunxin native; Ascend: compatible
MoonshotKimi K2.51T (32B active)NVIDIA H100/H800Porting in progress
ZhipuGLM-5744BHuawei Ascend (reportedly)Yes
MiniMaxM2.5Not disclosedMixedPartial
01.AIYi-LargeNot disclosedNVIDIANo (NVIDIA-only)

*Sources: Company technical disclosures; industry analysis; Huawei partner announcements.*

The diversity of approaches is notable. Zhipu's GLM-5 is reportedly trained entirely on Chinese chips. Baidu's ERNIE 5.0 runs natively on Kunlunxin. Alibaba's Qwen3-Max is optimized for both NVIDIA and T-Head's Zhenwu M890. What unites these approaches is a shared direction of travel: every major Chinese AI lab is increasing the share of domestic silicon in its infrastructure. None are abandoning NVIDIA entirely — the H20 remains available for certain workloads. But the dependency ratio is shifting rapidly. Where Chinese labs ran 90% of their compute on NVIDIA in 2023, the ratio is closer to 50-60% in 2026 and projected to fall below 40% by 2027.

The model quality implications are equally significant. Despite training on hardware that benchmarks 15-20% below NVIDIA's best, Chinese models have closed to within 5-9 percentage points of American frontier models on standardized evaluations. The USCC March 2026 report acknowledged this explicitly: "Chinese models are currently within 5–9 percentage points of the American frontier despite hardware disadvantage." The deficit is either a testament to algorithmic innovation or evidence that hardware gaps matter less than the scaling orthodoxy assumed.


The Price War: How Chinese Silicon Reshapes Global Economics

The most visible impact of China's AI chip shift is in pricing. DeepSeek's API pricing — $0.28 per million input tokens — is roughly one-sixth to one-quarter of comparable US models. ByteDance's Doubao 2.0 is positioned at 90% cheaper than GPT-5.2. Alibaba's Qwen3-Max undercuts Claude Opus 4.5 by similar margins.

These differences are not solely due to cheaper labor or government subsidy. They reflect a fundamentally different cost structure. Huawei Ascend chips, manufactured on SMIC's 7nm-class nodes and sold without NVIDIA's margin structure, cost substantially less per unit of performance. Chinese data center electricity costs average 30-40% below US rates. And the absence of CUDA licensing fees — CANN is free — removes a cost layer that Western AI companies take for granted.

Table 8: AI Inference Pricing Comparison — Chinese vs. Western Models (June 2026)

ProviderModelInput ($/M tokens)Output ($/M tokens)vs. GPT-5.2
OpenAIGPT-5.2$2.50$10.00Baseline
AnthropicClaude Opus 4.5$3.00$15.00+50%
GoogleGemini 3 Pro$1.75$7.00-30%
DeepSeekV3.2Speciale$0.28$0.42-96%
ByteDanceDoubao 2.0$0.15$0.30-97%
AlibabaQwen3-Max$0.35$0.70-93%
ZhipuGLM-5$0.11$0.22-98%
MiniMaxM2.5$0.20$0.40-95%

*Sources: Official API pricing pages; Artificial Analysis aggregation; data as of June 2026.*

The pricing disparity has triggered what analysts call the "Great AI Deflation." Western labs, facing margin pressure, have been forced to cut prices repeatedly. OpenAI's GPT-5.2 pricing represents a 40% reduction from GPT-5's launch prices six months earlier — a compression rate unprecedented in software history.

But the pricing war has a darker side for Western AI economics. If Chinese labs can sustain profitable operations at these price levels — and DeepSeek's reported $220 million annual revenue suggests they can — then the entire business model of American frontier AI comes into question. OpenAI's $40 billion annual revenue depends on pricing power that Chinese competition is systematically eroding. The "intelligence too cheap to meter" future may arrive faster than Silicon Valley's business models can adapt.


The Global Bifurcation: Two Stacks, One Future?

The most consequential question raised by Huawei's $12 billion surge is whether the global AI infrastructure stack can remain unified — or whether we are witnessing the permanent divergence of two incompatible ecosystems.

The evidence points toward divergence. NVIDIA's CUDA ecosystem, built over fifteen years with millions of developers, represents a gravitational force that no Chinese alternative can fully replicate. But CUDA's dominance is increasingly confined to Western markets. Chinese AI development is progressively optimized for CANN, MindSpore, and emerging frameworks like ModelBest's ForgeTrain, which generates cross-platform code automatically.

DimensionWestern AI StackChinese AI StackTrajectory
Primary hardwareNVIDIA H100/H200/BlackwellHuawei Ascend 910C + domestic alternativesDiverging
Software ecosystemCUDA, PyTorch, JAXCANN, MindSpore, PaddlePaddlePartially compatible
Foundry accessTSMC 3nm/4nmSMIC 7nm-equiv + Samsung 4nm2-3 node gap
Model opennessMixed (OpenAI closed, Meta open)Predominantly open-sourceChinese more open
Cost structurePremium ($2-15/M tokens)Ultra-low ($0.11-0.42/M tokens)Price pressure Westward
Global deploymentAmericas, Europe, developed AsiaChina, SE Asia, Middle East, AfricaExpanding spheres
Government policyExport controls, CHIPS ActDomestic substitution mandates, subsidiesReinforcing divergence

*Sources: USCC March 2026 report; industry analysis; policy documentation.*

The "expanding spheres" dynamic is particularly important. Alibaba Cloud, powered by Qwen models and increasingly by Zhenwu and Ascend chips, is the dominant cloud provider in Southeast Asia. Huawei's cloud+AI stack has deep penetration in the Middle East and Africa — markets where US export controls have limited NVIDIA's presence and where Chinese pricing is decisive. The global AI market is organizing into spheres of influence aligned with geopolitical blocs.

For multinational enterprises, this creates a strategic dilemma. A European company deploying AI faces three options: run on American infrastructure (AWS/Azure/GCP with NVIDIA), run on Chinese infrastructure (Alibaba Cloud with Ascend), or attempt to maintain dual stacks. The third option is technically possible but economically painful — models optimized for one hardware platform rarely transfer efficiently to the other. Most enterprises will choose a primary stack based on their geographic and regulatory footprint.

The USCC March 2026 report identified two "reinforcing loops" in China's AI strategy: open collaboration within Chinese labs (where frontier models build on each other's work), and global diffusion (where adoption worldwide generates deployment data and developer contributions). When Airbnb's chatbot runs on Qwen, Alibaba benefits from global-scale feedback. When Cursor builds on Kimi K2.5, Moonshot gains usage data. The open-source strategy is a deliberate mechanism for gaining adoption advantages that would otherwise require Western market access.


What Chinese Developers and Industry Insiders Are Saying

华为昇腾开发者社区 @AscendDeveloper

"CANN 5.0的PyTorch兼容性已经做到95%了,我们内部的项目迁移基本上改几行配置就能跑。最大的问题不是能不能跑,是性能调优的经验积累还赶不上CUDA。"

*"CANN 5.0's PyTorch compatibility is at 95%. Our internal projects basically run with a few config changes. The biggest problem isn't whether it runs — it's that performance tuning experience hasn't caught up to CUDA yet."*

知乎 @芯片架构师老周

"昇腾910C的算力确实够用了,但互联带宽还是瓶颈。做大模型训练的时候,NVLink的900GB/s和HCCS的600GB/s差距会放大。不过华为在下一代已经在推全光互联了,这个如果能落地,局面会完全不一样。"

*"Ascend 910C's compute is definitely sufficient, but interconnect bandwidth remains the bottleneck. When training large models, the gap between NVLink's 900GB/s and HCCS's 600GB/s gets amplified. But Huawei is already pushing all-optical interconnect for the next generation — if that lands, the game changes completely."*

微博 @科技观察员小马

"字节自己做芯片不意外,Doubao每天50万亿token,按H100的推理成本算,一年光算力支出就是几十亿美金。自己做ASIC能省一半,这个账怎么算都值。"

*"ByteDance making its own chips is no surprise. Doubao processes 50 trillion tokens daily. At H100 inference costs, that's billions in compute spend annually. Building custom ASICs saves half — the math works no matter how you calculate."*

Twitter/X @ChinaTechWatch

"The Huawei Ascend story is not just about chips. It's about whether export controls accelerate the very capabilities they aim to restrict. Every restriction has forced Chinese labs to optimize harder, and the result is models within 5% of the frontier at 1/10th the cost."

GitHub Discussion @deepseek-v4-ascend

"We ported our fine-tuning pipeline from H100 to Ascend 910C. Training throughput was 85% of H100 on the same model, but power consumption was 40% lower. For our use case — daily fine-tuning on customer data — the TCO advantage is compelling."

脉脉 @匿名字节员工

"内部说法是2027年推理流量70%要切到自研芯片。昇腾是过渡方案,长期看字节要像Google做TPU一样做自己的全家桶。"

*"Internally, the target is 70% of inference traffic on custom chips by 2027. Ascend is a transitional solution; long-term, ByteDance wants to do its own full stack like Google's TPU."*


The Road Ahead: Three Scenarios for 2027-2028

Scenario 1: Convergence (30% probability)

US-China trade tensions ease, NVIDIA regains full market access to China, and Chinese labs revert to a mixed strategy with NVIDIA as primary hardware. Huawei Ascend becomes a viable regional alternative but not a global competitor. This scenario requires a fundamental shift in US trade policy that appears unlikely given current political dynamics.

Scenario 2: Stable Bifurcation (50% probability)

The current trajectory continues. Two parallel ecosystems mature independently, with limited interoperability. Western AI runs on NVIDIA/CUDA; Chinese AI runs on Ascend/CANN. Each ecosystem serves its geographic and regulatory sphere. Global enterprises maintain dual stacks for compliance. This is the base case implied by current investment patterns and policy directions.

Scenario 3: Chinese Overtake (20% probability)

Huawei's next-generation Ascend chips (920 series, rumored for 2027) close or eliminate the performance gap with NVIDIA's contemporaneous products. Combined with China's manufacturing cost advantages and aggressive open-source model distribution, Chinese AI infrastructure becomes competitive in neutral markets (Southeast Asia, Middle East, Africa, Latin America). NVIDIA's global market share erodes beyond China.

The third scenario, while lower probability, has asymmetric implications. If Chinese AI infrastructure becomes globally competitive, the network effects that have sustained NVIDIA's dominance — developer familiarity, library ecosystem, research institution partnerships — could shift to Chinese platforms. This is the scenario that keeps Silicon Valley strategists awake at night.


Conclusion: The Curtain Is Already Here

The $12 billion figure is a milestone, not a destination. Huawei's AI chip business will likely exceed $20 billion by 2028 if current growth rates persist. ByteDance's custom silicon will enter volume production. SMIC's manufacturing capabilities will advance to 5nm-class nodes. And Chinese models trained on domestic hardware will continue closing the quality gap with American counterparts.

What the Western technology industry has not fully internalized is that this transition is irreversible. Even if US export controls were lifted tomorrow, Chinese technology companies would not return to NVIDIA dependency. The investments in CANN, in MindSpore, in custom silicon, in Ascend-optimized model architectures — these are sunk costs that generate compounding returns. The path dependence of technology ecosystems means that once a critical mass of developers, models, and applications is optimized for a platform, switching costs become prohibitive.

The Silicon Curtain is not a future risk to be managed. It is a present reality to be navigated. For enterprises, dual-stack strategies are no longer optional. For investors, valuing Chinese AI labs requires understanding a different cost structure and growth trajectory. For policymakers, export controls have accelerated rather than slowed the very capability they sought to contain.

And for the global AI community, it means the monoculture is over. The next decade of artificial intelligence will be built on at least two fundamentally different hardware foundations, optimized by different constraints, serving different markets, and potentially developing different capabilities. Whether this divergence leads to competition, isolation, or managed coexistence will define the geopolitics of intelligence in the 21st century.

The curtain has fallen. The play is already in its second act.


Related Articles

- Alibaba's Zhenwu M890: Can T-Head's 100 Billion RMB Chip Bet Break NVIDIA's Grip?

- DeepSeek V4's Million-Token Context and China's AI Sovereignty

- China's AI Chip Renaissance: Q1 2026 Market Analysis

- ModelBest's ForgeTrain: When AI Writes Its Own Training Infrastructure

- The Profit-for-Future Gamble: How China's Tech Giants Sacrificed $100 Billion to Win AI

- Huawei Pangu Ultra MoE: 718B Parameters on Chinese Silicon


*This article was published on June 22, 2026. For corrections or inquiries, contact editor@ainchina.com*

*Data sources: Financial Times, Reuters, Bernstein Research, SemiAnalysis, USCC March 2026 report, company disclosures, Huawei developer conferences, industry interviews. Market share and performance figures are estimates based on available data. Huawei does not publish full Ascend specifications.*

*Word count: ~3,400 words | Reading time: 16 minutes*

M

By Meeeeed

Editor at AI in China. Tracking Chinese AI companies, funding rounds, and the technologies reshaping global tech. More about me.

← Previous

GLM-5.1: The 754B-Parameter Open-Source MoE That Beat GPT-5.4 on Real-World Coding — and Trained Without NVIDIA

Next →

Huawei Pangu Ultra MoE: The 718-Billion-Parameter Model Built Entirely on Chinese Silicon