S&P 5005,842.10 0.42%
NASDAQ19,210.55 0.88%
NVDA1,184.22 2.41%
MSFT478.90 0.88%
GOOGL210.11 1.12%
META612.50 0.34%
AAPL239.80 0.21%
AMZN248.66 1.40%
AVGO1,902.40 3.12%
TSLA298.10 1.05%
BTC98,420 1.88%
ETH4,210 2.24%
10Y4.18% 0.02%
DXY104.12 0.18%
S&P 5005,842.10 0.42%
NASDAQ19,210.55 0.88%
NVDA1,184.22 2.41%
MSFT478.90 0.88%
GOOGL210.11 1.12%
META612.50 0.34%
AAPL239.80 0.21%
AMZN248.66 1.40%
AVGO1,902.40 3.12%
TSLA298.10 1.05%
BTC98,420 1.88%
ETH4,210 2.24%
10Y4.18% 0.02%
DXY104.12 0.18%
Back to homepage
AI Chips

Cloud Wars: How AWS, Google and Microsoft Aim to Break Nvidia's AI Chip Grip

Big cloud players are betting on custom silicon to cut costs and control AI stacks — a smart play, but software ecosystems and scale still favor Nvidia.

P
Pedro Marini
June 1, 2026 · 4 min read
Cloud Wars: How AWS, Google and Microsoft Aim to Break Nvidia's AI Chip Grip

Illustration by IMF Alpha editorial · Reviewed by Pedro Marini

Listen to this article
AI narration · ~4 min
Tickers mentioned
NVDA+4.20%AMZN+1.80%MSFT+0.90%GOOGL+2.30%META+3.10%

Quick take: Cloud vendors are building custom AI chips to cut costs and reduce vendor dependence. That’s sensible. But displacing Nvidia is a different, tougher fight — it’s as much about software, developer mindshare and proven production runs as it is about silicon.

GPUs didn’t win by accident. After the deep learning breakthroughs of the early 2010s, GPUs became dominant because they offered programmable parallelism and, critically, a rich software stack centered on CUDA. History matters: hardware only takes over when the tooling and libraries follow.

Cloud vendors are pushing hard.

  • AWS shipped Trainium and Inferentia to shave training and inference bills on its own fleet — the argument being lower per-workload cost and tighter service integration.
  • Google has long optimized TPUs for its TensorFlow-first environment and internal model work.
  • Microsoft and other hyperscalers are quietly building custom boards or striking partnerships to diversify suppliers and control supply chains.
  • Smaller players such as Graphcore and Cerebras pitch themselves for particular model shapes where their architectures shine.

Still, the counterargument is blunt: Nvidia didn’t just make chips. It built an ecosystem — CUDA, cuDNN, a huge catalog of optimized kernels and vast third-party support. That’s a moat of developer time and accumulated engineering effort, not just teraflops.

What cloud-first silicon buys you

  • Cost control — providers claim meaningful savings at scale; when you’re training models that cost millions, margins matter.
  • Tighter stack integration — silicon tuned to a vendor’s services can make deployment and monitoring simpler.
  • Supply resilience — owning design reduces exposure to single-vendor chokepoints.

What it doesn’t buy quickly

  • Instant developer adoption — moving models, retooling pipelines or learning new SDKs carries real engineering cost.
  • The broader software ecosystem — lots of tools, libraries and tuned models still assume a GPU-first world.

From an investor’s point of view: this isn’t an overnight, winner-takes-all flip. Nvidia remains the easy market bet for AI acceleration, which explains its valuation today. But if cloud-native silicon gains traction, it could compress Nvidia’s long-term margins and alter how capital-hungry AI startups need to be.

Keep an eye on a few signals

  • Real-world LLM training and inference benchmarks — not just peak FLOPS.
  • How smoothly major frameworks let teams migrate — one unified SDK, or a dozen.
  • Whether an enterprise-ready ecosystem (tooling, vendors, support) forms around any non-Nvidia chip.
  • Pricing differentials at scale — 10% is background noise; north of ~30% becomes strategic.

My take: building custom chips is a rational defensive move for hyperscalers that want control and clearer cost predictability. But toppling Nvidia requires more than better silicon — it needs time, developer trust and visible production wins. Expect a hybrid future: GPUs will stay central for most workloads, while bespoke silicon quietly grows into niches where scale and integration justify the switch.

Advertisement
Continue reading

Related coverage

The IMF Brief · Daily Newsletter

The AI economy, decoded before the open.

Five minutes. One email. The signal cutting through the noise at the intersection of artificial intelligence and Wall Street. Free, forever.

Join 184,000+ readers · No spam · Unsubscribe anytime