New York · 09:42 ESTMarkets Open

S&P 5005,842.10▲ 0.42%•

NASDAQ19,210.55▲ 0.88%•

NVDA1,184.22▲ 2.41%•

MSFT478.90▲ 0.88%•

GOOGL210.11▲ 1.12%•

META612.50▼ 0.34%•

AAPL239.80▲ 0.21%•

AMZN248.66▲ 1.40%•

AVGO1,902.40▲ 3.12%•

TSLA298.10▼ 1.05%•

BTC98,420▲ 1.88%•

ETH4,210▲ 2.24%•

10Y4.18%▼ 0.02%•

DXY104.12▲ 0.18%•

S&P 5005,842.10▲ 0.42%•

NASDAQ19,210.55▲ 0.88%•

NVDA1,184.22▲ 2.41%•

MSFT478.90▲ 0.88%•

GOOGL210.11▲ 1.12%•

META612.50▼ 0.34%•

AAPL239.80▲ 0.21%•

AMZN248.66▲ 1.40%•

AVGO1,902.40▲ 3.12%•

TSLA298.10▼ 1.05%•

BTC98,420▲ 1.88%•

ETH4,210▲ 2.24%•

10Y4.18%▼ 0.02%•

DXY104.12▲ 0.18%•

Back to homepage

AI Chips

Cloud Wars: How AWS, Google and Microsoft Aim to Break Nvidia's AI Chip Grip

Big cloud players are betting on custom silicon to cut costs and control AI stacks — a smart play, but software ecosystems and scale still favor Nvidia.

Pedro Marini

June 1, 2026 · 4 min read

Cloud Wars: How AWS, Google and Microsoft Aim to Break Nvidia's AI Chip Grip

Illustration by IMF Alpha editorial · Reviewed by Pedro Marini

Listen to this article

AI narration · ~4 min

Tickers mentioned

NVDA+4.20%AMZN+1.80%MSFT+0.90%GOOGL+2.30%META+3.10%

Quick take: Cloud vendors are building custom AI chips to cut costs and reduce vendor dependence. That’s sensible. But displacing Nvidia is a different, tougher fight — it’s as much about software, developer mindshare and proven production runs as it is about silicon.

GPUs didn’t win by accident. After the deep learning breakthroughs of the early 2010s, GPUs became dominant because they offered programmable parallelism and, critically, a rich software stack centered on CUDA. History matters: hardware only takes over when the tooling and libraries follow.

Cloud vendors are pushing hard.

AWS shipped Trainium and Inferentia to shave training and inference bills on its own fleet — the argument being lower per-workload cost and tighter service integration.
Google has long optimized TPUs for its TensorFlow-first environment and internal model work.
Microsoft and other hyperscalers are quietly building custom boards or striking partnerships to diversify suppliers and control supply chains.
Smaller players such as Graphcore and Cerebras pitch themselves for particular model shapes where their architectures shine.

Still, the counterargument is blunt: Nvidia didn’t just make chips. It built an ecosystem — CUDA, cuDNN, a huge catalog of optimized kernels and vast third-party support. That’s a moat of developer time and accumulated engineering effort, not just teraflops.

What cloud-first silicon buys you

Cost control — providers claim meaningful savings at scale; when you’re training models that cost millions, margins matter.
Tighter stack integration — silicon tuned to a vendor’s services can make deployment and monitoring simpler.
Supply resilience — owning design reduces exposure to single-vendor chokepoints.

What it doesn’t buy quickly

Instant developer adoption — moving models, retooling pipelines or learning new SDKs carries real engineering cost.
The broader software ecosystem — lots of tools, libraries and tuned models still assume a GPU-first world.

From an investor’s point of view: this isn’t an overnight, winner-takes-all flip. Nvidia remains the easy market bet for AI acceleration, which explains its valuation today. But if cloud-native silicon gains traction, it could compress Nvidia’s long-term margins and alter how capital-hungry AI startups need to be.

Keep an eye on a few signals

Real-world LLM training and inference benchmarks — not just peak FLOPS.
How smoothly major frameworks let teams migrate — one unified SDK, or a dozen.
Whether an enterprise-ready ecosystem (tooling, vendors, support) forms around any non-Nvidia chip.
Pricing differentials at scale — 10% is background noise; north of ~30% becomes strategic.

My take: building custom chips is a rational defensive move for hyperscalers that want control and clearer cost predictability. But toppling Nvidia requires more than better silicon — it needs time, developer trust and visible production wins. Expect a hybrid future: GPUs will stay central for most workloads, while bespoke silicon quietly grows into niches where scale and integration justify the switch.

Related coverage

News· 4 min

Why Investors Are Betting Big on Synthetic Data — and Why It Might Be the Safer AI Play

As lawsuits and privacy rules squeeze scraped training sets, synthetic data firms are drawing capital and corporate deals. Practical wins, hidden risks.

By Pedro Marini

News· 4 min

Who's Selling the Brain Fuel: How Data Marketplaces Are Rewiring AI Supply Chains

From web-scraping lawsuits to paid, privacy-preserving feeds and synthetic substitutes — firms are buying better data to train safer, more valuable models.

By Pedro Marini

News· 3 min

When Your Phone Becomes the Server: The On-Device AI Shift That Will Redraw Tech's Borders

Smaller models, smarter chips and privacy-first apps are turning phones and PCs into autonomous AI hubs — and the ripple effects will hit chips, apps and search.

By Pedro Marini

Cloud Wars: How AWS, Google and Microsoft Aim to Break Nvidia's AI Chip Grip

Related coverage

Why Investors Are Betting Big on Synthetic Data — and Why It Might Be the Safer AI Play

Who's Selling the Brain Fuel: How Data Marketplaces Are Rewiring AI Supply Chains

When Your Phone Becomes the Server: The On-Device AI Shift That Will Redraw Tech's Borders

The AI economy, decoded before the open.