S&P 5005,842.10 0.42%
NASDAQ19,210.55 0.88%
NVDA1,184.22 2.41%
MSFT478.90 0.88%
GOOGL210.11 1.12%
META612.50 0.34%
AAPL239.80 0.21%
AMZN248.66 1.40%
AVGO1,902.40 3.12%
TSLA298.10 1.05%
BTC98,420 1.88%
ETH4,210 2.24%
10Y4.18% 0.02%
DXY104.12 0.18%
S&P 5005,842.10 0.42%
NASDAQ19,210.55 0.88%
NVDA1,184.22 2.41%
MSFT478.90 0.88%
GOOGL210.11 1.12%
META612.50 0.34%
AAPL239.80 0.21%
AMZN248.66 1.40%
AVGO1,902.40 3.12%
TSLA298.10 1.05%
BTC98,420 1.88%
ETH4,210 2.24%
10Y4.18% 0.02%
DXY104.12 0.18%
Back to homepage
AI Chips

Nvidia's AI Throne: Cracks Showing as Rivals and Hyperscalers Build Their Own Chips

As Nvidia churns profits and prices, AMD, AWS, Google and startups are pushing custom silicon that could unsettle the GPU monopoly—and portfolios.

P
Pedro Marini
June 30, 2026 · 4 min read
Nvidia's AI Throne: Cracks Showing as Rivals and Hyperscalers Build Their Own Chips

Illustration by IMF Alpha editorial · Reviewed by Pedro Marini

Listen to this article
AI narration · ~4 min
Tickers mentioned
NVDA+3.40%AMD+1.80%INTC-0.50%AMZN+0.90%MSFT+1.20%

Why this matters now

Nvidia’s GPUs are the plumbing of modern AI — they power both training and inference across the cloud. That has created a simple market story: Nvidia equals AI. It’s a persuasive story, and partly true. But dominance on paper can be fragile. Over the last year or two a clearer rival ecosystem has taken shape: hyperscalers building custom accelerators, AMD bringing data-center GPUs to market, and specialist startups chasing niche inefficiencies. Investors who treat Nvidia’s lead as unassailable are exposing themselves to valuation risk.

A short history to frame this

The move from CPUs to GPUs for model training picked up momentum around 2016 and then went exponential after the LLM surge in 2022. Nvidia rode that wave not only with fast silicon but with software lock-in — CUDA. Any competitor now faces both raw performance hurdles and the inertia of an established developer ecosystem.

Who is actually threatening Nvidia, and how

  • AMD Instinct MI300: a full-scale bid at both training and inference, with a different memory approach and better energy math in some workloads. Early benchmarks and enterprise pilots matter here more than company press.
  • Hyperscaler custom chips: AWS Trainium and Inferentia, Google’s TPUs and other in-house silicon, and Microsoft’s experiments. When cloud providers run production on non-Nvidia hardware, that changes the economics for customers.
  • Startups and alternatives: Cerebras, Graphcore, SambaNova — they attack other bottlenecks, like interconnects or sparse compute, and sometimes beat Nvidia on cost-per-token in specific workloads.
  • Software portability: ONNX, ROCm and compiler improvements are slowly loosening CUDA’s grip. If tooling gets good enough, switching becomes feasible — assuming performance is competitive.

Signals that actually matter for investors

  • How quickly hyperscalers deploy and migrate. The moment major clouds move from pilots to production workloads on non-Nvidia chips is when adoption starts to show up in revenue.
  • Total cost of ownership. Performance-per-watt and cost-per-inference are the real decision drivers, not peak FLOPS on a spec sheet.
  • Wins in software compatibility. If AMD or someone else convinces major frameworks to run without heavy refactoring, switching costs fall dramatically.
  • Concrete customer case studies. Look for examples in recommendation systems, search ranking, or LLM inference where alternatives materially cut costs — not just lab demos.

What’s interesting is that small wins in tooling or cost can ripple — companies are pragmatic, and once a few large customers switch, others follow.

Why Nvidia is still hard to dislodge

Nvidia isn’t just chips. It owns drivers, libraries, enterprise support and developer mindshare — a kind of moat. Many startups build to complement Nvidia rather than replace it. And Nvidia keeps shipping: new architectures, cloud partnerships, deeper software work. That combination makes displacement slow and expensive.

Practical portfolio moves

  • Reduce single-name concentration that assumes endlessly superior margins. Multiple compression is the main downside risk.
  • Broaden exposure to firms that benefit from the AI buildout, not just the headline winner: cloud providers, selected chip rivals, and specialist IP companies.
  • Consider ETFs for hardware exposure if you want to avoid single-stock swings.

The nuance matters: winners will emerge, but they may be more numerous than the market currently expects. That matters when you’re sizing positions at today’s multiples.

What I’ll be watching this quarter

  • Public performance data on AMD MI300 and any announced hyperscaler migrations.
  • New developer tools that make swapping away from CUDA less painful.
  • Cloud-provider earnings commentary about AI infrastructure mixes.

Expect volatility and lots of headlines. Also expect a slower, structural rebalancing that creates selective opportunities beyond the obvious king of GPUs.

Advertisement
Continue reading

Related coverage

The IMF Brief · Daily Newsletter

The AI economy, decoded before the open.

Five minutes. One email. The signal cutting through the noise at the intersection of artificial intelligence and Wall Street. Free, forever.

Join 184,000+ readers · No spam · Unsubscribe anytime