Why Enterprises Are Abandoning OpenAI — and What They're Replacing It With
From Llama 2 forks to custom inference stacks, companies are choosing cost, control, and privacy over convenience. Investors should take note.
From Llama 2 forks to custom inference stacks, companies are choosing cost, control, and privacy over convenience. Investors should take note.

Illustration by IMF Alpha editorial · Reviewed by Pedro Marini
The idea that OpenAI is the default engine for enterprise AI is starting to fray. Over the past year a clearer pattern has emerged: large incumbents and well-funded startups are either pulling models in-house or switching to open-source stacks to rein in exploding inference bills and tighten data governance.
This is not a hobbyist fad. Think of it as the cloud migration of AI: when APIs were new and cheap, businesses leaned on hosted models. Now the math — and regulators — are nudging many back toward on-prem or hybrid setups.
Concrete drivers
What teams are actually deploying
Why this matters for markets
Risks and caveats
A short historical lens
It mirrors early cloud adoption: first convenience, then scale exposed the economics, then a migration to hybrid architectures. SaaS to self-hosted hybrid — the cycle is replaying in AI.
Investor signals and things to watch
The upshot
The move away from single-vendor APIs toward open-source models and private inference feels like a natural market maturation. It does not kill managed APIs overnight, but it shifts where value accumulates — toward chips, cloud infrastructure and the MLOps layer. The question for executives and investors is no longer whether AI matters, but which slice of the stack actually captures the margin.

Synthetic and curated datasets are emerging as the missing link between privacy, model performance, and regulatory pressure — and investors should pay attention.

As financial firms swap raw customer records for engineered datasets, the winners will be those who balance speed with skeptical validation.

Smartphones and edge chips are pushing large language models and inference off servers. That shift reshuffles winners, risks, and the economics of AI.