Brad Gerstner confirmed that tokens aren't being sold at a loss. Whatever the formula, API + Subscription split, the companies are making a profit on net token sale.
They maybe running at loss after all the salaries and stock comp, but tokens are in profit now.
It's like witnessing a rocket using the most powerful engine on Earth then once it escaped orbit turn off the engine and said "It is flying without power!".
Yes, sure, right now it is ... but that's NOT how it got here.
There are trillions invested to recoup and at most billions in sales. It doesn't add up to tokens making a profit any time soon.
The problem is, people see "they're not profitable once you account for training" and equate that to "AI will go away soon"
But if all the AI companies stopped training new models, they would all instantly become profitable (and stick around)
The thing that makes them unprofitable, is having to compete (which means training models). If / when enough companies exit the market, the cost to compete goes down and you end up in an equilibrium
Sure, but if companies don't exit the market and FOSS alternatives don't end up being unable to get near them in quality, they have to keep spending on training. And conversely, if the market becomes uncompetitive and FOSS sucks, the winners of the AI arms race are very strongly incentivised to stick their prices up anyway...
> if companies don't exit the market and FOSS alternatives don't end up being unable to get near them in quality, they have to keep spending on training
Eh, the AI companies still have lots of datacentres. For the guys who funded with equity, they could collapse down to just running those as utilities. (For the guys who funded with debt, they'd have to restructure.)
From the customer's perspective, this situation shouldn't result in a cost spike. (Consolidation, on the other hand, would. But that's a separate argument from the one the article attemptes to make.)
How often do VC funded unicorns collectively decide to stop scaling up, shut down all their departments targeting growth and reach breakeven point by becoming low margin utilities that will never justify their valuation?
Good thing the entire nation's economic growth outlook isn't tied to these companies then. For a second I thought we had a potentially dangerous situation on how we misappropriated trillions of capital.
Not really, because investors will sooner or later want to see real returns on what they invested. Tokens are suddenly not dirt cheap and enterprises are screwed.
It's like selling dope, once they're addicted, a dealer could turn the screw on them
That's why it's an issue for investors. Their investment may not payout. But the things that were built will still have been built and available to sell for related purposes, the models that were trained will still be trained, and so on.
If things don't end up working out a lot of people have already been (and in the future will be) paid. It's the investors that will lose out, not the subscriber.
When I compare different foundational models on the problems I solve with AI, the differences are not that large to prevent a switch if the price gets too high. I do this like each 6 months, just to assess what is the risk of getting dependent on one provider. It's not yet worring, at least for my use-cases.
OK hundreds of billions with more than ~200B disclosed for OpenAI, more than ~50B for Anthropic and I have no idea how much in terms of infrastructure from Azure, other neoclouds, NVIDIA, etc. It's honestly hard to keep track of "kind of IoUs-ish" from each other but my points is order of magnitude more than few billions than has be recouped so far with tokens and large contracts.
Tokens can be sold at profit, but 70% of compute expenditure goes to R&D and model training[0]. Inference needs to cover all of that as well as being profitable in a vacuum.
At the same time, the training paradigm being scaled, Reinforcement Learning, is significantly less data-efficient than next-token prediction. You basically need to run an agent for minutes (or longer if you want good long-horizon performance), only to give it a binary pass/fail - one bit of information.
Inference compute is definitely scaling fast, but to scale RL, training and R&D compute also needs to scale hard. I don't think it's obvious that inference will overtake R&D/training, unless there's a reputable source that states that.
They aren't being sold at a loss but they aren't being sold at enough to cover the current losses and the costs. The losses are being passed around in some fucked up circular funding mess which will inevitably collapse into a debt crisis at some point.
Open source models apply pressures on the low end of the market. The paid models are so much better that they can charge based on value for enterprises.
I wouldn't call Kimi K2.6, GLM5.1, DS4 or newer Qwen models "low end". I prefer GPT5.5, but if it disappeared tomorrow, I'd be perfectly fine with any of these chinese models.
I think for a while this is possible - the models definitely aren't as efficient as they can be as we've seen a lot of promising papers over the last year about how people are changing pieces and parts to do more with less. None of it has come to market yet that I'm aware of so for now it's just a hope I suppose but things like Opus definitely burn a ton of compute to be the leader in benchmarks but the gaps are closing.
He's an interested party. His investments are worth a lot more if he says that tokens are sold at a profit. I don't understand how anyone would trust him?
There are plenty of various providers on OpenRouter serving very large Chinese models like GLM for a fraction of what OpenAI/Anthropic. Presumably they are making a profit.
It’s unlikely that Claude is proportionally that bigger and more expensive to serve so profit margins on inference must be pretty decent
Do we know they are making a profit though? They could be subsidizing use to build market share the same way. They might not have billions, but at the volumes they are selling maybe they’ve got the cash to do it.
Even if they are “profitable” how many Uber drivers are “profitable” because they aren’t correctly calculating asset depreciation. Maybe these guys are doing the same thing.
Maybe it’s a lot of people who already had GPUs for crypto mining, and they’ve moved over to this, so that if they need to grow and buy new GPUs the costs would dramatically grow.
also, it's very much possible that the chinese companies get heavy investments from the state. Since it's very hard to get this info we have no idea wether they really make a profit or not.
I agree, and find that very plausible. I mean, for the CCP a few billions to subsidize domestic AI companies is a tiny investment with a potential huge payoff. It prevents (or at least make it harder for) US companies to build a monopoly on LLM tech and it could help popping the bubble which would weaken the US economy. In fact, if I remember correctly, the AI infrastructure build-out is what is keeping the US from a technical recession.
> subsidizing use to build market share the same way
To an extent maybe, but that market is almost entirely commoditized already. Besides Cerebras and maybe Groq (which already charge a slight premium) all the other providers are more less interchangeable.
> Maybe it’s a lot of people who already had GPUs for crypto mining
I’m not sure the type of GPUs that were most popular for crypto are at all useful for LLMs?
If there’s a few providers subsidizing, that’s the price ceiling. Everyone who wants to compete has to subsidize.
Now if this market had been operating for years, I’d say that it’s likely all these companies are profitable or close to it. But the market is so new and there’s so much hype, I find it very plausible that none of these guys are making a profit and they all hope to just hang in until all the subsidies go away.
> I’m not sure the type of GPUs that were most popular for crypto are at all useful for LLMs?
There’s some overlap. I’ve definitely read about people repurposing.
Do you think it will be the case for the Claude Code/Codex tokens as well? I think those are heavily subsidized, but they're the only ones I find real value in.
> Why are we all whispering about how profitable all this is?
Nobody is whispering about anything. Everyone is loudly assuming what's convenient for their thesis. Even if you have access to the books, the accounting isn't straightforward–there are yet insufficient data for a meaningful answer.
> It is the absolute last thing these firms would keep secret
If you find an optimisation strategy that you don't think your competitors have, you absolutely keep your margins secret for as long as possible. Knowing something is possible is the first step to making it so.
Based on what I said. If e.g. Sonnet (assuming it’s significantly smaller than Opus) is unprofitable why are there a bunch of inference providers on OpenRouter serving very large models way cheaper? They don’t have a pile of money to burn for no reason.
Obviously I, like basically everyone else here, don't have access to Open AI or Anthropic books so it's just guessing based on public available evidences, but "tokens aren't being sold at a loss" does not imply there is any profit.
And, even if there is some profit, it needs to be big enough to at least pay back the capex spendings and finance the next model iteration.
Ignoring the hundreds of billions of investments and debt and the astronomical costs of training and building data centers, sure. This is delusional thinking.
If tokens weren't being sold at a loss, Anthropic would be screaming about it from the rooftops. They've been desparately trying to make themselves not look like a money furnace lately, but it's not really working.
They might be sold at-compute-cost, but that of course ignores training, salaries, and everything else.
But growth stall, because competitors will capture future growth. You need to keep up.
Of course I acknowledge if Hersheys uses AI to make trillions of chocolates a second, there is an upper limit on consumption. And hence a limit on revenue.
> But growth stall, because competitors will capture future growth. You need to keep up.
You need to keep up if you follow Silicon Valley business model growth + funding = IPO. Every other business in the world needs revenue and profit to keep the business open, and growth does not necessarily translates into revenue and profit.
reply