Posts in Business (20 found)
Unsung Yesterday

“In a world of unresponsive 911 calls, it is the 912 that actually works.”

I know I just mentioned the Google Search app, but I’m also in the process of disentangling myself from Google and Gmail after last week’s Google I/O revelations. On that note, this is an interesting, meandering essay by Ernie Smith at Tedium , reflecting on the enshittification of Google and the two-year anniversary of &udm=14 , a simple site that removes AI from Google’s search results: I spent two hours of my life building a thing. Google has probably spent thousands, if not millions, of collective employee hours building all their AI innovations. And for a surprisingly large number of people, the two-hour workaround I built wins out. There’s a lesson in that. Somewhere in the middle, the essay transitions into talking about the value of good tools and single-serving websites: Our world needs more, smaller tools that speak the same language, where everyone makes a little money, but nobody dominates the industry. In the 1980s, the software industry was kind of like this. Oh, sure, Microsoft and Apple were still out front, sucking up all the oxygen. But there were lots of little companies, selling software on disks. The bigger ones put them in boxes in stores. The smaller ones realized that they could just ship software through the mail and let the software spread naturally among user communities. Shareware didn’t really survive the internet era—but, at least for a while, its spirit did. More recently, that spirit has taken a backseat to the larger companies that realize, if they’re big enough, they can shape how we interact with our world. In 1991, if you wanted to start a software company, you had to hope that your product was good enough that word of mouth and a P.O. Box could push it around. That’s exactly what happened when Tim Sweeney released ZZT. It became the starting point for Epic Games, the kind of company that today is big enough that, thanks to its Unreal Engine and the success of Fortnite, it can dictate terms to much of the gaming industry. If you ask me, I want a world where more software is like ZZT than it is like Fortnite, because more people have a chance to succeed in the former environment. Previously in this general category, we covered Keyhole and (Gmail) Simplify . If you have a favourite small tool or a simple tool-like website, I’d love to hear from you! #ai #enshittification #google #toolmaking

0 views
Stratechery 3 days ago

2026.22: Luceing Their Mind

Welcome back to This Week in Stratechery! As a reminder, each week, every Friday, we’re sending out this overview of content in the Stratechery bundle; highlighted links are free for everyone . Additionally, you have complete control over what we send to you. If you don’t want to receive This Week in Stratechery emails (there is no podcast), please uncheck the box in your delivery settings . On that note, here were a few of our favorites this week. This week’s Stratechery video is on The Inference Shift . Why Everyone Hates Luce. To say that the Jony Ive-designed Ferrari Luce, the iconic carmaker’s first electric vehicle, has faced a chilly reception is an understatement. I actually think it looks great —  for an electric car . On Dithering , John and I discuss why the real problem is that it’s branded Ferrari, and on Sharp Tech I get even more philosophical: electric cars are focused first and foremost on efficiency, and not only is that different than performance, Ferrari’s calling card, but also representative of the parts of modern society — including tech — that leave everyone feeling increasingly alienated (and why, surprisingly, AI might help). — Ben Thompson How to Monetize AI Answers. The ad business is, for me at least, endlessly fascinating, and not just because it is the most important business model in consumer tech: I think digital ads, particularly Meta-style ads that introduce you to things you never knew you wanted, a societal good. The other reason to care about ads, however, is that their economic importance means they are where the impacts of new technology are often felt first. This week’s Interview with Eric Seufert covers all this: how LLMs are changing digital ads, the changes both Google and OpenAI have made in terms of monetizing AI, and, more philosophically, why believing in ads might make one more optimistic about humanity in an AI-denominated future. — BT Social Mobility in China, and Lack Thereof.  Late last week China’s State Council announced a reform that will ease so-called “hukou restrictions” and allow migrant workers from all over the country to access social services in the cities where they work, which had long been forbidden. It’s a major reform that furthers Xi’s goal to unify the national market, and should improve the lives of millions of workers, but it also comes with plenty of questions as it’s implemented. We discussed all of it on a great episode of Sharp China this week , as well as reports that top Chinese talent in AI has been banned from leaving the country, continued capital control, and ongoing tensions with Japan and the U.S. that call to mind an ominous passage from Mao Zedong.  — AS Nvidia Earnings, The AI Stack, Nvidia’s New Reporting — Nvidia is changing its reporting to delineate between hyperscaler sales — where Nvidia is fighting commoditization — and everyone else, where Nvidia runs the whole stack. The SpaceX IPO and Data Centers in Space — There isn’t a financial model that justifies the SpaceX IPO, but data centers in space are plausible, and that might be enough. An Interview with Eric Seufert About Models and Ads, and AI’s Upside for Humanity — An Interview with Eric Seufert about building models for generative AI, why Meta’s foundational models are so important, and why understanding advertising leads to optimism about humanity’s future. How Spencer Pratt Happens — Spencer Pratt’s success in L.A. reflects his own surprising political talent, and an increasingly broken Democratic machine in California and beyond. Acquired the Podcast The Ferrari Luce How Things Fell Apart for Germany’s Nixdorf Computer Japan’s Rare Earths Island Social Mobility and Hukou Reform; US Halts Taiwan Arms Sales?; Ongoing Pressure on Japan; An American Xinhua Journalist Arrested The Knicks are in the NBA Finals, A Moment of SGA Truth, Around the League with Giannis, Bulls, and the Basketball Gods SpaceX Hype and the Elon Bargain, Nvidia and the Neoclouds, Q&A on Dropbox, Google, Ferrari Luce Backlash

0 views

Premium: What If...We're In An AI Bubble? (Part 3)

Last week I ran the second part of my three-part “What If…We’re In An AI Bubble?” series where I have been covering the scenarios that I believe could lead to the bubble popping. Here’s what I’ve discussed so far: Today I want to start with a very simple rundown of what has to happen for the AI bubble to make sense. These are all points that are rooted entirely in the projections and sales of the companies in question.  As NVIDIA intends to sell over a trillion dollars of Blackwell and Vera Rubin GPUs by the end of 2027 , it needs to have around (assuming a PUE of 1.35) 40GW of data center capacity built to support the 30GW+ of GPUs it will have sold .  With that compute being sold at around $12 million a megawatt (based on discussions with analysts and sources), that means that there must be around $435 billion in global annual compute demand to substantiate the amount of GPUs sold.  Outside of OpenAI and Anthropic, there doesn’t appear to be more than a few billion dollars of demand . Another concerning sign is that NVIDIA has had to agree to spend $30 billion in multi-year cloud compute agreements across the very partners it’s selling GPUs to ( per page 16 of its most-recent 10-Q ): The other problem is that data centers are taking way, way too long to finish , taking upwards of 24 months even for smaller 40MW builds.  This means that… Put another way, NVIDIA’s continued growth relies on people’s belief that A) these data centers get built and B) that they’ll actually make money.  Per COO Greg Brockman, OpenAI will spend around $50 billion on compute in 2026 , and I imagine Anthropic will spend in or around the same amount, especially as it’s now agreed to spend $15 billion a year on Musk’s Colossus data centers on top of whatever it spends on Google Cloud, Microsoft Azure and Amazon Web Services.  $100 billion is nowhere near enough to justify the compute being built. And while Anthropic and OpenAI have made more than $1.1 trillion in compute commitments in the next 3-5 years across Microsoft, Google, Amazon, Oracle, CoreWeave, Cerebras, Terawulf, and Cipher Mining, there’s so much more compute that needs to be sold on top of that.  Even if both doubled their spend in a year, we’d still need at least another two Anthropic or OpenAI-sized compute customers — either in aggregate or as separate companies — at a time when I can’t find a single other company spending even a hundred million dollars a year on compute. Most AI startups (and customers) want to pay Anthropic or OpenAI directly to access their models , which means that either Anthropic and OpenAI need to use roughly twice the amount of compute they do today and then some to meet the capacity being built. This will require them to do something either historic or impossible. This is not hyperbole! OpenAI, per The Information , plans to burn $852 billion through the end of 2030. Anthropic has, per The Information, agreed to spend $330 billion on compute on Microsoft, Google, and Amazon , at least another $30 billion on compute with CoreWeave , and another $63 billion in TPUs bought from Broadcom .  To reach this point, Anthropic projects it will hit $174 billion in annual revenue by the end of 2029, and OpenAI $284 billion . Both have made ridiculous claims of profitability ( with Anthropic actively conning investors with a “profitable” quarter based on discounted bills ) in the next few years that are immaterial to the larger point that they need actual, real cash to meet their obligations.  This is, again, not hyperbole. If we assume that the services in question are profitable, sustainable businesses, then revenues attached to AI services must exceed those driven by AI compute by a reasonable margin. It isn’t enough for us to have a few AI companies that spend a lot more on compute than they take in revenue, because at some point venture capital subsidies will run dry.  This isn’t happening. Putting aside the profitability part for a second, OpenAI and Anthropic account for 89% of all AI startup revenues , with the nearest competitor being Cursor with its pathetic $3 billion in annualized revenue . These are rookie numbers. They are insufficient. We need so much more than this. Again, not hyperbole! These are OpenAI and Anthropic’s own revenue projections — $184 billion and $174 billion respectively — that they expect to hit by the end of 2029. These are the same projections that have been used to make their $1.1 trillion in compute commitments, much of which make up 50% of Google, Amazon, and Microsoft’s remaining performance obligations : These commitments reflect expected revenue and demand for OpenAI and Anthropic’s services, but they’re commitments, which means that they need to be paid even if that demand doesn’t exist.  This is a huge problem for these companies. If they buy too much compute and don’t have the demand and revenue to support it, they’ll go bankrupt.  To be clear, that’s not my opinion, it’s what Anthropic CEO Dario Amodei said to Dwarkesh Patel in February, emphasis mine: That is not good! As I’ve covered before , buying compute is a knife-catching game where you have to guess how much you need for a particular year, and if you guess correctly you don’t lose as much money but if you guess wrong you run out of money.  It should be far more worrying to executives that the single-largest AI company is basically saying that if he mistimes growth his company explodes! Per Business Insider , Uber COO Andrew Macdonald said this weekend that it was becoming “harder to justify AI costs within the company”: Anthropic’s meteoric revenue growth has come from both AI startups burning more tokens ( as Opus 4.7 appears to burn more than ever ) and large organizations doing some form of “token-maxxing,” meaning that they tell their employees to use AI as much as they want, usually with KPIs that specifically track AI usage, as is the case at Meta , Amazon, and Zillow . Even organizations that aren’t actively incentivizing their engineers to burn more tokens are finding they’re blowing through their budgets at record speed. The situation with Uber’s COO was caused by his CTO saying back in April that the company had burned through its entire annual token budget in four months. Similarly, my reporting on Zillow’s AI spend showed that it will likely max out its annual Cursor budget by the end of May. The problem, as Macdonald said, is that nobody can seem to track all of this spend to an actual return on investment. This isn’t a situation where somebody is saying “the ROI is low but improving” or “we’re on the path to working that out,” but “it’s very hard to actually draw a line between “what we’ve spent” and “a reason we’re spending it.” This makes it hard for Uber to say how much it should reduce its token budgets. If you can’t measure the return on investment, how do you measure how much you’re meant to spend? What is “enough”? Because right now it’s clear that whatever they’re spending is too much , which means that there’s a ceiling to Anthropic and OpenAI’s revenue story.  OpenAI and especially Anthropic cannot afford for this conversation to be happening, because it suggests there’s a ceiling to the amount that people will spend on AI. It appears there’s a limit to which organizations can be abused and manipulated into believing that “the future is here,” and that limit is when they pay millions for something that doesn’t appear to have a measurable return on investment.  Anthropic and OpenAI need organizations to willingly spend 10% to 100% of their headcount on AI, as their revenue projections are clearly tied to every organization maintaining a significant spend on tokens in perpetuity.  There’re really two problems: This is budgetary poison. Right now, the vast majority of AI token spend is experimental , and if companies are already hesitating at the amounts they’re spending, Anthropic has no way to keep growing, and they also have no super secret models or harnesses or products that are going to reverse this trend. Nobody knows why they’re spending so much money or even how much money they might spend in a given month , which makes it tough to view Anthropic’s ( suspicious ) revenue growth as anything but a chaotic money-dump driven by CEOs that don’t know what their companies actually do and have been beguiled by the AI grift machine . And as I wrote up last week , OpenAI had a negative 122% operating margin in Q1 2026, and ChatGPT growth has stalled. It is unclear what its API revenue is, but it’s likely much less than Anthropic despite shoving its enterprise customers onto token-based billing not long after they did. As I’ve said: this cannot happen, and neither Anthropic nor OpenAI can afford to slow down. Their revenues must grow to over $100 billion by 2028, as their compute commitments demand it. Their growth must continue.  It’s been a little under four years of endless confidence about the inevitable growth of generative AI, and by extension the eternal success and growth of OpenAI. Yet in reality, its economics have only ever soured, and its growth appears to be collapsing.  In October 2024, The Information reported that OpenAI believed it would turn profitable in 2029, that its total losses between 2023 and 2028 would be $44 billion , and that its (non-GAAP, every one of these numbers is non-GAAP) gross margin would be 41% in 2024, though it would end up being a point lower at 40% in the end. OpenAI would then project a gross margin of 49% for 2025… but it ended up at 33% anyway .  OpenAI would also say on September 5 2025 that it would actually burn $115 billion through 2029 , but that “burn” assumed that it would have revenues of $60 billion in 2027, $100 billion in 2028, $145 billion in 2029, and $200 billion in 2030, when it would “become profitable” in some undiscussed manner. Two weeks later on September 19 2025, The Information would report that actually OpenAI would spend “about $450 billion to rent servers through 2030,” but not otherwise update the burn-rate. On November 4, 2025 , OpenAI CEO Sam Altman would say that the company had hit $20 billion in ARR and had made $1.4 trillion in commitments “over the next 8 years,” and a few months later On February 20, 2026 , OpenAI would claim that it had targeted “around $600 billion in compute commitments by 2030.” The very same day, The Information would report that it planned to spend $665 billion on compute through 2030 , that it missed gross margin projections (without sharing what those margins might be), and that ChatGPT had hit 910 million weekly active users that month, 90 million short of its goal of 1 billion by the end of 2025. It’s very obvious by now that OpenAI has been making up all of its projections, and that none of the numbers actually add up. My own reporting from November 2025 from actual Azure personnel suggests that OpenAI’s Q1 to Q3 revenues were billions lower than every other reported figure, and I think it’s likely that OpenAI is overstating its revenues.  In any case, on May 22, 2026 , The Information would report that OpenAI’s Q1 2026 operating margin was negative 122%, and that its Q1 average weekly active users (WAUs) sat at 905 million — suggesting that growth has stalled. OpenAI had anticipated that it would cross the one billion WAU mark by the end of 2025 — and it blamed its failure to do so on fiercer competition, primarily from Google’s Gemini. For OpenAI to afford its compute commitments, it has to make or raise $852 billion in the next four years. It must have that cashflow, or it will run out of money or be sued out of existence by its numerous counterparties from CoreWeave, Microsoft, Amazon, and Cerebras. In the final part, I’m going to get into the depths of destruction — the unraveling of the greater data center debt industry, the massive damage to private credit to come, potential shareholder lawsuits against NVIDIA, and the consequences of the deaths of OpenAI and Anthropic. What If…We’re in an AI Bubble? I also want to add that I realize three headlines didn’t make the cut — what if there’s not a bailout, what if I’m wrong, and what if I’m right — and I intend to cover all three of them in future free newsletters.  Nevertheless, today’s is an absolute beast, a 16,000 word conclusion to the first multi-part Where’s Your Ed At Premium.  What If The AI Industry Moves To Entirely Token-Based Billing?  What If Organizations Can’t Afford To Keep Spending On AI? What If The AI Capacity Crunch Never Ends (And Data Centers Aren’t Getting Built)? What If CoreWeave Can’t Keep Up With Its Capacity Demands? What If Hyperscalers Can’t Build Data Centers Very Fast? What If Hyperscalers Have Warehouses of Uninstalled GPUs? What If Hyperscalers Write Off A Large Chunk of GPUs? What If Data Center Construction Demand Collapses?  What If Venture Capital Funding Stops Flowing To AI Startups? What Would Make Venture Capital Stop Funding AI Startups? What If Most AI Startups Go To Zero? Scenario: OpenAI and Anthropic Go Full FTX, Scooping Up Dying AI Startups To Keep The Industry Afloat With Circular Financing Scenario: Venture Capital’s Post-AI Depression What If Inference Isn’t Profitable? AI Has Become An Existential Reckoning For The Valley NVIDIA’s customers are taking years to even begin making back the billions of dollars its chips and the associated construction costs. NVIDIA is selling far more GPUs every quarter than can realistically be installed in the space of a year. NVIDIA’s revenue stream is entirely based on organizations forecasting demand years into the future. NVIDIA’s revenues are, by extension, dependent on how long organizations believe that building data centers is a good idea. NVIDIA is absolutely, without a doubt, warehousing at least a million Blackwell GPUs . It’s difficult-to-impossible to actually measure the ROI of AI spend. It’s difficult-to-impossible to actually know how much it’ll cost to complete a specific task with AI. What if data center debt stops being issued? What if private credit had to write off most of its data center loans? What if the AI bubble blows up Taiwan’s ODM server manufacturers? What if NVIDIA is misrepresenting how many GPUs are shipped, sold and operational? What if OpenAI and Anthropic don’t go public? What if Oracle doesn’t get paid by OpenAI? What If OpenAI Dies? What if Anthropic Dies?

0 views
Jeff Geerling 3 days ago

It's hard to justify buying a Framework 12

My nephew just graduated high school, and wants a laptop. When he decides what computer to buy, price (or more precisely, value ) is the most important attribute. Apple's MacBook Neo upended the 'value laptop' equation—Apple's not supposed to be both the cheapest option and the best value... but it seems like that's squarely where the Neo landed for the good-but-cheap laptop category. My nephew is also my godson, and to kick off his computing journey, I thought I'd let him choose from a Framework 12 I bought to test, or the MacBook Neo I bought a couple months ago to use around the studio.

0 views

What's going on with Gemini?

Google is in a strange spot right now. They've got arguably the deepest research bench in the industry, their own custom silicon, and effectively unlimited money - and yet most developers I talk to barely touch Gemini day-to-day. The recent Google I/O announcements crystallised a lot of what I find confusing about their AI strategy, so I wanted to write down where I think they actually stand. The consensus seems to be that currently Anthropic and OpenAI are very much in the lead for frontier model intelligence, with each of those two labs trading blows every month. This may change in the near future - if Anthropic releases Mythos-class models that OpenAI doesn't have an answer to - but right now I think most practitioners would agree that GPT5.5 and Opus 4.8 are roughly in the same ballpark. After that, you have Google, with Gemini 3.1 Pro being in benchmarks ahead of the Chinese models but behind the flagship Anthropic/OpenAI models. In my personal experience though I've had better results from the best-in-class Chinese models (GLM 5.1 and Qwen 3.7) than Gemini 3.1 Pro at software engineering tasks. The main model announcement at Google I/O was Gemini 3.5 Flash. The benchmarks of it were underwhelming at coding: Gemini 3.5 Flash on the Artificial Analysis Coding Index - solidly mid-pack. Source: Artificial Analysis . However, the model is super fast - roughly 4x faster in tokens per second than the aforementioned Anthropic/OpenAI models: Output tokens per second - Gemini 3.5 Flash at 206 t/s, far ahead of Opus 4.8 and GPT-5.5. Source: Artificial Analysis . This definitely is really interesting development, especially for user facing applications which can appear very sluggish to users. But - the big but - is the huge price increase they announced - 3x more expensive than the previous flash release. At $9/MTok it is vastly more expensive than the best in class Chinese models, and I'm struggling to see where this fits - if you want best in class intelligence you pay the extra for Opus/GPT5.5, if you want cheap but not-as-clever the Chinese models fit the bill well. The risks around Chinese models are somewhat overplayed in my opinion - you can self-host a lot of them, or use US-based inference providers via OpenRouter. Having said all that, perhaps really this model isn't designed for external use in the same way that the OpenAI/Anthropic models are. Clearly Google consumes an enormous amount of tokens internally - for all their products like AI mode, Gmail, etc. If you look at it that way, the model makes far more sense. The speed of the model really matters for a lot of the Google use cases - AI mode is very user driven and Google knows better than anyone that speed really matters. And the actual serving cost Google pays is almost certainly a fraction of the external facing price, so that becomes irrelevant. The most interesting part of this story though, is this excellent comment on Hacker News from someone that estimated the size of the model and the fact that it should run on one TPU 8i card (Google's latest custom inference hardware). This does give Google a huge advantage. They are the only frontier lab that (currently) designs its own AI hardware. While other labs certainly optimise their models to the hardware, and also no doubt have a lot of say in driving the Nvidia/AMD roadmaps to their specifications, the model teams and hardware teams in Google almost certainly collaborate to a far greater level than the other labs. This really matters. If you have a very good steer on upcoming hardware you know the right size of models to target training runs to aim for. And equally, research from Google Deepmind can go straight into the hardware roadmap without any negotiations. [1] It'll be very interesting to see how this continues to develop. Inference efficiency will be the key driver to actual unit economics in AI, and Google may develop an outsized lead in this. The one real weakness I think Google has though, is their confusing and incoherent strategy on coding agents. While Anthropic has Claude Code, and OpenAI has Codex, in true Google style they have ended up with a smorgasbord of tools. There is currently Antigravity, Jules, Gemini Code Assist, Gemini CLI and AI Studio all doing slightly different things. This doesn't include some other agentic SWE tools they have for specialised purposes (like Android Studio). They announced that Gemini CLI is being discontinued and folded into Antigravity, but I very rarely come across any developer using Google-based SWE tooling. This is a huge issue for Google - there is no doubt that Claude Code and Codex is producing a lot of very detailed telemetry and training data that can be used to improve further models. Without this being resolved, Google does have an extreme weakness in the fastest growing - at least revenue-wise - segment of AI. While I definitely wouldn't write Google off - they do have enormous structural advantages in other areas - I get the feeling that because Google has such a bespoke internal software development workflows [2] their isolation from what "the rest of the industry" does in software is so large it's perhaps hard for them to really reason about agentic tooling for the rest of the industry. My read is that Google is playing a genuinely different game to OpenAI and Anthropic. Gemini 3.5 Flash only looks strange if you assume it's meant to win the same race - priced and tuned for Google's own gigantic internal token consumption, with the TPU advantage baked in, it makes complete sense. Where they're actually behind is the developer-facing surface: a confused tangle of coding tools and an org that struggles to reason about how the rest of us build software. If Google sorts out the agent story, the structural advantages underneath - the silicon, the research, the integration - could make them very hard to beat. That's a big if. But I wouldn't bet against them. While it's hard to say if there was any truth in this - or it was just a negotiation strategy - there were rumours of OpenAI being unhappy with direction/progress Nvidia was making earlier this year: https://finance.yahoo.com/news/sam-altman-pushes-back-report-213000823.html ↩︎ Google engineers have an enormous amount of home built/custom/internal tooling that is uncommon outside of Google-scale companies. They use different source control, build tooling, testing infrastructure and build deployment to the rest of the industry - for very good reasons! But this stack is absolutely overkill for 99% of companies, and when you are used to thinking about SWE at Google scale I suspect it is very difficult to reason how people build software outside of that ecosystem. ↩︎ While it's hard to say if there was any truth in this - or it was just a negotiation strategy - there were rumours of OpenAI being unhappy with direction/progress Nvidia was making earlier this year: https://finance.yahoo.com/news/sam-altman-pushes-back-report-213000823.html ↩︎ Google engineers have an enormous amount of home built/custom/internal tooling that is uncommon outside of Google-scale companies. They use different source control, build tooling, testing infrastructure and build deployment to the rest of the industry - for very good reasons! But this stack is absolutely overkill for 99% of companies, and when you are used to thinking about SWE at Google scale I suspect it is very difficult to reason how people build software outside of that ecosystem. ↩︎

0 views
Unsung 5 days ago

“The pipeline of future experts is thinning from both ends.”

I generally avoid think pieces about AI because a) a lot of them are boring, and b) they rarely match the pragmatic posture of this blog. But this essay on a new No One’s Happy blog was really interesting to read, and feels different in a few ways. First, it examines what happens as AI slop spreads in the context that is less discussed – in a workplace: This is a new form of slop, and it is more expensive than the public kind, because the people producing it are being paid a salary to do so. […] The cost of producing a document has fallen to nearly zero; the cost of reading one has not, and is in fact rising, because the reader must now sift the synthetic context for whatever the document was originally about. A lot in the essay feels pertinent to Unsung as real craft is not feelings or fluffiness. Real craft is deep expertise : Generative AI can produce work that looks expert without being expert, and the failure arrives in two shapes. The first is when novices in a field are able to produce work that resembles what their seniors produce, faster or more advanced than their judgment. The second is when people generate artifacts in disciplines they were never trained in. The two failures look similar from a distance and are not the same. Research has mostly measured the first. The second is what it is missing, and in my experience it is the riskier of the two. The term for this new challenge is, apparently, “output-competence decoupling.” Other parts of the essay come back to a topic – toxic velocity – we covered before : The current generation of agentic systems is built around the premise that the human is the bottleneck — that the loop runs faster and cleaner without the awkward delay of someone reading what is about to happen and deciding whether it should. This is, in a great many cases, exactly backwards. The human in the loop is not a vestige of an earlier era; the human is the only part of the loop with skin in the game. Removing the H from HITL [Human In The Loop – eds. note] is not an efficiency. It is the abandonment of the only mechanism the system has for catching itself. And one last thing that differentiates this essay from many others is the last “what to do about it” section. #ai #craft

0 views
Simon Willison 5 days ago

I think Anthropic and OpenAI have found product-market fit

Anthropic are strongly rumored to be about to have their first profitable quarter. Stories are circulating of companies surprised at how expensive their LLM bills are becoming from usage by their staff. I think this is because OpenAI and Anthropic have both found product-market fit. I currently subscribe to the $100/month Max plan from Anthropic and the $100/month Pro plan from OpenAI. If you are a heavy user of coding agents these plans are a fantastic deal. I just ran the ccusage tool on my laptop to get an estimate of how much I would have spent if I were to pay for API tokens in the past 30 days and got: That's $2,180.16 worth of tokens for $200 - not bad at all! I'm a moderately heavy user of these tools, but I'm certainly not running agents every hour of the day and night. I had assumed that companies making extensive use of agents were getting similar discounts. It turns out I could not have been more wrong about that. I haven't been able to track down the exact date, but at some point in the last six months Anthropic switched their Enterprise plan (originally "Claude seats include enough usage for a typical workday" back in August 2025 ) to $20/seat/month plus API pricing for usage. This story about the change from The Information is dated Apr 14, 2026, but cites an Anthropic spokesperson claiming that the pricing change occurred in November 2025. Existing customers are finding out about the change as they renew their contracts. OpenAI made a similar pricing change in April. The Codex rate card ( Internet Archive copy ) currently says: Note : On April 2, 2026, we updated Codex pricing to align with API token usage, instead of per-message pricing. This change was applicable to new and existing Plus, Pro, ChatGPT Business and new ChatGPT Enterprise plans. On April 23, 2026, we made this update for all existing ChatGPT Enterprise plans as well, inclusive of Edu, Health, Gov, and ChatGPT for Teachers. It's a little harder to decode as they quote prices in "credits", but as far as I can tell those credit costs are an exact match for the API token costs listed for those models. All of which is to say that as of April 2026 the "Enterprise" cost for both OpenAI Codex and Anthropic Claude Code/Cowork is the same as the listed API price. GPT-5.5 (released April 23rd) is 2x the API price of GPT-5.4. Opus 4.7 (April 16th) is around 1.4x the price of Opus 4.6 when you take their new tokenizer into account. So April saw both leading model companies release new frontier models with a higher API price, and both companies now have measures to lock their enterprise customers (who tend to sign year-long deals) at those API prices, not the previous extreme discounts. Why these sudden aggressive moves on pricing? Both Anthropic and OpenAI are planning to IPO, but I suspect there's a more important factor here: I think they've finally found product-market fit, with the coding/general-purpose agent products embodied by Claude Code/Cowork and Codex. Tools like ChatGPT are wildly popular, but that wild popularity has been difficult to turn into revenue. In February OpenAI boasted more than 900 million weekly active users for ChatGPT, but only 50 million - 5.6% of that - were paying consumer subscribers. Charging $10-$20/month per user is an OK business, but you'd need 1-2 billion subscribers sticking around for four years to cover $1 trillion in infrastructure . Companies spending $200+/month/user will get you there a whole lot faster - and as noted above, as a power-user I'm at ~$1,000/month in API costs per vendor already. Coding agents really did change everything. These are tools which burn vastly more tokens, but are also quickly becoming daily drivers for the work carried out by extremely well-compensated professionals. Right now that's still mostly software engineers, but a coding agent is a tool that can automate anything you can do by typing commands into a computer... so they are clearly applicable to a much wider set of skilled knowledge workers. As I've discussed on this site at length , the models released in November 2025 elevated agents to being genuinely useful. We've had six months to get used to that idea now - it's no wonder companies are beginning to spend real money on this technology. You could argue that ChatGPT achieved product-market fit when it became the fastest-growing consumer app in history back in February 2023... but it certainly wasn't making any actual money back then. Coding agents plus enterprise pricing marks the point when these companies start making very real revenue. Maybe even enough to start covering their costs! As further evidence that enterprise agents represent product-market fit for these companies, consider their open job listings. OpenAI have 703 open jobs right now, of which I'd categorize 229 (32.6%) as relating to enterprise sales and support - account executives, "Go To Market", "Forward Deployed Engineers" and the like. Anthropic have 390 open jobs , 105 (26.9%) of which look enterprisey to me. It's pleasingly ironic that these AI labs have picked a business model with such a heavy demand on human labor - enterprise sales contracts don't close themselves without a whole lot of humans in the mix! (I ran this analysis by scraping their job sites with Claude Code, then having it use Datasette's JSON API to pipe that data into Datasette Cloud where I used Datasette Agent for the analysis, exported here . Dogfood!) I started digging into this in response to a growing volume of stories claiming that large companies were sounding the alarm because their AI usage costs had grown so large. The most widely cited of these stories appear quite overblown to me. The most discussed has been Uber, based on this report where CTO Praveen Neppalli Naga indicated that Uber had "maxed out its full year AI budget just a few months into 2026", mostly thanks to Claude Code. Given that Claude Code only got really good in November it's entirely unsurprising to me that a budget set in 2025 may have failed to predict demand for that tool in 2026! That Uber story was further fueled by comments made by Uber's COO, Andrew Macdonald, on the Rapid Response podcast. I tracked down the segment and there really isn't much there. Here's what Andrew said: But then you sometimes go and talk to your senior engineering leaders and you're saying, OK, how many projects that were on the cutting room floor got moved above the line because of the productivity gains because 25% of our code commits were via Claude Code last quarter? That link is not there yet, right? I think maybe implicitly there's more that is getting shipped. But it's very hard to draw a line between one of those stats and, OK, now we're actually producing like 25% more useful consumer features, right? And that line is hard to draw. Somehow this fragment turned into headlines like Uber's COO says it's getting harder to justify the money spent on AI tokenmaxxing , because the market for stories about AI failures remains enormous. The other popular story around this is Microsoft starts canceling Claude Code licenses , ostensibly to encourage their engineers to dogfood their own Copilot CLI agent instead - but The Verge reporter Tom Warren says "sources tell me the decision is also a financial one", triggered by the June 30th end of Microsoft's financial year. I think both of these stories support my "product-market fit" hypothesis. The best advice I ever heard on pricing a product was that your customer should suck air through their teeth and then say yes. Uber's budget overrun and Microsoft's seat cancellations look like that effect playing out in practice. The big AI labs spend billions of dollars on both training and inference. Credible figures are hard to come by, but we did get one huge hint as to the figures involved from, oddly enough, the recent SpaceX S-1 : [...] in May 2026, we entered into Cloud Services Agreements with Anthropic PBC (“Anthropic”), an AI research and development public benefit corporation, with respect to access to compute capacity across COLOSSUS and COLOSSUS II . Pursuant to these agreements, the customer has agreed to pay us $1.25 billion per month through May 2029 [...] The Anthropic announcement said that this deal meant they could "increase our usage limits for Claude Code and the Claude API", heavily implying that Colossus is being used for inference, not model training. Anthropic already have vast amounts of compute from other providers. The fact that they're willing to spend $1.25 billion per month for extra capacity from just one of their vendors hints at how big these inference budgets have become. Over the past two years my impression has been that OpenAI made more of their income from subscription revenue while Anthropic made more from their API. Anthropic's API revenue was historically quite dependent on a small number of large API customers - this VentureBeat story from August 2025 quotes "sources familiar with the matter" suggesting that just Cursor and GitHub Copilot were responsible for $1.2 billion of the company's then-$4 billion revenue. Today Anthropic are rumored to hit $10.9 billion in the second quarter , potentially even operating at a profit for the first time. This pivot-to-Enterprise suggests that the labs have realized that the real money lies in cutting out the middlemen. Anthropic's Claude Code directly competes with Cursor and Copilot. No wonder Cursor are investing in their own models ! I've called November 2025 the November inflection point because that was when GPT-5.1 and Opus 4.5, combined with their respective coding agent harnesses, got good - good enough that we've spent the last six months adapting to agent systems that can reliably get useful work done. I think April 2026 is a new inflection point where the revenue implications of this have started to land, to the benefit of the frontier AI labs and with material impacts on the budgets of large companies. We'll know for sure how real this moment is when the S-1 documents for the upcoming Anthropic and OpenAI IPOs give us some real, audited numbers to get our teeth into. You are only seeing the long-form articles from my blog. Subscribe to /atom/everything/ to get all of my posts, or take a look at my other subscription options . Enterprise customers are now paying API prices I think they've found product-market fit And they're ramping up The AI-failure stories around this are pretty thin We also know the labs are spending a lot API revenue is becoming less important April is a new inflection point $1,199.79 for Anthropic Claude Code $980.37 for OpenAI Codex

0 views
Stratechery 5 days ago

The SpaceX IPO and Data Centers in Space

Listen to this post : It’s hardly the biggest problem in the world — or perhaps the height of privilege to consider it a problem at all — but one of the most annoying consumer experiences is booking an Uber Black and realizing you got assigned a Tesla Model Y (Uber finally stopped allowing new Model Y’s onto Black last year ). Buckle up for an uncomfortable back seat, basic plastic finishes, and, all-too-often, potential car sickness from a driver who hasn’t completely mastered the Tesla’s aggressive regenerative braking. Still, the fact that the Model Y ever made it to the Black level is a testament to the brand Elon Musk built. Back in 2016, when 300,000 people dropped $1,000 each in a matter of hours to reserve an as-yet-unreleased Model 3, I explained that the phenomenon was because It’s a Tesla : The real payoff of Musk’s “Master Plan” is the fact that Tesla means something: yes, it stands for sustainability and caring for the environment, but more important is that Tesla also means amazing performance and Silicon Valley cool. To be sure, Tesla’s focus on the high end has helped them move down the cost curve, but it was Musk’s insistence on making “An electric car without compromises” that ultimately led to 276,000 people reserving a Model 3, many without even seeing the car: after all, it’s a Tesla. This is the same brand halo that landed what is, if we’re honest, a pretty basic car on the Uber Black list. What actually makes these cars compelling is the extent to which they are computers on wheels: I know plenty of very rich people who drive a Tesla not for the finishes but rather the Full Self-Driving (Supervised); there is nothing like it on the market, at least when it comes to cars you can own. Tesla appears to be doubling down on this point of differentiation: the company stopped production of the Models S and X earlier this year, focusing production resources on the CyberCab and robots; if you want your car to drive itself, you’ll get the same model as everyone else. It reminds me of Andy Warhol’s famous quote : What’s great about this country is that America started the tradition where the richest consumers buy essentially the same things as the poorest. You can be watching TV and see Coca-Cola, and you know that the President drinks Coke, Liz Taylor drinks Coke, and just think, you can drink Coke, too. A Coke is a Coke and no amount of money can get you a better Coke than the one the bum on the corner is drinking. All the Cokes are the same and all the Cokes are good. Liz Taylor knows it, the President knows it, the bum knows it, and you know it. That “tradition” is scale, and America is indeed better at it than any other country in the world; and, amongst Americans, no one pursues and seeks to leverage scale quite like Musk. From a press release from American Airlines: American Airlines today announced a sweeping modernization of its narrowbody inflight customer experience with the installation of Starlink, the fastest Wi-Fi in the sky, on more than 500 narrowbody aircraft beginning in Q1 2027. Starlink is widely regarded as the world’s most advanced satellite constellation using a low Earth orbit to deliver broadband Internet capable of supporting inflight streaming, online gaming, collaborative meeting tools and more. With thousands of satellites in low Earth orbit, Starlink can deliver multigigabit connectivity to aircraft using its Aero Terminal, which can support up to 1 Gbps per antenna. “As a premium global airline, we are continuously seeking out world-class partners like Starlink to deliver what our customers need and want,” said American Airlines Chief Customer Officer Heather Garboden. “The addition of Starlink solidifies American as a leading airline in keeping passengers connected in flight.” As part of American’s commitment to an elevated onboard experience, Starlink will enable seamless streaming, browsing and real-time communication capabilities across American’s domestic and short-haul international routes. I linked to the press release just for the amusement of American Airlines, which has in recent years built its strategy around offering anything-but-premium on routes you need, billing their Starlink deal as a commitment to “an elevated onboard experience.” That may have been the argument for United’s Starlink deal when it was announced in 2024 , but by this point it’s tablestakes , which is surely exactly how Musk wants it. Starlink is the consumer-facing business of SpaceX, generating $8.7 billion in revenue last year and $4.4 billion in profit; while it’s not totally clear exactly how SpaceX accounts for launch costs, obviously Starlink benefits greatly from the fact that it has access to SpaceX’s launch capacity. That launch capacity has resulted in over ten thousand active satellites in low Earth orbit, delivering low latency high speed Internet anywhere in the world — including in the air. That’s the carrot for airlines; the stick is the prospect of everyone else having the same service, and customers making flight decisions based on the quality of Internet access available. There is a similarity to Tesla in this way. Musk companies at their best don’t win the game; they change the rules through scale, such that billionaires buy economy cars because they actually drive themselves (with supervision), and airlines transform the consumer experience on their own dime. Musk makes all-in bets — whether that be in terms of launch capacity or in autonomous driving — not by making rational short-term business decisions, but by starting with the desired end state and working backwards. Tech has a long history of silly charts — there is an entire category known as Bezos charts — and the SpaceX S-1 has one that made me laugh. It came in the discussion of SpaceX’s total addressable market: We believe we have identified the largest actionable total addressable market (“TAM”) in human history. We estimate that our quantifiable TAM is $28.5 trillion, consisting of $370 billion in Space from space-enabled solutions; $1.6 trillion in Connectivity across $870 billion in Starlink Broadband and $740 billion in Starlink Mobile as well as additional opportunities in enterprise and government; $26.5 trillion in AI across $2.4 trillion in AI infrastructure, $760 billion in consumer subscriptions, $600 billion in digital advertising, and $22.7 trillion in enterprise applications. For illustrative purposes of sizing our addressable market opportunity, we exclude China and Russia from our global estimates. This image is approximately to scale vertically, but certainly not horizontally: I could use the help in really wrapping my mind around the $26.5 trillion AI opportunity, given it’s more than 13 times the space and connectivity opportunity combined! In all seriousness, the numbers are obviously absurd, but then again, everything about this IPO is absurd. SpaceX is seeking a $2 trillion valuation on a mere $18.67 billion in revenue with $4.9 billion in losses last year, and growth actually slowed from 35% to 33%. That slowdown happened despite the addition of xAI (and thus also X), which tipped the company from a small profit to that massive loss, thanks to $5.1 billion in AI R&D expense. That R&D, keep in mind, went towards building a model that is in 5th place, and whose entire founding team recently left the company. But sure, $26.5 trillion AI opportunity! This is not to say that SpaceX won’t get its desired valuation. Tesla’s valuation never made any sense right up until the Models 3 and Y actually worked out, causing Tesla’s share price to soar (and even then it was hard to ever build a financial model that justified the new share price). Musk’s ability to make his own reality starts with investors; from 2021’s Mistakes and Memes and comparing Apple and Tesla: This comparison works as far as it goes, but it doesn’t tell the entire story: after all, Apple’s brand was derived from decades building products, which had made it the most profitable company in the world. Tesla, meanwhile, always seemed to be weeks from going bankrupt, at least until it issued ever more stock, strengthening the conviction of Tesla skeptics and shorts. That, though, was the crazy thing: you would think that issuing stock would lead to Tesla’s stock price slumping; after all, existing shares were being diluted. Time after time, though, Tesla announcements about stock issuances would lead to the stock going up. It didn’t make any sense, at least if you thought about the stock as representing a company. It turned out, though, that TSLA was itself a meme, one about a car company, but also sustainability, and most of all, about Elon Musk himself. Issuing more stock was not diluting existing shareholders; it was extending the opportunity to propagate the TSLA meme to that many more people, and while Musk’s haters multiplied, so did his fans. The Internet, after all, is about abundance, not scarcity. The end result is that instead of infrastructure leading to a movement, a movement, via the stock market, funded the building out of infrastructure. I explained in that Article why I generally did not cover Tesla’s financial results, and the reasoning extends to why I don’t expect to cover SpaceX’s: Musk is the master of memes, and is himself a meme. He offers a dream — Mars, fully autonomous vehicles, an addressable market of $28.5 trillion — and positions his companies and their stock as access to that dream, and through the alchemy of capital markets, transforms shared delusion into mass market reality. Musk’s track record matters in this regard. Building an electric car company was possible, as was full self-driving (supervised); at the same time there were ever increasing government mandates and programs around decreasing emissions that acted as the stick to Tesla’s carrot. Similarly, landing rockets was possible, and the new market creation downstream from correspondingly lower launch costs was comprehensible. That Musk succeeded in both instances gives him the benefit of the doubt. The question that matters, then, is not if the numbers make sense right now (they absolutely do not); what matters is if the dream is even possible, and if there are actual reasons to think it might happen. I think that data centers in space meet these conditions. The first question about data centers in space is if they are even possible, and I think the answer is clearly yes. The key thing to consider is that there is no requirement that these data centers look anything like data centers on earth. On earth we build massive buildings full of GPUs with massive infrastructure for cooling those GPUs and massive power plants (or a connection to a grid which connects to massive power plants) to power those GPUs. The idea of transporting these massive structures to space sounds implausible, and it is! However, there is no reason that space data centers would look like data centers on earth. What makes far more sense is to think about an individual satellite as something akin to a rack. Right now the largest Starlink satellite in orbit is the V2 Mini Direct-to-Cell, which measures 7.4 meters by 2.7 meters by 0.3 meters (estimated); an NVL72 rack from Nvidia, meanwhile, measures 2.2 meters by 1.1 meters by 0.6 meters, so we’re already in the right size range. The V2 Mini Direct-to-Cell consumes (and dissipates) up to an estimated 25kW of energy; the NVL72 up to 135kW, and it can fit a 1 trillion parameter model quantized to FP4. The big shortcoming for a rack-satellite is power and its dissipation, but going from 25kW to 135kW is certainly within the realm of possibility — and given that you don’t need much of the cooling and power distribution usage on earth, something closer to 100kW might deliver similar performance. There are other issues to address, including the problem of radiation screwing with calculations, reliability, etc., although those two concerns could be addressed in part by using larger chips (which are less efficient, but also use less power); these rack-satellites will also be disposable, like Starlink satellites, ameliorating reliability issues. The key factor, however, is that a fleet of racks, interconnected with lasers (as Starlink’s already are), each with their own solar panels and radiator arrays for cooling (deploying 200+ square meters of radiators per rack will be a huge challenge), is possible . The next question about data centers in space is if there is a use case for them — the carrot — and I already made the argument that there is in The Inference Shift . Specifically, there are three types of workloads developing around LLMs: training, answer inference, and agentic inference. From the section making the case for “agentic inference”: Critically, this articulation of an agentic-specific memory hierarchy implies a necessary trade-off of speed for capacity. Here’s the thing, though: lower speed isn’t nearly as important a consideration if there isn’t a human in the loop. If an agent is waiting around for a job that is being run overnight, the agent doesn’t know or care about the user experience impact; what is most important is being able to accomplish a task, and if entirely new approaches to memory make that possible, then delays are fine. If delays are fine, then all of the focus on pure compute power and high-bandwidth memory seems out of place: if latency isn’t the top priority, then slower and cheaper memory — like traditional DRAM, for example — makes a lot more sense. And if the entire system is mostly waiting on memory, then chips don’t need to be as fast as the cutting edge either. This represents a profound shift in future architectures, but it also doesn’t mean that current architectures are going away: At the same time, these categories won’t be equal in size or importance. Specifically, agentic inference will be the largest market by far, because that is the market that won’t be limited by humans or time. Today’s agents are fancy answer inference; in the future true agentic inference will be work done by computers according to dictates given by other computers, and the market size scales not with humans but with compute. It’s agentic inference that makes the most sense for racks in space, and conveniently enough, that is also the market that is likely to be the largest in the long run. The third question about data centers in space is if there is a stick. Specifically, while I think that racks-in-space are both a lot more viable than people think, and a lot more relevant to agentic inference than current modes of compute, it is at the end of the day cheaper and easier to build on earth, all things being equal. All things are not equal, however: right now we are at the very beginning of the AI buildout and already one of the biggest constraints is not just power (expected), but zoning (unexpected). I wrote in an Update last week : That leads to an interesting contrast to globalization: when companies were closing down American factories and laying off workers and moving operations to China, none of the affected towns or workers had a say. They just suddenly no longer had a job, and a huge number of cities across the Rust Belt no longer had a reason to exist. People simply had to move, or worse, retreat to things like alcohol or drugs. AI, however, is the opposite: building data centers requires permission, which is to say that people actually have a say. Again, I am not at all saying that these people are well informed about data centers, or about the economic impact on their communities, much less the economic impact of AI generally; what I am noting is that people who didn’t have a say in globalization are suddenly finding they do have a say about AI, and it’s not a surprise they are expressing their disapproval by blocking data centers. In that Update I made the case that data center builders — and by extension the companies that use them — should straight up pay people for permission to build data centers in their communities. At a minimum, however, that increases the costs of terrestrial data centers. What seems very plausible in the long run is that the demand for compute ends up being so large that there eventually is nowhere left to build, making the vast expanses of space not just an alternative but in fact the only choice. If all of this happens — and there are a lot of “if”s here! — then suddenly that $2 trillion valuation starts looking reasonable. SpaceX is already monetizing xAI’s first data center, Colossus 1, to the tune of $15 billion/year for 300MW of capacity; that’s 3,000 racks-in-space. Anthropic, meanwhile, will probably make 3x the revenue on that capacity; it remains to be seen if xAI can get back in the state-of-the-art game, but if so then the amount of revenue it can generate per rack-in-space will be commensurately higher. Even without xAI, however, SpaceX has the potential to be a monopoly provider of marginal compute capacity. There are, needless to say, a massive number of assumptions baked into this argument, including assuming a huge number of engineering challenges are solved, Starship actually works, SpaceX gets sufficient supply of the right kinds of chips, compute demand is massively larger, agentic inference unbundles current architectures, and data center opponents are successful. The risk attached to all of these assumptions should discount the valuation you put on this business, which is to say I still think this IPO is nuts. At the same time, I’m glad it exists, for multiple reasons. The first one is the most obvious one: Musk, for all of his faults, has already pushed humanity forward on multiple vectors, including electric cars, self-driving, reusable rockets, satellite Internet, etc., and I’m excited to see him try and do more. The second is that I am in fact concerned about our ability to muster enough compute to fully realize the gains from AI, and am very worried about a replay of nuclear power, where our failure to build denied us the opportunity to even imagine what could be invented in a world of unlimited energy; the fact Musk is proposing an alternative path to unlimited compute is a relief. The third is that I appreciate the extent to which this IPO is a return to what an IPO should be: the opportunity for people to contribute capital to actually build the business, and to benefit if it works out. As I noted, I can’t make a financial model that necessarily justifies this valuation, particularly based on current financials, but neither can a VC investing in the Series A of a company. SpaceX has already invented a lot, and its early investors are going to make a lot of money with this IPO; at the same time, there is still so much more to invent that there remains a lot of upside — and, to be very clear, a lot of risk. It’s a testament to SpaceX’s ambitions that retail investors get to play VC. And hey, you get Mars upside for free! Training will continue to matter, and Nvidia’s current architecture, including high-speed compute, large amounts of high-bandwidth memory, and high-speed networking, will likely continue to dominate. Answer inference will be a meaningful market, albeit a relatively small one, and speed from chips like Cerebras or Groq (I explained how Nvidia is deploying Groq’s LPUs here ) will be very useful. Agentic inference will gradually unbundle the GPU, which alternates between stranding high-bandwidth memory (during the prefill process) and stranding compute (during the decode process), in favor of increasingly sophisticated memory hierarchies dominated by high capacity and relatively lower cost memory types, with “good enough” compute; indeed, if anything it will be the speed of CPUs for things like tool use that will matter more than the speed of GPUs.

0 views

Promises and perils

One of the just-so stories we keep hearing about AI is that it’s inevitable, that the technology is here and will continue to be here, and we better get on board or get left behind. These stories have the ring of a threat because they are, explicitly and otherwise, threatening. They are also familiar . Fear that there may be no alternative to the will of the AI arise because we have been told for decades that there is no alternative to neoliberalism, that there is no alternative to the mediation of all society by profit-driven markets, no alternative to the universal power of private self-interest that continually tries not to better the world, but to maximize it’s own profit and hence power. Stories about the “promises and perils” of AI ring true, not because the AI is poised to hunt all of us down, but because the stories reflect real experiences of technology, capitalism, and ideology; they reflect the capitalist developments of the incomprehensibility of technology, the invisibilization of labor, enclosures, proliferating neoliberal bureaucracies, and the sense that there is no alternative to capitalism and the status quo. Blix & Glimmer, Why We Fear AI , page 56 In other words, the threat isn’t so much that AI is inevitable as that the ongoing—and likely expanding—immiseration of workers is unstoppable. This is the subtext of the strange and conflicted messaging that we get from the hype men: when they say that you better learn AI or be left behind, they are admitting that a great many people will be left behind. And if you—smart and clever and hardworking person that you are—are somehow able to make it to the other side of the line, you’re supposed to find relief or pride at having done so, and not horror at all the people suffering in your wake. You’re supposed to be as uncaring as the capital that uses you. But getting through this gauntlet is no guarantee of getting through the next one—and there will be a next one, because the plain aim of the technocrats is to immiserate everyone, eventually. From the capitalist perspective, anyone with skills enough to negotiate a comfortable wage is a cost in need of cutting. Add to that the fact that AI’s whole pitch is that the more you use it, the more data it gathers, the more likely it becomes capable of mimicking you well enough to convince the fools above you that it can do your job. So get-in-or-get-left-behind is something of a trick—everyone is left behind, eventually. Which is both terrifying and clarifying. Terrifying in that the capitalists really do have the ability to do us harm—they have been doing so, already. Clarifying in that there really isn’t any reason to stay on the path they’ve laid out for us. It leads nowhere good. Meanwhile, there aren’t very many people up ahead, and there are a whole lot of us back here. Let’s see what we can do. View this post on the web , subscribe to the newsletter , or reply via email .

0 views

Why We Fear AI

Hagen Blix and Ingeborg Glimmer make a compelling case for why we fear AI: our fears of what AI will do to us are really just our fears of what capitalism is already doing. In this way, AI isn’t so much a novel new technology as an acceleration of long-existing patterns in neoliberal capitalism—automation, deskilling, unaccountability, surveillance, and increasing precarity amidst shrinking welfare systems. But therein also lies a clue as to how to counter it, in that only organized, democratic control of labor can stand up to capital. When we see through the hype, we know what work we have to do. View this post on the web , subscribe to the newsletter , or reply via email .

0 views

Revenge of The Business Idiot

If you liked this piece, you should subscribe to my premium newsletter. It’s $70 a year, or $7 a month, and in return you get a weekly newsletter that’s usually anywhere from 5,000 to 18,000 words, including vast, detailed analyses of NVIDIA , Anthropic and OpenAI’s finances , and the AI bubble writ large . My Hater's Guides To Private Credit and Private Equity are essential to understanding our current financial system, and my guide to how OpenAI Kills Oracle pairs nicely with my Hater's Guide To Oracle . This week, I’ll publish the final part of my ongoing series (“ What If…We’re In An AI Bubble? ”) about the factors and events that will cause the AI bubble to finally pop, focusing on what consequences might follow the collapse of OpenAI and the wider data center  Subscribing to premium is both great value and makes it possible to write these large, deeply-researched free pieces every week.  Today I’m going to speak from the heart, and tell you that we’re ruled by fucking imbeciles. AI is a perfect storm of failed concepts and organizations, and the apex of the Era of the Business Idiot , an epoch where we’re ruled by people so thoroughly disconnected from the actual workforce that it was inevitable that a technology would be created specifically to grift them. Just ask Aaron Levie, CEO of Box :  LLMs are dangerous for many, many reasons, but the under-discussed one is how well they play to a certain kind of executive imbecile. Generative AI is — to quote Mo Bitar — really good at doing an impression of work, much like most managers and c-suite executives, and even if it’s completely incapable of doing something, it’ll absolutely say it can and tell you you’re amazing for suggesting it. And that’s why Business Idiots love it.  Where regular human beings would say annoying things like “that’s not possible within that timeline” or “we don’t have the resources to do it,” AI will say “of course, right away!” and burn as many tokens as possible. When it makes mistakes, it’ll apologize — as it should because it failed you — but then promise to do better next time, all while costing so much less, at least in theory, than a regular, stinky human being.  It’ll create a PRD (product requirements document) of a theoretical software project with the confidence and vigor that you need to take it immediately to a software engineer and say “build this immediately,” and when the software engineer tells you a bunch of bullshit about it not being possible, it’ll spit out several convincing-sounding responses. Fuck, why even bother talking to that engineer at all? Claude Code can mock up a prototype that you can then shove in their fucking face before you fire them for not using AI to do it themselves. I realize I sound a little churlish and dismissive of those who may or may not actually get something out of AI, but this entire industry feels like a mixture of kayfabe and ignorance, slathered with a kind of angry desperation that reflects the distance between reality and fantasy, driven by people that don’t do any fucking work.  Any executive-level fuckwit you’ve met in your life now has a seemingly-powerful tool that can burp up mimicry of open source software and, if you constantly prompt it, eventually get something half-functional onto some sort of web server. When you face bugs, it’ll try and fix them, sometimes also “fixing” (adding or deleting code) from elsewhere to be helpful, like when Cursor using Anthropic’s Claude Opus 4.6 model deleted an entire production database and all its backups . It will never, ever say no, even if it’s incapable, even if it has no thoughts, even if what you are asking is equal parts impossible and unreasonable in both its timescale and scope. A Business Idiot, given his druthers, can sit there and fuck around and make an LLM spit out something that makes him feel like he’s coding, which in turn makes him feel that you, a lazy and stupid engineer, could do even more with the power of AI. It doesn’t matter that it costs an absolute shit-ton of money, or that there’s no way to measure its efficacy. The Lion does not concern himself with things like “efficacy” or “productivity,” and the Lion is increasingly tired of your whining! The Lion doesn’t even understand what it is you do every day other than not doing what The Lion is asking for! You laugh, but this is genuinely how the majority of managers and executives think and act, and now they have a special chatbot that can fart out functional-enough prototypes to convince a Business Idiot they can do anything, because executives and managers do not regularly do much work. As a result, they have little idea what work looks like other than when they look over your shoulder, which is why they wanted you back in the office, and their distance from production is why the same people who were anti-remote work are now aggressively trying to shove AI down your throat .  Organizations aren’t burning millions or hundreds of millions of dollars a year on AI because it’s good, they’re doing it because they are run by people who do not know what the fuck they’re doing.  Generative AI is catnip for hall monitors, snitches, toadies, and any other group that hates work and loves talking down to others. Put another way, it ingratiates losers who believe that learning to do or being good at something is a waste of time, because they deserve to just do what they want without any of that messy “effort.”  While I’m not saying every LLM user is an imbecile, they’re built to convince the mediocre and incurious that they’re remarkable, and it turns out that a great many of them run venture capital firms and Fortune 500 companies. I also want to be clear that while there are sane and normal people who use these things, they’re mostly drowned out by a crowd of people that oscillate between bootlicking and regurgitating capitalist mythology in a way that makes it hard to trust anybody who spends significant amounts of time using an LLM.  One thing you’ll notice about the most moistened AI boosters is that they lack much degree of pride in their work. Everything they say must, at some point, compliment the mindless, unprofitable, unreliable tool underneath it — how “incredibly powerful” it is, how it’s “only getting better,” how it’s “only the beginning” of something that’s eaten over a trillion dollars and absorbed the majority of venture capital .  It isn’t about the work, or the craft, or the thought behind it. Everything is a numb, mindless death march toward saying “job done” and burping out some sort of pseudo product, if one even exists. I’m not even being sarcastic! Per Bloomberg , Salesforce has been marketing “powerful AI products” that don’t actually exist: In a rational society, Salesforce’s stock would take a beating and the SEC would open an immediate and brutal investigation.  Sadly, our society is oriented around the power fantasies of the mediocre and spiritually-dead losers, people bereft of pride or joy in the things they create that believe that they’re owed everything .  They’re Business Idiots, and they are your enemy. Even those who believe they’re aligned with the Business Idiots by supporting and using Large Language Models are the enemy, because The Business Idiots believe that “AI” will simply remove anybody else from the picture, automating work, creativity , communication, friendship , and that includes anyone that helped its ascent.  And yet none of it’s really working, because Business Idiots don’t really know how anything works. As I said back in the original piece , think of The Business Idiot as a kind of con artist, except the con has become the standard way of doing business for an alarmingly large part of society.  Salesforce, one of the most-prominent hypesters behind the AI bubble has spent millions of dollars on advertising and marketing to promote a product that doesn’t exist in the way that it’s being sold.  Only an economy oriented around coveting and coddling losers would have let AI get this far. Every single story about AI has to either directly gloss over the obvious financial and technological issues or start speaking in the kinds of vague theoreticals reserved for cults and multi-level marketing scams. Even Bloomberg’s piece — which is pretty critical! — helps gaslight Salesforce’s customers by quoting an executive blaming their own processes for Salesforce’s outright lies: What the fuck does that mean? What’re you talking about, Madhav? What “autonomous vision”? What complex things? Do you even know? Hello? Even in this very critical piece , the endless pursuit of “fairness” — the Business Idiot’s favourite weapon when they don’t want to be graded on their actual work — means that we have this slop-adjacent explainer that mostly amounts to “yeah you know sometimes their shit needs to be better and then one day, wow , boom! We’re gonna have all sorts of stuff happening.” But this is the world the Business Idiots have created, as I described last year : This naturally created a tech industry (and a larger economy) dominated by executives that were rewarded for growth, which meant that our tech products are inherently oriented around that growth:  The problem with an economy dominated by Business Idiots is that it eventually loses its connection to the wider concept of production or solutions to customers’ problems, because that might cause management to interact with the real world and, by extension, have actual problems themselves. The problems that Microsoft, Google, Meta and Amazon solve on a daily basis are those related to its shareholders. How do we keep growing? How do we keep people engaged with our products? How do we convince our customers to pay more for our customers? And how do we keep people buying our stock? Thankfully, The Business Idiots have captured both the media and the markets, twisting the definition of a “good company” into one measured by these very same questions. It doesn’t matter that Facebook is deliberately broken or Google Search’s results were intentionally made worse because number go up, and that’s all The Business Idiot cares about! It doesn’t even matter that 10% of Meta’s 2024 revenue came from scams or that its Kylie Jenner-branded chatbot led a man with dementia to his death or that its John Cena-branded chatbot would roleplay about having sex with children or that it wants to spend $125 billion or more on AI in 2026 because Meta’s ad sales have yet to slow down .  It doesn’t matter that Meta CTO Andrew “Boz” Bosworth has overseen multiple unprofitable, unpopular products or is hated by basically every single person I’ve ever talked to at Meta — The Wall Street Journal will still write a glowing profile saying he’s a “blunt, outspoken provocateur” that’s “transforming Meta” by “unleashing AI.” One can be a colossal fucking loser that everybody hates, lay off thousands of people, fail to make anything of note, oversee multiple failures, and the Business Idiot’s consent-manufacturing machine will help wall you off from reality.  “But Ed,” I hear you cry. “You can’t call somebody like Andrew Bosworth a loser. He’s a huge success! He made lots of money!” You’re falling for the Business Idiot’s biggest trap: that having wealth or being a C-suite executive is proof that you’re not a disconnected loser.  Boz, like every other oaf destroying your favourite tech products, is the ultimate loser — he’s succeeded by taking credit for other people’s ideas, firing people when his own ideas fail, and repeating the cycle as many times as he wants because that’s what being an executive means to him. Boz has no pride in his work. If he did, he’d have resigned over the failures of both the metaverse and Meta’s wasteful, directionless AI efforts, or even over how fucking awful Facebook has become. The sad truth is that he doesn’t care! He doesn’t give a shit. Boz, like every other Business Idiot, exists to extract value from others and get rewarded by shareholders. As he said in 2018 , to Boz, “all the work [Meta does] in growth is justified.” That includes deliberately making notifications less useful, injecting clickbait and AI slop into your feed, and hiding chronological feeds behind an Escher painting of different menu options.  Boz is indicative of the vast majority of CEOs and upper-level management of most of the world’s organizations. If you read this and feel self-conscious, it’s because you secretly know I’m talking about you or somebody you know. One can be incredibly-rich and well-known and yet a huge, unbelievable loser, because being a loser is deep within your soul. A loser is somebody who takes from others, claims others’ work as their own, and demands more credit for having done so. A loser is somebody who believes work and creation is beneath them, and that they are owed the fruits of labor regardless of their actual contributions to the world. This is why so many people have such an abnormal reaction to AI, promoting and defending it like it’s their religion or nation state. While many people use LLMs and see them as a kind of word calculator or search engine, so many more see within it the chance to ascend above the proles who “work” or “create,” because they find the process of labor or effort so utterly loathsome. When somebody badmouths AI, the Business Idiot must defend it with everything they have, because attacking LLMs is attacking the output of an LLM , which is in turn a judgment on those who are tolerant of its mediocrity and impossible-to-avoid hallucinations . You see, if you demand good work with intention , that might mean the Business Idiot actually had to do something , and that’s not what The Business Idiot signed up for. We are slaves to middle management and the middle management mindset, we are living in their world, and it will collapse because they never really understood anything to begin with. LLMs impress the writers who do not want to write, the coders who don’t want to code, the researchers who don’t want to research, and the lawyers that don’t want to actually understand case law. Those that desperately tell you how powerful AI is and that you simply must use it are looking for you to validate their own laziness or distaste for effort, and those who are impressed with LLMs’ outputs tend to be people with low standards.  The aggression with which AI boosters and executives act toward those who aren’t impressed suggests a genuine intellectual and moral weakness. Nobody who’s this insistent, aggressive and violative with their language of “it’s here and if you don’t adopt it you’re stupid and dead” has ever been right about anything. Nobody this desperate, insistent and forceful has ever had good intentions, good vibes or brought good omens — they are always bearers of some kind of con.  Most technology is sold on elevating and ascending human beings. AI cheapens every interaction by creating a work-shaped product from a person that doesn’t respect you enough to give you work that’s barely fit for a human because it wasn’t made for one.  This is why being an AI booster requires you to debase yourself. You must accept becoming a dogshit dealer that loves accepting and receiving low quality goods. You must celebrate intentionless and decaying slop, and defend it and the machine that made it with your entire being. You must sully yourself — treat its unexceptional, sloppy and unreliable outputs as signs of sentience, or at least the proof that digital sentience is possible. You must defend horrible, abrasive, ugly, loud monoliths of steel full of $50,000 graphics cards. You must say they are necessary, and you must aggressively antagonize those who do not.  That’s because they’re not defending LLMs so much as they are the greater form of Rot Economic capitalism . The Business Idiots have successfully changed our experience of buying and using software from one of “paying for a service” to “accessing powerful technology,” reframing every mistake as the necessary pain of new innovations and every mediocre output as proof that the tech industry can still innovate , because critiquing these things — asking for them to be anything approaching the autonomous, reliable and powerful technology that everybody claims they are — is considered “improper” or “biased” or “skeptical.”  Oh yes, they use “skeptical” as a pejorative. This aggression only proves that the management sect is scared. LLMs were meant to be the thing that replaced all workers, but the actual outputs and outcomes don’t seem to be resulting in anything changing other than lots of things becoming worse or more-expensive. Every AI booster will say “AI is writing all the code at some organizations,” but never seem to explain what happens as a result, such as whether software is being shipped faster, or better software is being made, or, well… anything.   The answer is simple: they don’t know because nothing has actually changed. Organizations writing massive amounts of code using LLMs are facing massive product stability issues and, in the case of Zillow, spending millions of dollars a year to turn their codebase into a confusing, intentionless slop and increasing software reviewer loads by 29,000 hours a month.   This is only made possible in an economy run by people who don’t do any work, and a tech and business media that exists to ingratiate them. I want to lead with a surprising comment: I don’t think LLMs, as a tool, are a grift. There are use cases, though those use cases are miniscule compared to the egregious promises and extrapolations made by the majority of the media and the executive sect, and absolutely nothing about them warrants the amount of money invested in them.  That being said, I think LLMs lend themselves perfectly to grifting. Sam Altman helped propagate a technology perfect for conning people with potential, a larger extrapolation of Altman’s own life of taking dogshit — Loopt, for example ! — and parlaying it into larger opportunities. It can make a really half-hearted demo of a lot of things, and that’s good enough to sell to Business Idiot.  Dario Amodei took this grift and perfected it. Anthropic is a company purpose-built to con people into giving it by money by making people feel smart. LLMs can do work-shaped stuff, sometimes, as long as you debase yourself to accept mediocre and often-broken stuff that you have to keep a vigilant eye on, and either use a subsided product that loses Anthropic money or pay a shit ton of money as an enterprise to Anthropic and it still loses money.  The media was also primed for the grift. Reporters are never incentivized or supported to actually spend meaningful time understanding technology, meaning that the vast majority lean toward access journalism or, at best, the most kindly, “objective” (read: pro-business) takes that result in “wow, isn’t the future great?” no matter how good the thing they’re using actually is. Editors are, in many cases, entirely disconnected from the process of reporting or writing, let alone the underlying technology their reporters cover, which leads them to at best live in a world of “I sure don’t trust these CEOs but their technology sure is powerful.”  As a result, all a technology has to do is either look or sound plausible. Can LLMs write all code? Not really! But because they can write some code and there are lots of eager people on Twitter saying it’s powerful, that’s all it takes to write the sentence “software engineers are writing most of their code using LLMs.” Can Anthropic actually take down Figma? God no, but the mere existence of Claude Design is enough to write that it might . All it takes is the hint of something to be true for it to be written about as gospel. Each statement adds another bullet point to Anthropic’s investor deck so that it can raise another $30 billion in funding, which in turn validates any journalist’s beliefs in Anthropic’s ability to destroy other companies with a product the journalist has not and never will use.  Business Idiots did well to pressure modern journalism into conflating scrutiny with a lack of curiosity. To ask too many questions is “unfair.” To not immediately assume that LLMs are getting “exponentially better” is to be an ignorant luddite. To not assume that everything will work out like it did with Uber or Amazon Web Services is to “ignore history.”  Grifters took advantage of this industrialized intellectual weakness using a tool purpose-built to do enough of an impression of something to impress the media and executives. It worked, because both are sold to in much the same way — by telling a plausible-enough story that ingratiates somebody who is never the end user of the product in question.  If a journalist gets curious, an LLM can make a good-enough impression of somebody writing software to fool somebody who doesn’t really know what they’re doing, and if you prompt it again and again and again, it can get something functional out the door. This is all it takes for somebody — a reporter or an executive — to extrapolate that because they were able to do something (even though the LLM did it), a subject-matter expert would be able to do even more.   As a result, LLMs are fantastic tools for grifters. Somebody that doesn’t really like doing anything other than getting applause for other people’s work can now run multiple concurrent agents, endlessly tweak prompts and tell everybody that they’re an “AI specialist,” with their LLMs making them seem busy in a way that’s hard to argue with because there’s so much bullshit going on.   An ethically-questionable “AI beat reporter” (though this is not across the board) can easily become prominent by simply writing up whatever it is the companies are excited about and reporting on leaks of Slack conversations, creating the appearance of “scrutiny” without ever scrutinizing or questioning the ethics or underlying economics. An oafish product manager with terrible ideas can now pump out half-functional scripts and software that sort of does something, and when their manager — somebody who also doesn’t do any work — sees what they’re doing, they can happily report to their manager that the person in question is “AI-first.” And when you’ve oriented your entire economy around middle managers, vice presidents, and executives that don’t do any real work, this shit seems magical. AI companies are natural grifting instruments. Because AI startups are so capital-intensive, they naturally require tons of money, which means that venture capitalists have something to invest in, and because there’re always so many rounds , valuations are constantly being pumped . Because AI models can be plugged into anything , by extension any AI founder can pretend that any industry can be automated using AI, and because venture capitalists don’t build stuff or really know stuff anymore, they’re naturally impressed by basically any demo or plausible-sounding promise, especially when an LLM can make something that looks like software. Because AI data centers are so capital-intensive, they require endless amounts of risky debt, but that risk allows private credit to take investments from insurance and pension funds desperate for yield, and because everybody involved is a Business Idiot, nobody has actually thought about what happens if these things don’t work out.  AI allows everybody to grow as long as everybody ignores the big, obvious problems with its efficacy and underlying economics . All you have to do is keep up the kayfabe that the problems aren’t problems and the solutions are imminent, or if you want to pretend to be a critic, you can also suggest that all of this is inevitable. Don’t worry about the fact that data centers aren’t getting finished , or that OpenAI and Anthropic make up 75%+ of all AI compute capacity , or that they make up more than 50% of Amazon, Google and Microsoft’s revenue backlog , or that both of them are horrendously unprofitable outside of brazen accounting tricks that would only work on a business and tech media intent on believing everything they say . Don’t worry about it! Stop asking. Don’t worry about Claude deleting entire databases in seconds, they’re gonna fix that somehow, some day.  That ignorance is a sign of laziness, and of the dominance of the Business Idiot mindset. Everybody wants this to be Uber ( it isn’t ) or Amazon Web Services ( it isn’t ) because it allows them to avoid learning stuff or making informed decisions. If it’s like Uber or Amazon, you can just throw your hands up and say “it’ll work itself out!” which is way, way harder than explaining to me how an industry that loses billions of dollars with no path to profitability doesn’t run out of money at some point.  This is, again, part of the grifter’s toolkit. When you don’t want somebody to think about what’s actually happening, you point them toward something that ingratiates them. Somebody who is rude and mean and asking about those billions of dollars of losses is a hater — somebody who says “well, Amazon Web Services lost a lot of money!” is historically-aware and erudite , even if actual history tells you that Amazon Web Services cost around $50 billion before becoming profitable, or around a quarter of Amazon’s 2026 capex . This isn’t to say everybody making this argument is lazy, just that they’re unwitting pawns in a larger grift where mythology is used to support the biggest waste of capital in human history.  And really, that’s the larger LLM grift: encouraging people to accept or sell lazy, half-baked shortcuts instead of fundamental units of labor or production, all while making them feel smart for doing so. It is a technology that perfectly fits the grifter strategy of giving people as little proof as possible to prove something is real, then letting them fill in the blanks with whatever will make them feel like they’re “ahead,” even if being “ahead” means "mournfully accepting that their job might be automated.” Yet I challenge you any time you hear somebody saying that “AI is here, and it’s transformative” to ask them what the fuck that means , because while “it” might be real, it’s unclear what they actually mean by “it.” The grifters want you to immediately start filling in the blanks, assuming that CEOs saying they’re laying people off because of AI aren’t blatantly lying and that AI has done something, somewhere, that remotely warrants any of this waste and endless propagandizing.  And they want you to do that because they’re losing. If you’ve ever been in a bad relationship — romantic or otherwise — you’ll know the feeling of trying to find any way to prove that things will improve, and the amount of times you’ve ignored something glaringly, obviously wrong. “They’re going through a lot,” “they don’t need to tell me what I need to hear, I know they feel it inside,” “they’re busy right now,” and every other rationalization of somebody not being good to you or interested in you is an exercise in self-deception to avoid dealing with an uncomfortable truth.  Any time you’ve ever found yourself looking for shreds of proof that things are going well is the exact time you should be leaving somebody, yet you’ve likely stayed and sought them out like Sherlock Holmes before he spends thousands of dollars on therapy. Every time you stick around a little longer, you do so based on increasingly-questionable data and the knowledge that changing course will require a brutal reckoning with reality. Sometimes you stick around forever, because making more bad decisions is sometimes harder than making one good yet difficult one. People are making the same mistake with AI.  Right now, everybody is ignoring many, many warning signs at once, all because of short-term thinking. Because hyperscalers’ existent businesses have yet to slow down, everybody assumes — without any actual proof — that AI is somehow driving growth. Conversely, nobody seems to have an answer as to how big tech makes the $2 trillion to $3 trillion of brand new revenue it needs to justify its trillions of dollars of planned capex, and even the Financial Times only sees Amazon making any kind of return on hyperscaler AI investment by 2030:  And even here, in a piece called “the impossible maths of the AI boom,” The Financial Times deliberately finds a way to make things look better by removing every single operating cost!   Those covering Anthropic’s so-called “profitable” second quarter are intentionally ignoring that Musk deliberately discounted the months of May and June in an obvious attempt to engineer a headline. They’re also ignoring the obvious mismatch between Anthropic CFO Krishna Rao’s sworn affidavit from March 6 2026, when he said it had “exceeding” $5 billion in lifetime revenue, which doesn’t line up with any of its previously-reported or stated annualized revenues . The answer, in the end, is that it’s just easier to ignore this stuff, because taking it seriously would require thinking about Anthropic in skeptical terms, which would, in turn, require you to start questioning the fundamental stability of the AI industry. And they need you to do that because they’re fucking losing. OpenAI had a negative 122% non-GAAP operating margin in Q1 2026 , and ChatGPT growth has stalled. Despite its so-called profitability, Anthropic has had to raise a combined $75 billion (between Google, Amazon and investors) since the beginning of the year. Both OpenAI and Anthropic had to lower their gross margin projections at the end of 2025.  Anthropic and OpenAI — neither of whom have any path to sustainability or profitability outside of accounting shenanigans and willing co-conspirators — now make up 50% of all upcoming hyperscaler revenues , and the only way either of them can pay is if somebody, either a venture capitalist or hyperscaler, chooses to give them the money. Nobody has an explanation as to how that works or who funds it, other than that “hyperscalers are some of the most-profitable, cash-rich companies in the world ( as their cashflows drop to their lowest levels in history ),” and that “both of these companies are growing incredibly fast.” Anthropic’s growth is a direct result of Business Idiots controlling a large portion of our economy. Nobody — not a single company — has been able to express in clear-set terms based on their actual bottom line a conversion of “I spent this much money and got this in return.”  In fact, it seems like the opposite is happening. As I’ll mention in greater depth later, Andrew Macdonald, the Chief Operating Officer of Uber, recently gave an interview where he said that the company’s ballooning AI costs are “harder to justify,” in part because there’s no way to link its token spend to useful new features .  Everybody spending millions of dollars on AI tokens is experimenting. As I’ve discussed previously , nobody really knows how to measure the ROI of AI, and the naturally-chaotic nature of LLMs makes it impossible to measure how much it might even cost: Marc Benioff isn’t spending $300 million a year on Anthropic tokens because it’s doing something . He’s doing it because he, like every Business Idiot, has no idea what to do other than spend money, hire people, or fire people. Spending lots of money on AI allows him to say that Salesforce is an “AI-first organization,” and then blatantly lie for two years running that he’s “not hiring any more engineers” despite the many, many job listings on Salesforce’s website for engineering positions.  It’s kayfabe that exists to distract you from the fact that Agentforce only has $800 million in annualized revenue, or around $66 million a month for a company that makes $11 billion or more a quarter .  Seriously, somebody please show me a company spending millions of dollars on AI tokens that can also express a clear, indisputable return on investment. Show me the actual returns. Show me the processes automated and what those processes being automated do to offset these remarkable costs. All of this fucking bloviating about how AI is inevitable and real and so powerful never seems to result in a profit . While companies can vaguely say “oh we saved X number of hours from this,” I am still waiting for somebody to say “we saved this much money and this is how investing in these tokens is profitable.” It’s always something vague, like when Klarna said it estimated ChatGPT would “drive $40 million in profit improvement in 2024,” a stat that it never explained or returned to. Klarna CEO Sebastian Siemiatowski once told Sam Altman to use Klarna “ as his guinea pig ” — only to have to hire back the humans it tried to replace with LLMs after a massive wave of customer complaints . Klarna once said that its chatbots did the work of 700 people , a blatant lie that it got away with because the media doesn’t want to scrutinize an era built on deception. That’s because underneath the puffery and the propaganda and the pervasive sense of inevitability, the AI industry is losing. Anthropic and OpenAI’s revenue growth is only possible thanks to a near-perpetual misinformation campaign that overstates both the current and future capabilities of LLMs, and a near-society-wide ignorance at the executive level. Every story about Anthropic’s customers burning millions of dollars’ worth of tokens comes back to one unfortunate fact: nobody knows how much it’s costing but whatever it costs today isn’t sustainable.  For example, and as I mentioned earlier, Uber COO Andrew Macdonald said this weekend that it was becoming “harder to justify AI costs within the company”: I believe that Uber’s experience is indicative of effectively every company’s experience with AI. Business Idiots, disconnected entirely from production, demand their workers burn as many tokens as possible, incentivizing them to do so for reasons that only make sense to somebody who doesn’t do any work.  And burn as many tokens as they could, Uber’s engineers did. Four months into the year, Uber had exhausted its entire AI budget — in part because it created a leaderboard of the biggest AI users , giving employees an incentive to run wasteful tasks and prompts, if not for bragging rights, then at least to show the higher-ups that they’re onboard with the new direction. .  AI is meant to be this ultra-powerful streamlining tool that changes the workplace forever, yet the practical result appears to be “we’ve spent a bunch of money on something that makes our least-sentient managers excited.” Too many members of the media work overtime to find ways to either ignore or explain away these problems. Stories about how Anthropic and OpenAI have agreed to a combined $1.048 trillion in compute commitments fail once to ask how they might get that money, other than to suggest that both may become cash flow positive by either 2028 or 2030 , again with no discussion as to how other than “they will.”  For them to do so, they will need…well, a trillion dollars over the next four years, either through revenue or funding. That’s an insane amount of money — more than any startup or even public company has had to raise in history — and the fact that more people aren’t talking about that suggests that they either don’t care or don’t want to.   The same goes for those covering NVIDIA and other semiconductor companies. While the largest company on the stock market once again beat analyst expectations and raised guidance, few ( other than JustDario, it seems ) noticed that despite all that extra revenue , NVIDIA only saw its cash and equivalents grow by $600 million quarter-over-quarter.  Why? Because it’s investing tens of billions of dollars investing in AI data center companies like IREN , CoreWeave , and, of course, both Anthropic and OpenAI, and has agreed to spend an unbelievable $30 billion on cloud service agreements in the next six years, quite literally paying its customers to buy its products in the most blatant circular financing since the dot com bubble. This is what an industry does when it’s in distinct, existential distress. NVIDIA is now the fifth-largest purchaser of AI compute behind OpenAI, Anthropic, Microsoft and Meta at a time when AI compute is meant to be facing a supply crunch , which suggests that while demand may exist on a low level for those trying to pick up a few hundred H100s, the only customers for data centers full of Blackwell GPUs (at least, those that actually exist ) are Anthropic, OpenAI, an organization with no clear AI strategy and a CEO that can never be fired, and the company selling the GPUs. That’s a big fucking problem considering that there are tens of gigawatts of data centers being developed that will require around $380 billion in annual revenue to substantiate. There is, at this point, little proof that the AI data center “boom” is anything other than the largest real estate speculation in history.  Some will point to the difficulty one might have finding GPUs, carefully ignoring how the majority of capacity is taken up by OpenAI and Anthropic , leaving the vast majority of customers to fight for scraps thanks to the extremely slow pace of data center construction . Others will say that guidance from companies like NVIDIA and Samsung prove that “the demand is there.” Forgive me, I’m going to be a little stern. I know, I know, you’re gonna say “Ed, you can’t paint with such a broad brush!” but I can find no data center debt deal that makes me feel like anybody was really thinking too hard when they put it together. Blue Owl agreed to invest up to $10 billion in Stargate Abilene after a single fifteen-minute conversation , despite the only tenant being OpenAI, a company that couldn’t afford to pay for the compute it committed to, and nobody ever having built a gigawatt-scale data center in history. This was likely because Blue Owl took advantage of the Business Idiots who run Crusoe: This is, to be clear, a huge scam, and something that should’ve horrified investors, except said investors are also Business Idiots that saw a big number and said “whoopie!” Money men with little connection to how long stuff takes to build , let alone the underlying technology being sold or the companies that might actually pay for the compute saw the potential to “back the next industrial revolution” and fell over themselves to take part.  Like every greedy dullard, Business Idiots backing data centers are easily won over by the blatant lie that a data center is an “ AI factory ,” conjuring up images of large buildings that print money with little human labor needed. In reality, data centers are vast, labor-intensive construction projects connected to large, labor-intensive power projects , filled with GPUs that are so expensive that they require billions in debt that are upgraded on a yearly cycle, with customers that may or may not exist by the time it actually turns on. Calling them “AI factories” is an intentional attempt to simplify projects that have more in common with building cities than any kind of modern factory. These Business Idiots are too informed by other Business Idiots, like the sell-side and buy-side analysts that have no interest in talking about what might happen in the distant future when they can conjure up plausible-sounding statements that pump their bags. Every single buy and sell-side analyst should have said CoreWeave, IREN, and other NVIDIA-backed neoclouds are dangerous investments fueled by circular finance. Instead, almost every single one has upgraded them as a result of NVIDIA’s continued investments , despite these investments being a sign that these businesses can’t last.  The few hedge funds and private equity firms I speak to that have any kind of mental clarity are facing pressure from investors misinformed by analysts and the media. Hundreds of billions of dollars — at least $178.5 billion in America alone in 2025 — have been sunk into data center construction based on flawed information, astronomically more flawed than the assumptions that led to the dot com bubble bursting , as I covered in my premium piece from a few months ago . This is like if they built out all that dark fiber for what would amount to a few hundred internet users in 10 years. These people see NVIDIA’s continued revenue growth as a sign that “demand is unstoppable,” yet that “demand” is entirely contingent on how long investors are willing to ignore reality, much like the rest of the AI industry can only continue as long as everybody keeps up the kayfabe of its supposed inevitability. It’s time to stop, and force these failsons to stand on their own two feet.  I’m growing tired of the amount of people I read saying that “AI is real, but the economics are irrational,” as if these facts are entirely divorced from one another.  A GB200 NVL72 rack will be just as expensive to run in 2030 as it is today, and an incomplete data center will still take just as many hundreds of millions or billions to finish in the future too. There are, I believe, at least $200 billion worth of data centers that will never make even a quarter of their costs back before collapsing, and that’s assuming that they ever turn on or their customers exist by the time they do so.  AI is only “real” because everybody is willing to ignore its blatantly-obvious problems. The only reason that every app has an AI feature or every AI company can still sell a $20, $100, or $200-a-month subscription is because venture capital has yet to walk away from an industry that relies on eternal subsidies. AI data centers only continue to have revenue as long as venture capital and hyperscalers support Anthropic and OpenAI, and their revenues only continue to grow as a result of an endless, society-wide media campaign built on misinformation and API revenues driven by unsustainable venture-backed startups and businesses run by people excited to blow millions of dollars for no reason.  AI is only as “real” as the excuses that get made for it, and the amount of money those who subsidize it are willing to lose. Venture capitalist subsidies are the only reason that companies like Perplexity or Lovable are alive , which in turn means that a large chunk of both Anthropic and OpenAI’s API revenue is only made possible through those subsidies.  Demand for data centers is, by extension, only as large as these subsidies can sustain. Much of this is substantiated by the myth of executive intelligence. Most assume that Sundar Pichai, Satya Nadella and Andy Jassy wouldn’t be as stupid as to burn a trillion or more dollars on data centers for an unprofitable product with demand that only exists because of their own subsidies…except that’s exactly what’s happening. These men have no other hypergrowth ideas , and are more willing to annihilate their cashflows and dominate the Earth with half-finished data centers than to admit that their core businesses will eventually decline. And because these hyperscalers were so aggressive with their buildouts, the Business Idiots conflated that hunger with some sort of proof of massive demand for AI.  Yet even NVIDIA’s own earnings show that demand is incredibly-centralized, with 54% of its Q1 FY27 revenue ($44 billion out of $81.6 billion) coming from three customers , up from two customers accounting for 30% ($13.2 billion) in Q1 FY26. I assume one or more of these are hyperscalers, which means that NVIDIA’s continued growth hinges heavily on the idea that big tech will continue to dump trillions of dollars into its GPUs in perpetuity. I’m repeating myself, but this is not what a healthy industry looks like . If AI data center demand were evenly-distributed and sustainable, NVIDIA’s revenue wouldn’t depend mostly on three customers. Similarly, the entire media wouldn’t be loudly ignoring a short seller report that suggests that 20% of its FY26 revenue came from illegal sales to China . As I’ve said, AI is only as real as its subsidies. ChatGPT is only free to hundreds of millions of people because OpenAI is able to raise hundreds of billions of dollars, much like Anthropic is only able to subsidize its subscribers anywhere from $8 to $13.50 for every dollar of revenue because of endless venture welfare.  The underlying economics suggest that no subscription-based AI service — let alone a free one — makes any kind of financial sense, and the only reason that everybody has had such unrestrained access is because the media and the markets approved it, and the people with the money are deluded and disconnected from the process of value creation on almost every imaginable level. Any statements around “Anthropic actually being profitable on inference” are products of fantasy and magical thinking , distilled copium for people that would rather delude themselves into believing that none of this ever made sense. Again, the assumption is that “companies would never just burn a lot of money,” but that too is catering to the greater myth of executive competence, something that nobody who spends any amount of time around managers or executives would ever believe.  GitHub Copilot let people burn thousands of dollars on a $39-a-month subscription as a means of expressing growth. I absolutely, 100% believe that both OpenAI and Anthropic are doing the same, and that neither of them has some magical way of making inference cheap enough to justify letting people burn thousands of dollars on a $100-a-month or $200-a-month subscription. To give them the benefit of the doubt is to empower them to continue to raise money by conning their investors and the general public, and to continue perpetuating an era of software that runs contrary to what makes technology good. Their goal is simple: to ram as much of this through to as many people as possible to get them to spend as much money as possible…until they work out a way to make OpenAI or Anthropic or these endless data centers into something approaching a real business. One of the greatest mistakes we can make in our lives is to assume that the rich and powerful have any idea what the future holds, or that they have any grand plan or strategy.  It’s very likely that Dario Amodei and Sam Altman’s plan is to keep burning money until somebody who works for them comes up with a way not to, and in the interim their plan is to get as many users as they can to keep raising money.  Similarly, Microsoft, Google, Meta and Amazon’s plan is to keep building data centers in the hope that they’ll have a reason to use them by the time they’re built. There is no other plan. They do not have a secret invention coming. They do not have AGI in a box in their office. They do not have anything, and the reason they’re spending so much money and shoving AI into everything you use is because they have no fucking clue what to do. This is why Dario Amodei makes wild claims about AI replacing 50% of all white collar workers or Microsoft AI CEO Mustafa Suleyman claims all white collar labor will be automated in 18 months — because the actual products themselves aren’t impressive enough to win you over or justify the hundreds of billions of dollars being sunk into AI. They say these things to make you think that they have a scary and powerful technology behind the scenes that does not exist . And yes, that includes Mythos . The forceful, harassment-grade incursion of AI services into our daily lives is not a sign of its power, but a gesture of the lack of confidence and fear in the hearts of its progenitors. Good shit sells by telling you why it’s good — dodgy shit sells by tricking and scaring you and taking advantage of Business Idiots who think that using an LLM to type emails and spending 12 hours a day on Twitter constitutes “work.”  I believe the vast majority of these data centers go unused and/or unfinished, and that most AI startups will die once the venture capital subsidies dry up . I believe that neither OpenAI nor Anthropic have a future, and that their revenues are only made possible through venture subsidies for startups using their models and the experimental revenue of Business Idiots that don’t really know why they’re “doing AI” in the first place.  AI demand remains a result of a societal psychosis and a weakness in those who are meant to scrutinize the untrustworthy. Its unraveling will be framed as impossible to see coming because nobody in power had bothered to look.  It’s easy to feel hopeless. We’re at a point where the greed and the shamelessness and the stupidity is at a fever pitch.  We’ve reached a time the mask has started to slip, and the C-suite imbecile class is unabashed about its loathing of people, as was the case when the CEO of UK bank Standard Chartered CEO (Bill Winters) talked about how those at risk of losing their jobs to AI are “lower-value human capital,” at the same event where he said the company would likely shed nearly 8,000 roles in the coming years due to AI.  Winters would later apologize for his choice of words — although, to be clear, he was being absolutely honest when he made those remarks. That is what he believes.   Everything feels rough because the AI industry is equal parts desperate and over-confident. AI executives believe that they can cram enough promises of money into the system that the system would rather cannibalize itself than admit that it made a mistake. Sundar Pichai, Andy Jassy, Larry Ellison, Elon Musk, Mark Zuckerberg and Satya Nadella will gladly annihilate hundreds of billions of dollars to avoid the inevitable, but once they do, it’ll be gruesome.  At the same time, the things that they need to happen — actual profitability, actual returns on investment, actual tangible proof that this is a real thing rather than something they all have to actively conspire to keep alive — aren’t happening at all. Each week, we hear about new AI megaprojects that will dominate our countryside with blinding lights, endless noise and fume-belching gas turbines at such a scale that it feels impossible it could ever stop. The system is absolutely going to try and exhaust itself to keep it going. The government bought $9 billion of Blackwell GPUs , which, to be clear, isn’t a “Too Big To Fail“ moment so much as it’s a way to keep NVIDIA’s plates spinning for another quarter. In truth, the amount of money that NVIDIA needs to keep this going is so extreme that it is now a test of how long the debt markets and the hyperscalers can keep sustaining the hype. A trillion dollars in annual revenue is necessary by the end of 2028, which would require over 30GW of data center capacity to be built by then at a time when only 5GW at most appears to be under construction .  Nevertheless, even the sweatiest, least-trustworthy boosters have begun to sneak in statements about “we’re probably not in a bubble,“ or “yeah it’s a bubble, but it’s a good bubble.” Jeff Bezos, when asked about the AI bubble, said that you “ shouldn’t worry about it ,” which…is not really sufficient, is it Jeff?  None of this is to say that the mood is good! The vibes are disastrous. Everybody is exhausted. Those who love AI vibrate with a strange soullessness, constantly talking about the incredible power of AI without ever showing what it did or, perhaps, what all that supposed saved time got them. It sucks to work at basically any hyperscaler right now.  Basically every person in every job has had somebody intimate they’re going to lose their job to a computer every time they open the newspaper or use a website, and every app has some sort of desperate, vulgar pop-up about a feature that will generate some bullshit, obfuscating the features you actually want to use in favor of those that might lose the company money, because the company has to prove to the people that invested in the company that they’re “futuristic.” Alternatively, their CEO has either mild or severe AI psychosis to the point that they have decided to violate your user experience. AI is a non-consensual technology at its heart. But they are losing. They all know it. They are acting desperate.  It seems that there are nearly as many announcements of new large data center developments as there are cancellations of said data center projects. While hyperscalers can dismiss that as a simple reallocation of capital, and nothing to worry about, it’s harder to ignore the growing backlash against these facilities from locals — and the success that locals have achieved in blocking (mostly temporary, but some permanently) any future developments .  And it gets worse. Anthropic had to conspire with Elon fucking Musk to conjure up a single profitable quarter to swindle the media and its investors one last time . In response, OpenAI either leaked or had leaked immediately following that it had a negative 120% margin and ChatGPT growth had stalled . Anthropic is either the single-most successful grifter of all time or speed-running a con where it fudges together numbers to raise endless amounts of money to keep its billion-dollar burn going.  These are not the actions of honest, sustainable companies that will exist in the future. I believe that we are on course for a truly horrible crash, the likes of which may rewrite the venture capital industry and mortally wound one or more hyperscalers, as well as fundamentally divide society on so many levels into those that fell for this and those that did not. This will, in the short term, be absolutely fucking horrible for our markets and our wider economy as a result of the time-bomb of private credit and private equity. In the long term, I see it as a “They Live” moment for many millions of secret imbeciles and cretins in our midst, and I don’t think it’ll be easy to wash the stench off for those that really pledged themselves to the graveyard smash here. We will win, long term. What they are doing is not working. The future will not be without pain, nor will it be easy, or pleasant, or something I relish in. But in the long term I think this is a moment where the greater Business Idiot incursion faces a reckoning with a reality it believed it could change through sheer force of will. These people don’t know how to build things that work anymore, and thus the only thing they can do is spend money and fire people. They believe in nothing other than growth, and one cannot exist on belief and hype alone, at least not forever. And I can’t wait to watch what happens when it collapses. I’ll close this piece with the regular CTA — please, subscribe to my premium newsletter ($7 a month, $70 a year, you’re gonna love How OpenAI Kills Oracle and The Hater’s Guide To Private Credit — but with a little explanation as to why I do the things I do. I write this newsletter to hopefully do three things: I do it because I believe, fundamentally, that these people — Altman, Amodei, Nadella, and the many, many other villains that I’ve mentioned in these pages — are bad people, and their values are the antithesis of my values. I care about people, and humanity, and truth, and they do not.  I deeply love technology, and feel it made me the person I am today. It allows me to do wonderful things, connect with wonderful people, and discover endless troves of incredible information. The computer is marvelous. The computer has done many wonderful things for me, despite what all the Business Idiots say. I see LLMs as a violation of everything that great computing stands for. The AI industry encourages its users to both accept and present low-quality work and demands that they constantly defend the industry from those who would demand better from it. It is inefficient, power-intensive, environmentally destructive, and inherently sold based on things that it might do, providing far more value to scam artists and con men than it does to its end users.  This is a mask-off moment for both the ruling class and those captured by capital, and an opportunity to look around you and see who is most-easily fooled. No industry of value needs to mislead you or make you feel bad for not adopting their technology. No trustworthy individual will ever see the need to humiliate or attack somebody for being insufficiently excited about a product. No CEO that talks of a theoretical future as a means of selling you software in the present should be trusted. No technology that makes mistakes with regularity should be defended. And no industry that demands everything from us — our land, our energy, our water, our jobs, our art, our writing, our attention and every dollar we have — should ever be treated with anything but revulsion. That “demand” is almost-entirely funded by debt. That “demand” is not an extrapolation of demand for AI services , but for debt’s hunger to invest in something private credit and banks believe in. Private credit and banks believe in data centers based on flawed maths and magical thinking, because they too are run by Business Idiots. AI data centers are themselves a grift, convincing investors that they’re backing large infrastructure projects akin to housing developments or factories, rather than warehouses full of expensive hardware for customers that may or may not actually exist. First, to tell you that the Business Idiot class wants you to doubt yourself, because whether you recognize it or not, they’re engaged in acts of information warfare against you. Second, to remind you that facts are facts, and numbers are numbers, and that no amount of puffery or obfuscation can change pure mathematical reality. The AI bubble is exactly that, a bubble, and like all bubbles, it will eventually pop . Third, to remind you what it is we’re fighting for. Because every newsletter I write isn’t simply about highlighting mathematical stupidity, or corruption, or dishonesty.

0 views
Nicky Reinert 6 days ago

Digital Dilemma: Why Google Accounts Should Be Treated as Critical Infrastructure

Google and Microsoft accounts have become part of digital public life. Why account suspensions are more than just an email problem.

0 views
DHH 6 days ago

Basecamp Five

I've been working on Basecamp for half my life, and nearly my entire professional career in software. The first code was written in the summer of 2003 when I was just 23. Now I'm 46, and we've just released the fifth major version.  It's an incredible update to a service that continues to help about a million users a day avoid dropping the ball when working with others. It's AI accessible, but not agent hysteric. It's still famously easy to use, still executes the basics beautifully, and still focuses on the small to medium-sized teams we've been serving in the Fortune 5,000,000 for decades. Here are just three of my favorite new features in Basecamp 5: Lexxy editor: Our new text editor finally brings tables, markdown, and live syntax highlighting for code to Basecamp. Oh, and voice notes. It's built on Meta's Lexical editor toolkit, and it's going to ship as the default for Action Text in the next major version of Rails. Keyboard accessible: After moving to Linux, building Omarchy, and acquiring a taste for mechanical keyboards, I've come to love navigating the computer primarily through hotkeys. So with a lot of effort, Basecamp is now a delight to drive through the keys, and you don't have to be a brainiac to remember them all: just hold down SHIFT, and they're revealed in the interface. SHIFT + S opens the sidebar, ESC moves focus between it and the main page, SHIFT + C starts composing a comment/chat line/answer. The permanent sidebar: If you live in Basecamp, like I do, it's to stay on top of all the new things that are constantly happening in a busy account, and that's just gotten so much faster with the new permanent sidebar. Before, we had a Hey! menu in the top bar. You'd get a little dot when something was new, then you'd open it, click, and the menu would close. If you had five things that were new, it'd be open-click-close, open-click-close, five times. Being able to zoom through these now with just the return key, tap, tap, tap, and I've read three new things. So good. And there's so much more. Jason put together a great summary on the new marketing site, which in itself is brand new too. A back-to-basics design in many ways. As our entire industry is getting swept up in agent hysteria (and I love AI as much as anyone!), we thought it better to focus on the human communication that's the cornerstone of Basecamp. The new site just speaks plainly to that mission and shows you the software right at the top. Another thing that's back is color, specifically in the logo. Basecamp's clever but flat paperclip logo has been replaced with a modern take of our original rolling mountains. In full three dimensions, with depth and a gradient. Love it.  Overall, I'm really proud of what we've built with Basecamp Five. We're inching in on a quarter of a century in service! We still have customers who signed up back in early 2004! This is the kind of legacy that makes me beam, and the new version is just ace.  If you've tried Basecamp in the past, it's time to take another look. If you haven't tried it yet, you're in for a treat.

0 views

Netherlands Seizes 800 Servers, Arrests 2 for Aiding Cyberattacks

Authorities in the Netherlands have arrested the co-owners of two related Internet hosting companies for operating IT infrastructure used by Russia to carry out cyberattacks, influence operations and disinformation campaigns inside the European Union. The two men were the focus of a 2025 KrebsOnSecurity story about how their hosting companies had assumed control over the technical infrastructure of Stark Industries Solutions , an Internet service provider sanctioned last year by the EU as a frequent staging ground for cyber mischief from Russia’s intelligence agencies. An investigator with the Tax Intelligence and Investigation Service (FIOD), the Dutch financial crimes agency, during the raid. Image: FIOD. The Dutch daily news outlet de Volkskrant reports that the Dutch financial crime agency FIOD on May 18 arrested a 57-year-old from Amsterdam and a 39-year-old from The Hague, charging them with violating sanctions law by directly or indirectly making economic resources available to EU-sanctioned entities. The Dutch investigation focuses on Stark Industries, a sprawling hosting provider that materialized just two weeks before Russia invaded Ukraine. As detailed in this May 2024 deep-dive , Stark quickly became the source of massive distributed denial-of-service (DDoS) attacks against European targets, and emerged as a top supplier of proxy and anonymity services that showed up time and again in cyberattacks linked to Russia-backed hacking groups. That report identified two Moldovan brothers — Ivan and Yuri Neculiti and their company PQHosting — who were providing one of Stark’s two main conduits to the larger Internet. In May 2025, the EU sanctioned PQHosting and the Neculiti brothers for aiding Russia’s hybrid warfare efforts. But as KrebsOnSecurity observed in September 2025 , those sanctions failed to target Stark’s remaining connection to the Internet — an Internet service provider based in the Netherlands called MIRhosting . MIRhosting is operated by Andrey Nesterenko , a 39-year-old Russian native who runs the business out of the Netherlands.  News that PQHosting and the Neculiti brothers were about to be sanctioned by the EU leaked in the media nearly two weeks before the sanctions were announced last year. During that time, the Stark network assets were transferred from PQHosting to a new entity called the[.]hosting , under the control of the Dutch entity WorkTitans BV . And as our September 2025 report showed, WorkTitans was controlled by Nesterenko and a 57-year-old from Amsterdam named Youssef Zinad . On top of that, WorkTitans was getting connectivity to the larger Internet solely through MIRhosting, where Zinad had worked previously. On May 18, Dutch financial crime investigators arrested Nesterenko and Zinad, and searched three businesses in Enschede and Almere and two data centers in Dronten and Schiphol-Rijk. A statement from the Dutch authorities said they also seized laptops, telephones and more than 800 servers. A message to the-hosting customers immediately after 800 of its servers were seized by Dutch authorities. The message says that unfortunately data stored on the server has been lost and cannot be recovered. De Volkskrant said it reviewed data showing WorkTitans and MIRhosting were the most-used networks in pro-Russian attacks on Danish government bodies between November 13 and 19, 2025, the week of Denmark’s municipal elections. The publication wrote that prior to Nesterenko’s arrest, the MIRhosting founder denied that he knew his servers had been misused by pro-Russian cybercriminals. “He said he had ended all services with the Neculiti brothers when the EU sanctions came into force in May 2025,” and the he “reserved all rights to take action against ‘harmful and incorrect publications,” de Volkskrant wrote. MIRhosting released a statement saying it has initiated an internal investigation into the alleged facts concerning the elections in Denmark, and that it has temporarily paused services to WorkTitans as a precautionary measure while the matter is being reviewed further. “Based on our preliminary findings, there are no indications that the services over which we exercise control were actually used to influence the Danish elections,” the statement reads. “No anomalies or spikes were observed in our network traffic during the period mentioned in the publication; had large-scale DDoS attacks occurred, such activity would have been evident. Furthermore, prior to the media publication, we had not received any complaints, abuse reports, or official requests regarding suspicious activities or misuse of our network. Meanwhile, our regular operational activities continue, and our service to our other clients remains fully intact.” Born in Nizhny Novgorod, Russia, Mr. Nesterenko grew up as a piano prodigy who performed publicly at a young age. In 2004, Nesterenko founded MIRhosting’s parent Innovation IT Solutions Corp. , which has the notable distinction of being the company responsible for hosting stopgeorgia[.]ru, a hacktivist website for organizing cyberattacks against Georgia that appeared at the same time Russian forces invaded the former Soviet nation in 2008. That conflict was thought to be the first war ever fought in which a notable cyberattack and an actual military engagement happened simultaneously. Responding to questions shared via email, Nesterenko said MIRhosting does not support cybercrime, sanctions evasion, or illegal activity, and that the allegations and arrest by Dutch authorities have been extremely harmful to him and his company. “The transition to the.hosting was not intended to evade sanctions,” Nesterenko wrote. “The hardware and customer portfolio had already been transferred to WorkTitans before the sanctions appeared. Closing or damaging a legitimate Dutch infrastructure company will not stop cybercrime, but it will harm many people who have done nothing wrong.” Far less is public about the 57-year-old Zinad, who reportedly has been keeping a low profile since our story last year. De Volkskrant reported that Zinad blocked access to his LinkedIn account, had gone months without responding to emails, WhatsApp messages and phone calls, and told a colleague that illness was forcing him to lead a somewhat more reclusive life. Mr. Zinad’s now-defunct LinkedIn profile. It was full of posts for MIRhosting’s services. Mr. Nesterenko claims Zinad was never an employee of MIRhosting. “He helped me and MIRhosting with certain business tasks under a normal business-to-business arrangement between companies,” Nesterenko explained. However, in previous emails to KrebsOnSecurity, Nesterenko carbon copied Mr. Zinad (who had a @mirhosting.com email), explaining that he was part of the company’s legal team. Also, the Dutch website stagemarkt[.]nl lists Youssef Zinad as an official contact for MIRhosting’s offices in Almere. Mr. Zinad has never responded to requests for comment. Nor did de Volkskrant have any luck tracking him down. The publication said it repeatedly asked Mr. Zinad (referred to here as simply “Z”), but he reportedly avoided every form of contact. “‘I am unavailable but will respond to your message as soon as possible,’ reads an automated reply on WhatsApp on 2 October 2025,” de Volkskrant reported. “It is the only response de Volkskrant would receive in months. He did not pick up his phone and did not call back. When an acquaintance asked him via LinkedIn to contact the reporter, he blocked access to his LinkedIn page. At an address in Almere where Z.’s personal limited company is registered, no one was present in April. The corner house’s blinds were drawn, and a pile of rubbish bags lay outside next to a container, as if someone had recently left. A neighbour said he knew the man but did not know where he was staying. Z. was later arrested at a residence in Amsterdam.”

0 views
ava's blog 1 weeks ago

beware of EU-washing

Among all this talk of European sovereignty and switching to European alternatives in a move to better privacy and less support of Big Tech, I wish for more emphasis on not just blindly copying US products and slapping an EU label on it. I see news like the Germany’s Federal Office for the Protection of the Constitution backing away from using Palantir and using a software solution from France instead. I’m supposed to feel happy reading this, and admittedly I did not yet dig into ArgonOS deeply - but all I can think of as a first reaction is “I don’t want an EU version of Palantir.” I don’t want ‘GDPR-compliant’ facial recognition and behavioral surveillance in our cities. I don’t want more privacy-friendly warfare (???). I don’t want more tech-enabled discrimination from next door. I don’t want supposedly European alternative that’s still based on AWS and Microslop. We need to be critical and take a stand against EU-washing, in which unethical business concepts or structures get painted in a more ethical light using the (increasingly less warranted) good reputation of the EU about human rights. We aren’t better for being from a different area, or just because it’s a different company name slapped on; it’s because we are supposed to have strong consumer protections and rights, resist the promise of easy money through unlimited data mining, and stand up against fascism. I don’t want us to compete with evil; I don’t want us to stoop to that level at all. Go hard on these copycats. Taking concepts from Fascism Land isn’t worthy of praise and they don’t deserve you as a customer or fan. Make them prove it first and ask them the hard questions. Boycott their shit if it is the same garbage, go to protests, write to representatives, be vocal online, support NGO’s that work against this. No one gets a pass for being European. I won’t lower my standards and values. Reply via email Published 24 May, 2026

0 views
James O'Claire 1 weeks ago

App Marketing: Free App Analytics vs all the “Free” paywall companies

When SensorTower acquired AppMagic earlier this week it got me thinking about why. AppGoblin and many other tools offer many free and open resources for what SensorTower and AppMagic charge thousands for. Take a look at the paid vs free vs free (but limited) of the various ASO and app marketing services out there. None of them are anywhere near as expensive as SensorTower. I think that SensorTower sees this coming and wanted to acquire their biggest competitor to try keeping it’s moat as “the” destination for app analytics.

0 views
iDiallo 1 weeks ago

The commencement speech that shook the world

There he was, the man at the helm of innovation. Eric Schmidt, the former CEO of Google. The man who once said, google doesn't need to record your conversation, it already knows everything about you. Yet he didn't see this one coming. In his speech, he looked clear-eyed into the crowd of graduates and told them that AI is inevitable. There was a group of people who will have a hard time joining the workforce. Companies keep using AI as the excuse for laying off workers. Dario keeps telling us by next year, AI will take over all jobs and there is nothing we can do. They will have nothing, and they better embrace it and be happy. Well, they will have a school loan, but that’s it. If you were an external observer, maybe an alien watching humanity from a distance, you would think that AI is a new species that emerged from a lake and is taking over the world. You would never tell that the people spreading this fear are also the ones selling the tool that they swear will turn us all into gods. It's not just a capable tool that can be useful for coding, writing, and retrieving existing information. No. It's the word itself. The all or nothing. The alpha and the omega. And it comes as a monthly subscription from a handful of companies. What Mr Schmidt was saying to these graduates is that we are done innovating. Now we regurgitate. And then he was booed. He tried to keep talking but the boos were overwhelming. Somewhere between his words, he managed to say that being anti AI is akin to being anti immigrant, trying to score points. I don’t think it worked. When I read the news, I first read through the transcript of his speech. It seemed as if he was able to go through his material. But I was wrong. Watching his performance on video was a whole other story. There was a man, who had practiced his speech in front of a mirror, most likely with Gemini listening and providing feedback. It probably told him that his speech was brilliant. That it was grounded, and encouraging. "The kids will love it." What he failed to see is that booing was not in the training data. No way Gemini was going to tell him that, because it is programmed to be agreeable and friendly. So yes, he was booed every time he mentioned AI. But I think there is a lesson in innovation right there. AI learns from existing material. It can hardly deviate from what it already knows. When you ask it to come up with something new, it will show you something that seems new, because obviously you don't know all the material it was trained on. If it doesn't have the information, it can't tell you that it doesn't know. It will make it up instead. But what these graduates did was exactly what differentiates people from the robots. They did something completely unexpected. They booed Eric Schmidt. They innovated. Something tells me that kids will be just fine. I may never be able to convince my mom to stop sharing AI videos , but my kids will learn to navigate it. They will continue to innovate. Maybe AI will be like a calculator for them. They will know how and when to use it for maximum benefit and prevent the psychosis altogether. It's never gonna be a replacement for thinking. I was really happy to see the booing because it was the last thing they expected. Just a few days ago, the president of Glendale Community College was booed as well because she chose to have an AI read the graduates names and it butchered the process. AI is not inevitable. It's a nice addition. That’s it. We don't just have to accept things as they come to us. The most valuable human quality in an age of automation is exactly what AI cannot provide: the ability to defy expectations, introduce chaos, and create truly original outcomes. That is precisely the power of human innovation. For fun, I copied the commencement speech from a video and passed it through gemini, it complained about the grammar, but as I said earlier, it couldn't predict the outcome: The emotional arc is clear: acknowledge fear → restore agency → define values → point toward possibility. It reads like something that wants to meet graduates where they actually are, rather than giving them generic optimism. Then I asked it if it would result in booing: What makes the speech work is that it does not say: Instead, it says: That framing is resilient. Not so resilient after all. Anyway, congrats to the 2026 graduates! “everything will be fine,” “stop worrying,” or “technology is inherently good.” the future is undecided, humans remain morally responsible, and participation matters.

0 views

Premium: What If...We're In An AI Bubble? (Part 2)

Last week I ran the first part of my What If…We’re In An AI Bubble? Series, where I asked questions and posed scenarios as to the consequences of the many, many questions I’ve asked over the last few years. It quickly became one of my most-read articles I’ve ever written, and for those of you who joined me for the first time last week, here’s a quick list of what we’ve covered already: As I mentioned last week, I believe one of the many problems with the analysis of the AI bubble is that people are willing to consider individual facts — like that AI is too expensive for everybody involved and data centers are not being built at the speed that we believed — but never the gestalt of their consequences.  For example, if data center construction slows to a crawl ( as I’ve discussed is already the case ) there’s a cascade of events that will occur: It’s really easy to say “wow, this stuff needs a lot of debt!” and “wow, this stuff takes a while!” but actually sitting and thinking about what that means logically leads you to some gruesome outcomes.  And to be clear, there’s not really an alternative to that scenario if data center construction slows. Even in an optimistic scenario, if data centers that started being built in 2024 don’t get finished until 2027 or 2028, that means that NVIDIA’s “latest” GPUs are perennially two or three years in the future.  While some capacity exists, I believe there are at least one million Blackwell GPUs sitting in warehouses waiting to be installed years into the future, which means that projects are going to launch in a year or two with potentially three-year-old GPUs, or said projects are going to have to either replace their orders with Vera Rubin or dump aged capacity onto a market saturated with Blackwell GPUs. The argument against what I’m saying is that there’s “insatiable” demand for AI compute — that “any viable compute on the market will be used,” which is true in measurements of days or months, but breaks down in the space of a year. As I mentioned a few weeks ago, AI’s demand story is a lie , because capacity is mostly taken up by Anthropic and OpenAI, creating the illusion of demand by absorbing most available inventory, while simultaneously obfuscating the fact that other sources of demand are simply non-existent in any meaningful numbers..  Many are conflating “there’s not much available” with “there’s so many people that want GPUs” without quantifying what “so many” means or how much they want, when the remaining performance obligations from Google, Amazon, and Microsoft have, outside of OpenAI and Anthropic, effectively plateaued, as is also the case when you remove these companies from CoreWeave order book.  If there were incredible, insatiable, indisputable demand, RPOs would be exploding across the board. Instead, nobody seems interested in buying capacity at scale outside of Anthropic, OpenAI, and the hyperscalers supporting them — or, in some cases, the likes of NVIDIA providing backstops to compute providers, agreeing to buy surplus compute in the case that they’re unable to sell it themselves. This is, to be clear, something that shouldn’t happen if there was genuine, distributed demand.   The sheer scale of the supposed AI data center buildout is in the tens of gigawatts of capacity, which translates to  $10 billion to $15 billion per gigawatt in annual revenue. I can find no examples of anybody but Anthropic and OpenAI spending billions on compute.  Both companies need to make or raise a combined $1.25 trillion in the next four years to afford their compute commitments across Oracle, Microsoft, Google, Amazon and CoreWeave.  The counter-argument to everything I’m saying is effectively two points: The latter is far from compelling, but I can see how somebody would believe it.  So much money appears to be flooding into companies like AMD, Samsung, and Sandisk — tens of billions of dollars to the point that it’s creating shortages across basically every component imaginable — which naturally might make you think that demand would exist at the other end. For the consumer, that perception becomes even more believable when you notice how consumer electronics are getting more expensive. Certain games consoles, nearly six years after their initial release, are more expensive than they were at launch. Typically, the inverse is true.  Meanwhile, smartphones and PCs are expected to ship with weaker specs or high prices, in part because of shortages of key components, caused by demand for AI data center hardware.  The thing is, demand for AI compute doesn’t have to exist for AI data centers to get built. While some have clients signed up in advance, said deals were signed so many years before construction will complete that it’s hard to guarantee that they’ll be willing — or solvent enough — to pay.  I also imagine most clients have signed contracts that have milestone dates for delivery of compute capacity. If data centers are delayed, clients likely have a contractual out, much like Microsoft does with its $17 billion compute deal with Nebius . In any case, in a frothy debt market full of desperate speculation, these projects are being funded by the very same private credit firms that piled into SaaS companies between 2018 and 2022 under the assumption that every software company will grow in perpetuity. When due diligence is so weak in private equity and private credit that Apollo’s John Zito says that their valuations are “ all wrong ,” it’s hard to believe that the same financiers are diligently making sure that enough revenue exists to justify these massive data center debt deals. The same questionable attention to detail applies to venture capital, which has seen ( much like private equity ) its investment model slow to a crawl since 2018 , with an average TVPI (total value paid in) slow to a horrifying 0.8 to 1.2x since 2018, meaning that for every dollar invested, you’re at best likely to get even money in return.  These are the very same investors telling you that every AI company is worth perpetually-growing amounts of money, that everything will work out perfectly , that somebody will work out how to make AI profitable, and that AI is both here to stay and doing incredible things , even if they can’t really explain what those things might be. In reality, none of these people have any idea how to turn around these rotten economics. Data centers are massive money-losing operations that in the best case scenario take five years to make a single dollar of margin, and their customers are eternally-unprofitable AI startups that rely on a constant flow of venture capital dollars.  The AI bubble is entirely built by people who hope somebody else will solve their problems. AI labs depend on venture capitalists to fund them, hardware providers to invent silicon that makes their businesses profitable, and their AI startup clients to find ways to make profitable businesses using their APIs. In turn, AI startups rely on AI labs to work out a way to make their models cheaper so that AI startups can make their business models profitable.  Put another way, everybody’s response to “how does this become profitable” is “don’t worry, somebody will work it out, but don’t worry, they’re going to at some point.” Today, I want to explore what happens if they don’t.  What if…We’re In An AI Bubble?  What If The AI Industry Moves To Entirely Token-Based Billing?  What If Organizations Can’t Afford To Keep Spending On AI? What If The AI Capacity Crunch Never Ends (And Data Centers Aren’t Getting Built)? What If CoreWeave Can’t Keep Up With Its Capacity Demands? What If Hyperscalers Can’t Build Data Centers Very Fast? What If Hyperscalers Have Warehouses of Uninstalled GPUs? What If Hyperscalers Write Off A Large Chunk of GPUs? What If Data Center Construction Demand Collapses?  OpenAI and Anthropic can’t expand much further than their current capacity. As they both make up 50% of Amazon, Google and Microsoft’s revenue backlogs , hyperscalers will be unable to make the majority of the revenue they’ve promised their shareholders. The $178.5 billion in US data center debt from 2025 will go mostly unpaid, as a great deal of it is project financing that’s dependent on revenue from data centers that won’t be built and thus won’t be making any revenue. NVIDIA, which claims to have shipped over 3 million Blackwell GPUs in 2025, will have trouble selling its next-generation Vera Rubin GPUs, as nobody will have anywhere to put them. Alternatively, we’ll see write offs of billions of Blackwell GPUs that will now be considered obsolete. Banks that are already afraid of “choking” on data center debt will stop issuing it, because these investments will not be paying off. It will become very difficult for anybody to afford to buy more NVIDIA GPUs, because AI data centers — which cost around $44 million per megawatt — require massive amounts of upfront capital expenditures, making it unlikely-to-impossible that somebody has the money lying around. That the amount of revenue flowing to both NVIDIA and associated hardware companies making CPUs, RAM, and solid-state storage is proof that there’s demand for…services run on them. What if venture capital funding stops flowing to AI startups? What would make venture capital stop funding AI startups? What if most AI startups go to zero? What if OpenAI and Anthropic became AI’s lender of last resort?  What if AI broke venture capital’s back?  What if inference isn’t profitable?

0 views

News: OpenAI Had A Negative 122% Non-GAAP Operating Margin In Q1 2026, and ChatGPT Growth Has Stalled

New revelations about OpenAI’s finances paint a dim picture for the company, as The Information reported it generated just $5.7bn in the first quarter of 2026, with an adjusted operating margin of -122%.   This means that for every dollar of revenue the company generated, it lost $1.22.  As The Information’s Sri Muppidi noted , these operating margins were adjusted — and, presumably, didn’t conform to GAAP (or generally accepted accounting principles) standards — and excluded certain “large line items”, like stock-based compensation.  By that maths, that means that OpenAI lost $6.95 billion in the quarter, and because this is non-GAAP, it’s quite possible that losses are much higher, revenues are lower, and its margins are worse. The piece does not specify if operating margin includes or excludes training costs, nor does it break down what other exclusions there may be other than stock-based compensation. The report also claims that OpenAI is “on track” to hit its goal of generating $30bn in revenue for 2026, although if it maintains these disastrous margins, it would end up losing $36.6bn.  Meanwhile, ChatGPT’s user growth has stalled. While weekly active users hit 920m in February, the average for the quarter sat at 905m, suggesting lower numbers in either (or both) January or March. OpenAI had expected to hit 1 billion weekly active users in 2025. This suggests that ChatGPT’s growth has stalled. As I’ve noted in the past, weekly active users are a fairly novel metric, with most companies using monthly active users to represent adoption. I’ve also speculated that the reason why OpenAI has favored this metric is because it’s easy to manipulate . OpenAI reportedly had 55m paying ChatGPT customers at the end of Q1 — up from 47m people at the end of the year.  Assuming a userbase of 905m users, this means that OpenAI has a conversion rate of roughly 6%. It's likely worse, as monthly active users should, at least in theory, be a higher number, as it captures every weekly user in addition to less-active users over the course of a month. Nevertheless, while this represents an improvement over the 2.583% rate in February of last year , it’s likely improved as a result of cheaper ad-supported ChatGPT “Go” subscribers at $5 or $8 a month, depending on geography. OpenAI also gave away a free annual ChatGPT Go subscription to literally every Indian subscriber in late October 2025 , though I cannot confirm if they’re counted in the total. As I wrote up yesterday , Anthropic leaked (or had leaked) that it believed it would have a non-GAAP EBIT operating profit in Q2 2026 entirely as a result of Elon Musk discounting two months of compute costs for that specific quarter , and it makes me wonder why we’re suddenly, in the space of 24 hours, talking about operating margins or operating profits for two companies that have hidden behind annualized revenues and obfuscated financials for several years. If I had to guess , it’s likely that investors have begun to demand firmer, more “real company”-adjacent numbers, and while Anthropic was able to find a clever way to manipulate them as a means of raising funding, OpenAI was forced to share numbers a little closer to reality. What’s clear is that we’re in an information war between two companies that burn billions of dollars, with one of them ( OpenAI ) allegedly planning to file for an IPO as soon as today .  Anthropic clearly wants to position itself as the stable, reliable, economically viable alternative to OpenAI, but can only do so with a kind of financial engineering only made possible in a media climate bereft of scrutiny.  Nothing has changed about the core economics of generative AI to suddenly make things profitable, other than the ingenuity of CFO Krishna Rao and his willingness to move numbers around a spreadsheet.  Nevertheless, it’s interesting that Anthropic appears to be leapfrogging OpenAI in revenue. In early May, Anthropic claimed to have $45bn in ARR . By contrast, in March, OpenAI claimed to have topped $25bn in ARR . While OpenAI brought in a billion dollars more than Anthropic in Q1 2026, The Information couldn’t get ahold of OpenAI’s numbers for Q2 2026, but at $45 billion in ARR - $3.75 billion in a month - Anthropic may have taken the lead. That is, of course, if its numbers actually line up with reality, something I’ve disputed multiple times .  Nevertheless, if investors become convinced that OpenAI is falling behind, it’ll be much harder to raise another round at or above its current $852 billion valuation.  Perhaps that’s why OpenAI is rushing to go public - it realizes it might have tapped out private investors. If you liked this piece, you should subscribe to my premium newsletter. It’s $70 a year, or $7 a month, and in return you get a weekly newsletter that’s usually anywhere from 5,000 to 18,000 words, including vast, detailed analyses of  NVIDIA ,  Anthropic and OpenAI’s finances , and  the AI bubble writ large . My Hater's Guides To  Private Credit  and  Private Equity  are essential to understanding our current financial system, and my guide to how  OpenAI Kills Oracle  pairs nicely with my  Hater's Guide To Oracle . This week, I’ll publish the second part to my ongoing series (“ What If…We’re In An AI Bubble? ”) about the factors and events that will cause the AI bubble to finally pop.  Subscribing to premium is both great value and makes it possible to write large, deeply-researched free pieces every week.  The Information reports that OpenAI generated $5.7bn in revenue for the first quarter of 2026 based on discussions with sources familiar with its financials. With adjusted negative margins of -122%, this means that for every dollar of revenue OpenAI made, it lost an additional $1.22, or around $6.95bn on a non-GAAP basis. OpenAI is "on track" to hit goal of $30bn in 2026 revenue, but margins suggest losses of over $36.6bn. OpenAI continues to struggle converting free ChatGPT users to paying customers, and overall user growth has stalled.

0 views

Anthropic's "Profitability" Swindle

Yesterday, the Wall Street Journal ran a story about how Anthropic is “about to have its first profitable quarter,” specifically an operating profit, or EBITDA profitability: Interesting! That’s a lot of certainty considering we’re barely through the first half of the second quarter, and quite a specific number given the fact that June hasn’t started! And all of these numbers are mysteriously leaking exactly while it raises its funding round! Oh there’s also one important note: The Journal adds at the bottom of the article that “ ...it is unclear what accounting methods Anthropic has used to book revenue and costs, as the company isn’t yet required to follow the financial-reporting requirements of a public company. ” That’s right —-- Anthropic is possibly going to be EBITDA profitable for a single quarter, on a non-GAAP basis.  Anyway, I wonder how Anthropic did it? Because based on this unhelpfully-labeled diagram from the Journal, it appears ( as I said last year ) that its costs scale linearly with its revenues, except they…magically didn’t in the second quarter? I wonder if it'll stay profitable? That’s also interesting. So Anthropic may be profitable very specifically in Q2 2026 , but might not be afterward. It’s almost as if it found a way to specifically cut its costs in May and June somehow… …because it did! Remember that deal Anthropic signed with SpaceX to take over Colossus-1 ? Well it’s also taking over some or all of Colossus-2, paying SpaceX $1.25 billion a month starting in May and June… when it’ll have a reduced fee as it ramps up! Per SpaceX’s S-1 : That’s $15 billion a year in compute costs, but reduced to an indeterminately-discounted level for the precise months that Anthropic is using to tell investors and the media that it has an operating profit. That operating profit is a result of accountancy rather than any improvements to its business model. While I wouldn’t say this is cooking the books, it’s definitely a shiatsu-grade massaging of the numbers. Anthropic has deliberately leaked a quarterly “profit” where it knows it can suppress its costs, specifically made sure that the journalist gave it the out of “costs might increase,” and released it on the day of NVIDIA’s earnings as a means of keeping the AI bubble inflated. Nothing has changed. If Anthropic paid full-rate for its compute in those two months, its economics would shift back to what they’ve always been per my reporting from last year on its AWS costs — a business that has costs that linearly increase with its revenue growth. I also severely doubt that Anthropic managed to make the cost of running its services profitable in the space of six months. Per The Information in January , Anthropic missed on its gross margin projections, saying that its inference costs were 23% higher than the company had anticipated. How did Anthropic, which faced a massive influx of new business to the point that Anthropic was forced to buy more compute from Elon Musk , magically become profitable? Other than that discount, of course. I have a few guesses: Nevertheless, the revenue side is where the real problems lie. So, Anthropic has said it brought in $4.8 billion in revenue in Q1 2026, and projects to hit $10.9 billion in Q2 2026. This is tough to reconcile with previous reporting. On February 12, 2026, Anthropic claimed it had reached $14bn in annual recurring revenue (ARR) . As a reminder, ARR is an accounting tool largely used primarily by startups, where a snapshot of a single month’s income is taken and multiplied by twelve. This gives you an implied monthly revenue of roughly $1.17bn.  On March 3, 2026, Dario Amodei would claim Anthropic had reached $19bn in ARR — which works out to $1.58bn per month . Two days later, on March 9, Krishna Rao — Chief Financial Officer at Anthropic — would declare under oath in a court filing that Anthropic had brought in revenues “exceeding $5 billion to date. ”  Keep in mind that The Information had previously reported that Anthropic had $4.5 billion in revenue in 2025 , which I already found difficult to match with Rao's statements. While boosters may claim that “exceeding” could mean literally any number they want above $5 billion, I find it doubtful that the CFO of Anthropic would, under oath, lead the court to believe its business was 30% to 40% smaller than it was, especially when trying to convince it that the damage of being labeled a supply chain risk would ruin its business. At this point it’s impossible to reconcile the 2025 reporting with that $5 billion number. If we assume that the ARR claims made by Anthropic are correct, we can presume that it made revenues of roughly $2.5bn in March ( given that it claimed it had $30 billion in ARR on April 6 ), $1.58bn in February, and $1.17bn in January, for a total of $5.25 billion.  I realize that figure is in excess of what the Wall Street Journal had and, in some world, those numbers could be cherry-picked using particular periods to the point that the real revenues would be in the region of $4.8 billion. That's possible. But they don’t make a lick of sense when you bring up what Krishna Rao said. If we believe Anthropic’s leaks —-- putting aside all of the ARR figures for a second —-- this means that Anthropic: While I acknowledge that Anthropic has grown significantly, that level of stratospheric growth does stretch the limits of credibility. Moreover, the fact that previous ARR figures are inconsistent with the leaked charts from Anthropic further raises questions about the credibility of any numbers from the company.  The only real defense that anybody has here is that Krishna Rao, under oath , lowballed the US government and a judge to such a dramatic extent that he hid in excess of $4 billion in revenue.  And as I’ve discussed before — and FlyingPenguin helpfully collated — adding up Anthropic’s previously-reported ARR from January 2025 to March 3, 3rd 2026 already gets us to around $6.66 billion.  I can imagine this has felt like a big victory for boosters — proof that AI can be profitable, that inference is profitable, that some sort of business model is emerging…and I’m sorry, that’s not what’s happening. Dario Amodei and Elon Musk worked out a sweetheart deal, which they - framed as a “ramp-up,” - that allowed Anthropic to artificially depress its costs. I also question how much of a ramp-up there really was, or what Anthropic’s actual compute constraints were, because it immediately loosened rate limits for Claude subscribers on announcing the deal , meaning that it immediately started having higher inference costs, which…somehow led to it making a higher profit? Or did Musk — as literally described in its S-1 — have SpaceX charge Anthropic less for two specific months to make the numbers look better? In July, Anthropic will start paying SpaceX $1.25 billion a month,  - or $15 billion a year, - on top of all of its other compute deals with Google, Amazon and Microsoft.  If we assume that its spend is comparable on AWS and Google Cloud — and it’s most-assuredly more! — that means Anthropic is spending around $3.75 billion in compute costs, or $11.25 billion a quarter, or $45 billion a year.   There’s also a very compelling argument that Anthropic’s costs will increase and will eat up that profitability , to once again quote the Wall Street Journal: I also have to wonder: if you’re so profitable, why not IPO? Why not take this to the public markets?  Unless, of course, you’re only non-GAAP EBITDA profitable based on a two-month-long discount specifically covering the period in which you’re profitable. And, of course, when you’re not a publicly-traded company, and so you don’t actually have to publish any numbers (and no, leaking them doesn’t count), and you’re not subject to SEC oversight.  I will give Dario Amodei credit: nobody does financial engineering and a press-led information war better than Anthropic. The willingness of the press to eat up incongruent numbers and the eagerness of many to jump up and find obtuse ways to explain away the obvious problems is only made possible when a company has perfected the art of manipulation and ingratiation of those who want to feel like they’re “first.” If you take this as incontrovertible proof that Anthropic is profitable, you are deliberately ignoring the blatantly obvious ways these numbers are being massaged. We’ve got its CFO saying numbers that don’t match up with these leaks or Anthropic’s own marketing materials, and the aggressive and deluded way in which many people ignore them is equal parts frustrating and depressing.  Let me speak directly and with more empathy than usual: if you want Anthropic to win, you should be just as skeptical of these numbers as I am. You should want to smash my face in the tarmac with the most crystal-clear, impossible-to-argue with numbers, bereft of asterisks or discounts from suppliers or obfuscated accounting metrics.  You should want better from your heroes. If you truly think this company is amazing, unstoppable, and leading the tech industry to a glorious era of innovation, there shouldn’t be this many questions, and the metrics shouldn’t be this murky .   Every other time when a company has played this level of silly, weird bullshit has led to disaster — for example, WeWork claimed to be profitable since the second month of its operations , and repeated claims of profitability throughout its existence , and it turned out that it was only “profitable” if you removed things like “ some of the costs of doing business .” I get why you’re so defensive, and I get why you want this to work. A lot of you are very excited about generative AI, and being excited about it has given you a tremendous community of equally-excited people. I get that you like these tools.  And I need you to know these companies are laughing at you.  Anthropic timed this leak to focus on a specific quarter where it artificially suppressed costs, and gave you the flimsiest proof imaginable, specifically-crafted for you to share it as a triumph and spread the idea that “AI labs are actually profitable,” when their core economics haven’t changed. Costs increase linearly with revenue, and will continue to do so in perpetuity.  I genuinely can’t wait for both OpenAI and Anthropic to file their S-1s. If you liked this piece, you should subscribe to my premium newsletter. It’s $70 a year, or $7 a month, and in return you get a weekly newsletter that’s usually anywhere from 5,000 to 18,000 words, including vast, detailed analyses of  NVIDIA ,  Anthropic and OpenAI’s finances , and  the AI bubble writ large . My Hater's Guides To  Private Credit  and  Private Equity  are essential to understanding our current financial system, and my guide to how  OpenAI Kills Oracle  pairs nicely with my  Hater's Guide To Oracle . This week, I’ll publish the second part to my ongoing series (“ What If…We’re In An AI Bubble? ”) about the factors and events that will cause the AI bubble to finally pop.  Subscribing to premium is both great value and makes it possible to write large, deeply-researched free pieces every week.  For large enterprises, Anthropic is taking prepayment of tokens —-- say, $50 million intended to be spread over 12 months that it takes in as revenue. This would both inflate revenue numbers and depress costs, because Anthropic wouldn’t have actually provided the compute necessary to earn that revenue yet. Anthropic is already offering discounted tokens for Claude users through the “buy extra credits” page on their accounts, with discounts ranging from 10% to 30%. It may very well be booking this up-front. Anthropic could be front-loading annual commitments of any kind —– subscriptions to Claude, enterprise or team agreements, and so on. Anthropic could have ratcheted down training to ease the burden on its infrastructure to provide inference.  Made over 90% of its lifetime revenues in the first quarter of 2026.,  Made virtually no revenue in its previous years, and…  Leaked completely imaginary run rates to the media for years.

0 views