On June 24, the company the world thinks of as Nvidia's single biggest customer showed off a processor Nvidia had nothing to do with. OpenAI unveiled Jalapeño, its first custom chip, co-designed with Broadcom and built for one job: inference, the work of running a trained model every time someone types a prompt. The chip is real and the partnership is real. Most of the performance numbers traveling with it are not yet anything you can check.
OpenAI built a chip for the part of AI that never stops billing
Jalapeño is an ASIC, a chip designed for a single purpose instead of for general computing. Its single purpose is inference: the step where a finished model answers a query, the part that fires on every message you send ChatGPT. Training a model happens once. Inference happens forever, on every prompt, from every user, across every product the company ships.
OpenAI calls it an Intelligence Processor. It was co-developed with Broadcom to run OpenAI's full stack top to bottom, and manufactured by TSMC. Initial deployment is targeted for the end of 2026, the first chip in what the two companies describe as a multi-generation platform. That is the verified spine: a real chip, a named foundry, a real date.
The 50 percent everyone repeated belongs to Broadcom, not OpenAI
Here is where the coverage blurred. The figure that traveled across every feed, that Jalapeño performs on par with Nvidia's Blackwell chips and Google's in-house silicon while running inference at roughly half the cost, came from Broadcom CEO Hock Tan, describing early lab testing. It is a vendor's claim about its own product, made before any independent benchmark exists.
OpenAI's own claim is narrower and more careful. The company says only that early testing shows performance per watt "substantially better than current state-of-the-art", and it adds that final performance is still being measured. Those are not the same statement. One is a number you could build a business case on. The other is a hedge.
Nobody outside the two companies has run the chip. Read the 50 percent as a target, not a result.
OpenAI used its own models to help design the chip
The detail that should have led the coverage sits a few paragraphs down in the announcement. OpenAI went from blank page to tape-out in about nine months, the point where a design is frozen and handed to the factory, and it calls that the fastest advanced-chip design cycle it knows of. A leading-edge processor usually takes years. They did it in three quarters of one.
How? In part by pointing its own AI models at the design work. The company that builds the models aimed them at the problem of building the chip that runs the models. Greg Brockman, OpenAI's president, said the degree of acceleration "was very surprising to us." When the people closest to a technology are surprised by what it did, that is worth sitting with.
EE Times built its whole headline around the same point: the pepper gets the laugh, and the nine-month design loop gets the footnote it does not deserve.
Owning the model was never enough; OpenAI wants to own the cost
Every model OpenAI serves runs on hardware it buys from a competitor. Nvidia supplies the chips inside the overwhelming majority of AI data centers, which means OpenAI's core economics have always carried Nvidia's margin inside them. You cannot set the price of a thing you rent from your rival.
This is the path Google walked with its TPUs and Amazon walked with Trainium: design the chip, own the stack, and stop paying a competitor for the most important part of the business. OpenAI is the largest company built purely on AI to follow them down it. The bet is not that Jalapeño wins a benchmark. What OpenAI is betting is that owning the chip lets it bend the cost of every answer it sells, on its own schedule, without asking anyone.
Why it matters
Inference is a recurring bill. Training is a capital cost you pay once to produce a model; inference is an operating cost you pay on every interaction, forever, and it climbs with every new user. As AI turns from a thing people try into a thing people lean on all day, that bill becomes the dominant line in the business. Control the cost of inference and you set the economics of consumer AI.
That is the lever Jalapeño reaches for, and it is why a chip nobody has independently tested is still a serious move. The performance claims may prove inflated. Its strategic logic does not depend on them. A chip that merely matches Nvidia at lower cost, on hardware OpenAI owns outright, already changes who depends on whom.
And the quieter loop underneath keeps turning. OpenAI used its models to help design a chip that will run its models faster, which will help design the chip after that. Each turn of the wheel leaves the company a little less dependent on the supplier it rents from, and on the slow human pace that chip design used to demand.
OpenAI joined Google and Amazon in a club defined by a single rule: depend on no one for the thing that prints your money. So when one company owns the model, the chip that runs it, and the tools that design the next chip, is that a company finally standing on its own, or the early shape of a monopoly that happens to call itself open?
Originally published as an Instagram carousel on @recul.ai.