GadgetBond

  • Latest
  • How-to
  • Tech
    • AI
    • Amazon
    • Apple
    • CES
    • Computing
    • Creators
    • Google
    • Meta
    • Microsoft
    • Mobile
    • Samsung
    • Security
    • Xbox
  • Transportation
    • Audi
    • BMW
    • Cadillac
    • E-Bike
    • Ferrari
    • Ford
    • Honda Prelude
    • Lamborghini
    • McLaren W1
    • Mercedes
    • Porsche
    • Rivian
    • Tesla
  • Culture
    • Apple TV
    • Disney
    • Gaming
    • Hulu
    • Marvel
    • HBO Max
    • Netflix
    • Paramount
    • SHOWTIME
    • Star Wars
    • Streaming
Add GadgetBond as a preferred source to see more of our stories on Google.
Font ResizerAa
GadgetBondGadgetBond
  • Latest
  • Tech
  • AI
  • Deals
  • How-to
  • Apps
  • Mobile
  • Gaming
  • Streaming
  • Transportation
Search
  • Latest
  • Deals
  • How-to
  • Tech
    • Amazon
    • Apple
    • CES
    • Computing
    • Creators
    • Google
    • Meta
    • Microsoft
    • Mobile
    • Samsung
    • Security
    • Xbox
  • AI
    • Anthropic
    • ChatGPT
    • ChatGPT Atlas
    • Gemini AI (formerly Bard)
    • Google DeepMind
    • Grok AI
    • Meta AI
    • Microsoft Copilot
    • OpenAI
    • Perplexity
    • xAI
  • Transportation
    • Audi
    • BMW
    • Cadillac
    • E-Bike
    • Ferrari
    • Ford
    • Honda Prelude
    • Lamborghini
    • McLaren W1
    • Mercedes
    • Porsche
    • Rivian
    • Tesla
  • Culture
    • Apple TV
    • Disney
    • Gaming
    • Hulu
    • Marvel
    • HBO Max
    • Netflix
    • Paramount
    • SHOWTIME
    • Star Wars
    • Streaming
Follow US
AIBusinessOpenAITech

OpenAI and Broadcom unveil Jalapeño, their first custom AI inference chip

OpenAI is done renting compute. With Jalapeño, the company is building its own silicon optimized for the workloads that power ChatGPT, Codex, and beyond.

By
Shubham Sawarkar
Shubham Sawarkar's avatar
ByShubham Sawarkar
Editor-in-Chief
I’m a tech enthusiast who loves exploring gadgets, trends, and innovations. With certifications in CISCO Routing & Switching and Windows Server Administration, I bring a sharp...
Follow:
- Editor-in-Chief
Jun 25, 2026, 4:02 AM EDT
Share
We may get a commission from retail offers. Learn more
OpenAI and Broadcom leaders display the Jalapeño inference chip.
Photo: OpenAI
SHARE

OpenAI and Broadcom have just pulled back the curtain on Jalapeño, their first custom AI inference chip — and it’s a moment that signals something bigger than just another silicon announcement. This is OpenAI’s declaration that it’s done renting compute; it wants to own the full stack, from the model weights down to the transistors that crunch them.

The chip made its debut on June 24, 2026, delivered physically to Sam Altman and Greg Brockman by Broadcom CEO Hock Tan and President Charlie Kawwas. It’s a striking visual: the AI software company’s leaders holding the actual piece of custom silicon that will power the next generation of their products. But the real story isn’t the handoff ceremony — it’s what this chip represents in the escalating arms race for AI infrastructure.

The nine-month miracle

Here’s the number that should make semiconductor veterans sit up: nine months. That’s how long it took to go from blank sheet to tape-out — the industry term for sending a finished chip design to the fab for manufacturing. In the world of high-performance ASICs (application-specific integrated circuits), that’s virtually unheard of. Typical development cycles stretch 18 to 24 months, sometimes longer.

How did they pull it off? The answer sits at the intersection of software-hardware co-design and, well, AI designing AI. OpenAI’s own models were used to accelerate portions of the chip design and optimization process. The same models that power ChatGPT and Codex helped engineers lay out circuits, optimize memory hierarchies, and validate timing closure faster than human teams alone could manage. It’s a recursive loop: AI designs the chips that run AI better, which then designs even better chips.

Richard Ho, who leads OpenAI’s hardware program, puts it plainly: “Jalapeño was designed from the ground up for LLM inference using detailed insights from our close collaboration with OpenAI researchers. We optimized the architecture around the kernels, memory movement, networking, and serving patterns that matter most for frontier AI models.“

Not another generic accelerator

What makes Jalapeño different from the parade of AI chips we’ve seen over the past few years? For starters, it’s not a training chip. It’s purpose-built for inference — the act of actually running a model to generate answers, write code, or power an agent. That’s a crucial distinction. Training gets the headlines, but inference is where the money gets spent and where users feel the latency.

Most AI accelerators on the market today — NVIDIA‘s H100 and H200, AMD’s MI300X, Google‘s TPUs — were originally architected with training in mind, then adapted for inference. Jalapeño went the other direction. It’s a blank-slate design informed by the actual workloads OpenAI runs every day across ChatGPT, Codex, the API, and their upcoming agentic products. The architecture reduces data movement, balances compute and memory resources, and targets realized utilization much closer to theoretical peak performance.

Broadcom’s contribution goes beyond just manufacturing. They brought their Tomahawk networking silicon — the same high-speed interconnect technology that powers hyperscale data centers — and their deep expertise in taking custom designs to volume production. Celestica rounds out the partnership, handling board, rack, and system integration.

The ASIC vs. GPU trade-off

Industry analysts have been quick to point out the inherent tension here. An ASIC like Jalapeño is, by definition, less flexible than a GPU. You can’t just recompile your PyTorch model and run it; the software stack has to be co-developed alongside the hardware. But that specialization buys you something NVIDIA’s general-purpose GPUs can’t match: performance per watt and cost efficiency at scale for a specific workload.

Broadcom CEO Hock Tan didn’t mince words when speaking to press: the chip is “just as good” as NVIDIA’s Blackwell GPUs and Google’s TPU data center tools in practice. That’s a bold claim, and OpenAI says a detailed technical report with benchmarks is coming in the months ahead. Early testing shows “performance per watt substantially better than current state-of-the-art.“

The ten-gigawatt ambition

This isn’t a one-off project. The Jalapeño announcement is the first milestone in a partnership announced back in October 2025, targeting 10 gigawatts of custom AI accelerator capacity. To put that in perspective: 10 gigawatts is roughly the output of five Hoover Dams. It’s the energy consumption of more than 8 million U.S. homes. OpenAI and Broadcom aren’t building a chip; they’re building a multi-generation compute platform with deployment beginning in late 2026, significant ramp in 2027, and full-scale operations targeted for the first half of 2028.

The rollout plan is phased: “small development” deployments in late 2026, then scaling aggressively. Microsoft — OpenAI’s primary cloud partner and investor — is expected to be a key deployment partner for these gigawatt-scale data centers.

Following the vertical integration playbook

If this strategy sounds familiar, it should. Google pioneered the custom AI chip path with TPUs starting in 2015. Amazon followed with Trainium and Inferentia. Meta has its MTIA program. Microsoft has Maia. Apple’s M-series chips showed the industry what vertical integration looks like at the consumer scale. OpenAI is now executing both the Google TPU playbook and the Apple vertical integration model simultaneously — combining hardware sovereignty with software and ecosystem control.

The logic is straightforward at OpenAI’s scale. When you’re serving hundreds of millions of users and burning through compute budgets that rival the GDP of small nations, even a 10% improvement in inference efficiency translates to massive savings. Custom chips can cut computing costs by an estimated 40-50% according to industry analysts. They also free you from a single supplier’s roadmap, pricing, and allocation constraints.

The networking piece nobody talks about

There’s a quieter but potentially more disruptive element to this announcement: networking. Broadcom’s Tomahawk Ethernet silicon is being positioned as a direct alternative to NVIDIA’s InfiniBand ecosystem. For years, NVIDIA has used its Mellanox acquisition to lock customers into InfiniBand for high-performance AI clusters. Broadcom and OpenAI are betting that standard Ethernet — with the right silicon and software stack — can match or beat InfiniBand for LLM inference workloads.

If they’re right, it chips away at one of NVIDIA’s deepest moats. The networking layer is where multi-chip, multi-rack, multi-data-center scaling lives or dies. Controlling that layer means controlling how efficiently you can shard models across thousands of accelerators.

What this means for the rest of the industry

Jalapeño arrives at an inflection point. NVIDIA’s Blackwell architecture is ramping. Google just announced TPU v8 with separate training and inference variants. Amazon is reportedly in talks to sell Trainium chips directly to outside companies. AMD’s MI350 series is on the horizon. The “GPU-only” gold rush has definitively ended; we’re now in the hyper-specialized accelerator era.

For OpenAI specifically, this is about more than cost savings. It’s about product differentiation. When you control the silicon, you can optimize for the specific serving patterns your products need — whether that’s ultra-low latency for interactive ChatGPT sessions, high-throughput batch processing for Codex, or the unique memory access patterns of agentic workflows that take dozens of steps. The “flywheel” Greg Brockman describes is real: better infrastructure enables better models, which become better products, which drive more usage and revenue to fund the next generation of infrastructure.

The skeptic’s checklist

It’s worth keeping a few caveats in mind. First, we haven’t seen independent benchmarks yet — only OpenAI’s early internal testing claims. Second, the software stack (kernels, compilers, runtime, scheduling) is arguably harder than the hardware, and OpenAI is building much of it from scratch. Third, deploying at a gigawatt scale involves supply chain, power, cooling, and real estate challenges that make chip design look simple by comparison. Fourth, the AI model landscape shifts fast; a chip optimized for today’s transformer architectures might need significant changes for whatever comes after.

Jalapeño isn’t just a chip. It’s a statement of intent. OpenAI is signaling that it intends to be a full-stack AI company — not a model lab that rents compute from cloud providers, but an infrastructure company that controls its own destiny from the transistor up. Whether they can execute on the ambition remains to be seen, but the industry just got a lot more interesting.

The next time you type a prompt into ChatGPT and get a near-instant response, there’s a growing chance that the electrons moving through the data center are flowing through silicon that OpenAI designed itself. That’s a profound shift — and Jalapeño is just the first bite.


Discover more from GadgetBond

Subscribe to get the latest posts sent to your email.

Topic:ChatGPTOpenAI Codex
Leave a Comment

Leave a ReplyCancel reply

Most Popular

Anthropic bundles chat, Cowork, and Code into one enterprise desktop app

Elon Musk confirms “Starmind” as SpaceX’s AI satellite constellation name

Perplexity unveils a legal-specific AI Computer for Counsel

OpenAI calls developers to DevDay 2026 – apply before July 10

Summer Sale gives Nothing’s lineup a more tempting price tag

Also Read
Airline seatback inside a Southwest Airlines aircraft featuring a promotional card announcing Starlink WiFi service. The sign reads “It’s Here! You’re on one of the first planes featuring Starlink WiFi,” with Southwest and Starlink branding displayed at the top. A smartphone mounted on the tray table shows the onboard internet portal offering free WiFi access. The image highlights the rollout of Starlink’s high-speed satellite internet service on Southwest Airlines flights.

Southwest Airlines now has Starlink WiFi onboard

View from inside an airplane cabin showing a passenger holding a smartphone near an oval aircraft window. Outside, the airplane wing extends above a blanket of clouds under a blue sky. The image highlights in-flight connectivity and mobile device usage during air travel, commonly associated with onboard internet services such as Starlink Aviation.

Starlink Wi-Fi launches on American Airlines flights in early 2027

Overhead view of a person working at a wooden desk, typing on a laptop surrounded by a notebook, smartphone, and a cup of coffee. Large promotional text across the image reads “Tag @Claude in,” with “@Claude” highlighted inside a salmon-colored rounded label. The warm-toned workspace and productivity-focused setting illustrate Anthropic’s Claude AI being referenced or included in conversations and workflows.

The logic behind Claude Tag’s identity model

A blurred, warmly lit office or workspace forms the background of a promotional graphic featuring the text “@Claude” in large white serif lettering inside a rounded salmon-colored label. The soft-focus scene includes shelves, furniture, and ambient lighting in shades of brown and orange, creating a professional and inviting atmosphere associated with Anthropic’s Claude AI assistant.

Anthropic launches Claude Tag beta for enterprise and teams

Intricate abstract blue and purple 3D geometric art with smooth curves and bold contrasts.

OpenAI’s Daybreak shifts focus from finding bugs to fixing them

Logo featuring a stylized orange asterisk-like symbol followed by the word 'Claude' in bold black serif font on a light beige background.

Anthropic launches Japan Claude Community Ambassador program after 290+ global meetups

OpenAI logo displayed prominently against a vibrant background with gradient colors blending from blue to green and yellow. The logo features a geometric design of an interlocking hexagonal pattern in black.

Samsung rolls out ChatGPT Enterprise to all employees worldwide

Screenshot of the Perplexity Computer interface featuring a command panel for AI-powered tasks and automation. The dashboard includes a search bar, an Orchestrator mode selector, Deep Research tools, custom skills, and planning options, designed to help users perform research, workflows, and computer-assisted tasks.

Perplexity Computer adds a Command Panel

Company Info
  • Homepage
  • Support my work
  • Latest stories
  • Company updates
  • GDB Recommends
  • Daily newsletters
  • About us
  • Contact us
  • Write for us
  • Editorial guidelines
Legal
  • Privacy Policy
  • Cookies Policy
  • Terms & Conditions
  • DMCA
  • Disclaimer
  • Accessibility Policy
  • Security Policy
  • Do Not Sell or Share My Personal Information
Socials
Follow US

Disclosure: We love the products we feature and hope you’ll love them too. If you purchase through a link on our site, we may receive compensation at no additional cost to you. Read our ethics statement. Please note that pricing and availability are subject to change.

Copyright © 2026 GadgetBond. All Rights Reserved. Use of this site constitutes acceptance of our Terms of Use and Privacy Policy | Do Not Sell/Share My Personal Information.