GadgetBond

  • Latest
  • How-to
  • Tech
    • AI
    • Amazon
    • Apple
    • CES
    • Computing
    • Creators
    • Google
    • Meta
    • Microsoft
    • Mobile
    • Samsung
    • Security
    • Xbox
  • Transportation
    • Audi
    • BMW
    • Cadillac
    • E-Bike
    • Ferrari
    • Ford
    • Honda Prelude
    • Lamborghini
    • McLaren W1
    • Mercedes
    • Porsche
    • Rivian
    • Tesla
  • Culture
    • Apple TV
    • Disney
    • Gaming
    • Hulu
    • Marvel
    • HBO Max
    • Netflix
    • Paramount
    • SHOWTIME
    • Star Wars
    • Streaming
Add GadgetBond as a preferred source to see more of our stories on Google.
Font ResizerAa
GadgetBondGadgetBond
  • Latest
  • Tech
  • AI
  • Deals
  • How-to
  • Apps
  • Mobile
  • Gaming
  • Streaming
  • Transportation
Search
  • Latest
  • Deals
  • How-to
  • Tech
    • Amazon
    • Apple
    • CES
    • Computing
    • Creators
    • Google
    • Meta
    • Microsoft
    • Mobile
    • Samsung
    • Security
    • Xbox
  • AI
    • Anthropic
    • ChatGPT
    • ChatGPT Atlas
    • Gemini AI (formerly Bard)
    • Google DeepMind
    • Grok AI
    • Meta AI
    • Microsoft Copilot
    • OpenAI
    • Perplexity
    • xAI
  • Transportation
    • Audi
    • BMW
    • Cadillac
    • E-Bike
    • Ferrari
    • Ford
    • Honda Prelude
    • Lamborghini
    • McLaren W1
    • Mercedes
    • Porsche
    • Rivian
    • Tesla
  • Culture
    • Apple TV
    • Disney
    • Gaming
    • Hulu
    • Marvel
    • HBO Max
    • Netflix
    • Paramount
    • SHOWTIME
    • Star Wars
    • Streaming
Follow US
AIBusinessOpenAITech

OpenAI chooses Cerebras for ultra-fast AI inference

Cerebras’ wafer-scale chips give OpenAI a new way to run AI at real-time speed.

By
Shubham Sawarkar
Shubham Sawarkar's avatar
ByShubham Sawarkar
Editor-in-Chief
I’m a tech enthusiast who loves exploring gadgets, trends, and innovations. With certifications in CISCO Routing & Switching and Windows Server Administration, I bring a sharp...
Follow:
- Editor-in-Chief
Jan 14, 2026, 10:00 PM EST
Share
We may get a commission from retail offers. Learn more
OpenAI and Cerebras logos displayed side by side, separated by a vertical line, on a blue-green gradient background.
Image: OpenAI
SHARE

OpenAI’s partnership with Cerebras is essentially a bet that the future of AI will be real-time, always-on, and limited less by GPUs and more by electricity and cooling. It is about taking the kinds of models people already use every day and making them feel as responsive as a live conversation or a local app, even at a massive global scale.​

At the heart of the deal is a huge number: 750 megawatts of ultra low-latency AI compute that Cerebras will dedicate to running OpenAI’s models. That capacity will be rolled out in multiple phases through 2028, making this one of the largest high-speed AI inference deployments announced so far. Unlike a typical GPU cluster stitched together from thousands of cards, Cerebras builds systems around a “wafer-scale engine” – a single, giant chip the size of an entire silicon wafer, with compute, memory, and bandwidth living side by side. By keeping everything on one enormous piece of silicon instead of hopping across a network of discrete accelerators, Cerebras cuts out many of the latency bottlenecks that slow traditional AI inference.​

This is exactly the pain point OpenAI wants to address. Today, when a user asks a complicated question, generates code, or kicks off an AI agent, there is a multi-step dance behind the scenes: the request travels to a data center, the model runs across multiple machines, results are stitched together, and then streamed back. That process works, but it is not always instant, especially at peak demand or with dense workloads like code generation and long-form reasoning. OpenAI describes its overall compute strategy as building a “resilient portfolio” that matches different workloads to the hardware that makes the most sense for them, and Cerebras is being slotted in as a dedicated low-latency inference tier. In practical terms, that means certain classes of prompts – the ones where every millisecond matters for user experience – can be routed to this faster Cerebras-backed layer.​

The companies have been circling each other for years. Cerebras has pitched itself as an alternative to GPU-bound AI infrastructure, claiming that its wafer-scale systems can run large language models at speeds up to an order of magnitude faster than conventional GPU setups for some workloads. Early benchmarks on Cerebras hardware, including models from the Llama family, show token generation rates that significantly outpace many GPU-based deployments, which is exactly the sort of improvement OpenAI needs for “always-on” assistants, live coding copilots, and real-time agents. For Cerebras, this deal is a validation moment: its CEO, Andrew Feldman, framed it as a decade-long journey culminating in a multi-year agreement that could push wafer-scale technology into the hands of hundreds of millions, and eventually billions, of users.​

There is also a bigger context here: OpenAI is quietly building out a vast physical footprint to feed its models’ hunger for power and cooling. The Cerebras announcement lands alongside a separate partnership with SB Energy, backed by SoftBank, that involves a $1 billion investment to build and operate a 1.2 gigawatt AI data center campus in Milam County, Texas, powered by new solar and battery storage. A gigawatt is enough electricity to power roughly three-quarters of a million US homes at any given moment, which gives a sense of the scale of the facilities needed to run next-generation AI systems. When you start to pair that kind of renewable-heavy power infrastructure with 750 MW of specialized inference hardware, you begin to see how seriously OpenAI is treating AI as critical infrastructure rather than just cloud software.​

For users, many of these moves will show up in subtle ways before they’re obvious headline features. Interfaces that used to stutter or lag may start to feel “local” even when they are calling massive models over the network. Latency-sensitive use cases – think real-time customer support, multiplayer gaming assistants, trading and risk agents, or live language translation – stand to benefit the most from a dedicated low-latency tier. In that sense, Cerebras plays a similar role to the early broadband providers of the web era: most people will never see the wafer-scale chips themselves, but they will notice when AI stops feeling like a slow remote service and starts behaving like a native part of everything they do online.


Discover more from GadgetBond

Subscribe to get the latest posts sent to your email.

Leave a Comment

Leave a ReplyCancel reply

Most Popular

Perplexity open-sources Bumblebee, its dev laptop security scanner

Wireless Phomemo D420D label printer is discounted for a limited time

CMF Headphone Pro drops to $69 with 30% off across all colors

Also Read
Stylized Firefox browser mockup displaying multiple travel-themed webpages with a purple color scheme, including hotel booking and Greece travel discovery pages, layered across dark and light browser windows against a purple abstract background.

Mozilla is rebuilding Firefox with Project Nova

Firefox VPN interface showing a “Choose VPN Location” menu with countries including Canada, France, Germany, United Kingdom, and United States of America, with Germany highlighted and a cursor pointing at the selection against a purple-themed background.

Firefox’s built-in VPN now lets you pick your location

Collage of 15 accessibility advocates and creators arranged in three rows against a blue PlayStation-themed background featuring the triangle, circle, X, and square symbols. Top row, left to right: Ben Breen (SightlessKombat), Cameron Keywood, Cesar Flores, Christopher Robinson, and David Deacon. Middle row, left to right: Dr. Amy Kavanagh seated outdoors with a guide dog, James Rath posing with a dog, James Toland wearing headphones and glasses, Li Brady with green-highlighted hair, and Mikey Starovoytov smiling at a table with hands clasped together. Bottom row, left to right: Paul Lane in a suit and bow tie, Ross Minor outdoors, Sam Kitchen wearing glasses and a red hoodie, Shaz Shanghanoo in dramatic and beautiful makeup, and Steve Saylor wearing glasses in colorful lighting.

Sony levels up PS5 accessibility with a new PlayStation Studios Council

Blue PlayStation State of Play promotional graphic featuring the PlayStation logo and “STATE OF PLAY” text on the left, with large 3D PlayStation controller symbols — square, triangle, cross, and circle — stacked on the right against a glowing blue background.

Sony locks in June 2 State of Play with Wolverine and 60+ minutes of PS5 news

An iPhone 17 Pro is horizontal in the center of the frame. A soccer field is visible on the screen of the iPhone, displaying the view from the camera. Behind the iPhone, a soccer net and stadium are visible but out of focus.

Apple TV’s next big test: an MLS match shot entirely on iPhone 17 Pro

Apple App Store logo

Apple is revising App Store age ratings for Australian and Vietnamese users

Illustration of a mobile AI Controls settings screen with toggles for blocking AI enhancements, translations, and page summaries, displayed on a purple gradient background with Firefox branding in the corner.

Firefox adds simple AI controls to its mobile app

UI design concept showing four mobile app onboarding screens for a reading app called Bookworm, displayed in a brown-themed dark mode interface with genre selection, account setup, and bookshelf features. A large overlay prompt in the center reads ‘Switch to brown color scheme and dark mode.’

Figma launches an on-canvas AI design agent for real product workflows

Company Info
  • Homepage
  • Support my work
  • Latest stories
  • Company updates
  • GDB Recommends
  • Daily newsletters
  • About us
  • Contact us
  • Write for us
  • Editorial guidelines
Legal
  • Privacy Policy
  • Cookies Policy
  • Terms & Conditions
  • DMCA
  • Disclaimer
  • Accessibility Policy
  • Security Policy
  • Do Not Sell or Share My Personal Information
Socials
Follow US

Disclosure: We love the products we feature and hope you’ll love them too. If you purchase through a link on our site, we may receive compensation at no additional cost to you. Read our ethics statement. Please note that pricing and availability are subject to change.

Copyright © 2026 GadgetBond. All Rights Reserved. Use of this site constitutes acceptance of our Terms of Use and Privacy Policy | Do Not Sell/Share My Personal Information.