OpenAI is rolling out GPT-5.5 Instant as the new default brain behind ChatGPT, promising answers that are noticeably smarter, cleaner, and more personal without slowing the experience down.
At a high level, GPT-5.5 Instant replaces GPT-5.3 Instant as the model you get by default when you open ChatGPT, and it’s also available to developers under the chat-latest label in the API. OpenAI is pitching this as an everyday workhorse: faster than the heavyweight “Thinking” models, but significantly more accurate and context-aware than the previous Instant generation. The company says it made a series of under-the-hood changes that together cut down on hallucinations, tighten up answers, and make the assistant feel more like it remembers who you are and how you like to work.
One of the headline claims is reliability. In OpenAI’s internal evaluations, GPT-5.5 Instant produced 52.5 percent fewer hallucinated claims than GPT-5.3 Instant on “high-stakes” prompts in areas like medicine, law, and finance, where confidently wrong answers are a serious problem. It also reduced inaccurate claims by 37.3 percent on especially difficult user conversations that had previously been flagged for factual errors, suggesting better recovery when a discussion gets tricky or goes off the rails. On benchmarks, the gains are visible: on AIME 2025, a competition-level math test, GPT-5.5 Instant scored 81.2 versus 65.4 for GPT-5.3 Instant, and it posted higher scores on multimodal reasoning and document parsing tests like MMMU-Pro and OmniDocBench. In practice, that translates to better handling of math-heavy work, complex charts, dense PDFs, and mixed text-and-image problems that used to trip up older models.
OpenAI is also trying to fix something that’s harder to quantify but very noticeable if you use ChatGPT a lot: response style. GPT-5.5 Instant is tuned to be more concise, less over-formatted, and generally less “talkative for the sake of it.” The company shows side-by-side examples where the new model gives shorter advice that still covers the important details, trims back headings and bullets, and avoids wrapping simple answers in multi-layered frameworks and emoji-laden pep talks. OpenAI explicitly calls out that GPT-5.5 Instant tries to avoid clutter like gratuitous emojis and unnecessary follow-up questions, a subtle change that can make long sessions feel smoother and more human, rather than like reading documentation every time you ask for help.
Underneath those tone changes is a more capable model across everyday tasks. GPT-5.5 Instant is better at deciding when it should call out to web search, so you’re more likely to get updated information where it matters instead of guesses. It’s also stronger at interpreting images and uploaded files: things like reading a screenshot, understanding a chart in a PDF, or parsing a messy document are now closer to “ask a colleague for help” than “cross your fingers and hope it works.” OpenAI’s own examples highlight that the model is more willing to revisit its earlier steps when it catches an inconsistency, instead of confidently locking in a wrong conclusion; it can notice that a candidate answer fails when plugged back into an equation, correct the algebra, and then rerun the solution. That self-correction behavior doesn’t make it infallible, but it does bring it a bit closer to the kind of reasoning people expect from a careful assistant.
The other big piece of the GPT-5.5 Instant story is personalization. OpenAI has been slowly leaning into memory features, and with this release, the Instant model gets notably better at using context from your past chats, uploaded files, and connected Gmail (if you choose to link it) to shape responses. Instead of treating every conversation like a blank slate, GPT-5.5 Instant can recognize patterns in what you’ve asked before, the tone you prefer, recurring projects, or even your go-to places and brands, and then tailor suggestions accordingly. OpenAI’s example: when asked for a new tea place, GPT-5.5 Instant doesn’t just dump a list of popular options; it leans on the fact that you already frequent a specific shop and prefer a certain style of tea, then recommends spots that feel like a natural extension of your existing habits.
To address the obvious privacy and control questions, OpenAI is introducing what it calls “memory sources” across ChatGPT. When a response uses personalization, you can see a breakdown of what context contributed to it – for example, specific saved memories or prior conversations – and you can delete or correct those if they’re outdated or you simply don’t want them in the system anymore. Memory sources won’t be exposed when you share a chat with someone else, so a link you share won’t suddenly reveal the private context that shaped the answer on your side. OpenAI is upfront that this view isn’t yet a perfect log of every single factor the model considered, but it’s a step toward making personalization less opaque and more like a settings panel you can actually adjust instead of a black box.
On the rollout front, GPT-5.5 Instant is already starting to show up as the default model for ChatGPT users, replacing GPT-5.3 Instant in the main interface. For developers, the same model is accessible via the API under the chat-latest alias (which some outlets shorthand as “gpt-5.5 chat-latest”), effectively making it the default if you simply request the latest chat model. GPT-5.3 Instant isn’t disappearing overnight: OpenAI says it will remain available to paying users for roughly three months, accessible through model settings before being retired, a slower deprecation cycle that seems aimed at avoiding the backlash the company faced when it sunset earlier models too aggressively. Enhanced personalization from past chats, files, and Gmail is rolling out first to Plus and Pro users on the web, with mobile apps and broader availability to Free, Go, Business, and Enterprise users coming over the following weeks.
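For developers curious what adopting the new default looks like, here is a minimal sketch of a Chat Completions request against OpenAI’s REST endpoint. The model alias used below (`gpt-5.5-chat-latest`) is an assumption based on the “chat-latest” label described above, and `build_request` is just an illustrative helper; verify the exact model string against OpenAI’s published model list before relying on it.

```python
import json
import os
import urllib.request

# Assumed model alias derived from the "chat-latest" label; the real string
# may differ -- check OpenAI's model list before using it in production.
MODEL = "gpt-5.5-chat-latest"

def build_request(prompt: str) -> dict:
    """Assemble a Chat Completions payload targeting the new default model."""
    return {
        "model": MODEL,
        "messages": [{"role": "user", "content": prompt}],
    }

payload = build_request("Summarize today's meeting notes in three bullets.")
print(payload["model"])

# Only hit the real endpoint when an API key is actually configured.
if os.environ.get("OPENAI_API_KEY"):
    req = urllib.request.Request(
        "https://api.openai.com/v1/chat/completions",
        data=json.dumps(payload).encode(),
        headers={
            "Authorization": f"Bearer {os.environ['OPENAI_API_KEY']}",
            "Content-Type": "application/json",
        },
    )
    with urllib.request.urlopen(req) as resp:
        print(json.load(resp)["choices"][0]["message"]["content"])
```

Because the alias simply tracks “the latest chat model,” code written this way picks up future default-model swaps automatically, which is exactly the behavior OpenAI seems to be encouraging here.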
The context here is that OpenAI has been in a rhythm of shipping new default models every few months, each one trying to juggle three things: capability, cost, and latency. GPT-5.5 Instant is the latest iteration of that balancing act, leaning heavily on “better answers without slowing you down” as its core pitch. It inherits the broader GPT-5.5 family’s improvements in coding and knowledge work, but wraps them in a profile tuned for everyday chats, quick research, and productivity workflows where a snappier response often matters more than squeezing out the last bit of reasoning depth. That also keeps it distinct from models like GPT-5.5 Thinking, which target the truly hard reasoning problems but at higher cost and latency.
Of course, none of this changes the basic rule of using large language models: you still need to sanity-check important outputs, especially in areas like health, finance, and legal decisions. What GPT-5.5 Instant offers is a lower error rate, better self-correction, and a nicer interaction style layered on top – the kind of incremental upgrade that’s easy to underestimate but quickly becomes noticeable when you’re relying on ChatGPT for dozens of small tasks every day. For everyday users, the change will probably feel less like a radical new product and more like ChatGPT quietly becoming the version of itself you wish it had been a year ago: quicker, more accurate, a bit more “you,” and a bit less exhausting.