GadgetBond

  • Latest
  • How-to
  • Tech
    • AI
    • Amazon
    • Apple
    • CES
    • Computing
    • Creators
    • Google
    • Meta
    • Microsoft
    • Mobile
    • Samsung
    • Security
    • Xbox
  • Transportation
    • Audi
    • BMW
    • Cadillac
    • E-Bike
    • Ferrari
    • Ford
    • Honda Prelude
    • Lamborghini
    • McLaren W1
    • Mercedes
    • Porsche
    • Rivian
    • Tesla
  • Culture
    • Apple TV
    • Disney
    • Gaming
    • Hulu
    • Marvel
    • HBO Max
    • Netflix
    • Paramount
    • SHOWTIME
    • Star Wars
    • Streaming
Add GadgetBond as a preferred source to see more of our stories on Google.
Font ResizerAa
GadgetBondGadgetBond
  • Latest
  • Tech
  • AI
  • Deals
  • How-to
  • Apps
  • Mobile
  • Gaming
  • Streaming
  • Transportation
Search
  • Latest
  • Deals
  • How-to
  • Tech
    • Amazon
    • Apple
    • CES
    • Computing
    • Creators
    • Google
    • Meta
    • Microsoft
    • Mobile
    • Samsung
    • Security
    • Xbox
  • AI
    • Anthropic
    • ChatGPT
    • ChatGPT Atlas
    • Gemini AI (formerly Bard)
    • Google DeepMind
    • Grok AI
    • Meta AI
    • Microsoft Copilot
    • OpenAI
    • Perplexity
    • xAI
  • Transportation
    • Audi
    • BMW
    • Cadillac
    • E-Bike
    • Ferrari
    • Ford
    • Honda Prelude
    • Lamborghini
    • McLaren W1
    • Mercedes
    • Porsche
    • Rivian
    • Tesla
  • Culture
    • Apple TV
    • Disney
    • Gaming
    • Hulu
    • Marvel
    • HBO Max
    • Netflix
    • Paramount
    • SHOWTIME
    • Star Wars
    • Streaming
Follow US
AINVIDIAPerplexityTech

Nemotron 3 Ultra rolls out to Perplexity Pro, Max, and Computer

As Nemotron 3 Ultra comes online in Perplexity and Computer, users gain a more robust foundation for long-running AI tasks that span multiple steps, tools, and evolving sources.

By
Shubham Sawarkar
Shubham Sawarkar's avatar
ByShubham Sawarkar
Editor-in-Chief
I’m a tech enthusiast who loves exploring gadgets, trends, and innovations. With certifications in CISCO Routing & Switching and Windows Server Administration, I bring a sharp...
Follow:
- Editor-in-Chief
Jun 6, 2026, 1:49 AM EDT
Share
We may get a commission from retail offers. Learn more
Close-up screenshot of an AI model selection menu displaying several large language models, including Claude Sonnet 4.6, Claude Opus 4.8, and Nemotron 3 Ultra. The Nemotron 3 Ultra option is highlighted and selected, marked with a checkmark and a “Max” badge, while a large cursor points toward the model name. The interface emphasizes choosing an advanced AI model within a chatbot or AI platform.
Image: Perplexity
SHARE

Nemotron 3 Ultra landing on Perplexity’s Pro and Max tiers – and inside Computer – is one of those upgrades that quietly changes what you can actually do with an AI assistant, even if the interface looks almost the same at first glance. It is not just “a new model option”; it is NVIDIA’s flagship open frontier reasoning system, built specifically to power long-running agents that think for longer, juggle more context, and still respond fast enough to feel interactive.

If you’ve ever hit the limits of current models while running deep research sessions, complex coding tasks, or sprawling “plan this entire project” prompts, this is the sort of backend change that matters more than any shiny new chat UI.

When NVIDIA says “open frontier model,” they are not just using marketing language. Nemotron 3 Ultra is a 550 billion parameter Mixture-of-Experts model with around 55 billion parameters active at any given time, using a hybrid Mamba‑Transformer architecture tuned for throughput and long-context reasoning. That architecture, combined with NVIDIA’s NVFP4 and BF16 tricks on Blackwell-class GPUs, lets the model push out up to roughly six times higher inference throughput than comparable open LLMs at similar accuracy levels, which is a very polite way of saying “it runs frontier-sized brains without feeling like dial‑up.”

The other big number that matters is the context window: Nemotron 3 Ultra has been stretched to handle up to 1 million tokens of context, after being pre-trained on about 20 trillion tokens and then post-trained with supervised fine-tuning, reinforcement learning, and multi-teacher distillation. Long context is not new as a concept, but tying a 1M-token window to a model explicitly optimized for long-running agents means it is designed to keep absorbing intermediate steps, tools calls, and retrieved documents over time instead of falling apart halfway through a session.

Under the hood, NVIDIA’s technical report reads like a checklist of every modern LLM optimization you would expect in a 2026-era frontier model: Latent MoE routing, multi-token prediction, NVFP4 pretraining, multi-environment RL, and “reasoning budget control” to keep the model from overspending compute on trivial turns. The result, according to NVIDIA’s own benchmarking, is up to around 5.9x higher throughput than some of the largest open competitors on challenging long-output workloads, while keeping accuracy on reasoning and agentic benchmarks in the same ballpark.

That’s the model itself. The more interesting story, especially for power users in the US who already live in Perplexity all day, is what happens when you drop something like Nemotron 3 Ultra into a real product with actual users and noisy, messy, open-ended tasks.

Perplexity has been leaning hard into “agentic search” for a while now, layering open models like Nemotron 3 Super into its stack to orchestrate browsing, retrieval, and synthesis rather than just generating text from a static prompt. Nemotron 3 Ultra is essentially the bigger, more obsessive sibling in the same family – one that is built to run deeper and longer chains of reasoning, coordinate tools, and keep more of the evolving conversation in its head while it works.

By making Nemotron 3 Ultra available specifically to Pro and Max subscribers, Perplexity is doing two things at once. First, it is turning those paid tiers into a kind of “frontier open-model lab” where the best open weights from NVIDIA are wired directly into a consumer-facing agent stack instead of staying locked away in enterprise demos or research labs. Second, it is quietly normalizing the idea that open models are not just the cheap or privacy-friendly option – they can be the fast, capable default for demanding, long-running workflows.

If you are on Perplexity Pro or Max, the practical implication is simple: when you spin up longer Computer runs, ask the assistant to manage complex multi-step tasks, or rely on it as a “do this in the background while I keep working” companion, a lot of that orchestration can now ride on Nemotron 3 Ultra. The more your usage pattern looks like “agentic” rather than “quick one-off Q&A,” the more you benefit from a model that is tuned for extended reasoning and throughput at scale.

Computer is where this gets especially interesting. Perplexity’s Computer feature is essentially an agent shell: it opens tabs, navigates pages, runs tools, and stitches it all together into something coherent for you. Long-running agents in that environment need two things: they have to be able to keep context over many steps, and they have to be efficient enough that you are not staring at a spinner for minutes every time they think.

Nemotron 3 Ultra was built for exactly that: long-running “agentic” workflows where context grows continuously as the agent calls tools, reads more data, and updates its internal plan. A 1M-token context window lets an agent keep stacking up intermediate results, logs, and partial drafts without having to constantly prune away earlier context, which is precisely what kills coherence in a lot of existing long sessions.

The Mixture-of-Experts design is the other half of the story here. Because only a subset of experts are active per token, Nemotron 3 Ultra can deliver what is essentially frontier-scale capacity while still offering significantly higher throughput and lower effective cost than a dense model with similar total parameters. For a user, that translates into Computer sessions that can think deeply over long sequences of actions and still respond often enough that you feel comfortable iterating rather than setting something up and walking away.

If you are using Computer for extended coding sessions, refactors, or end-to-end research projects – especially with US-focused workflows like building reports, market analysis, or legal-style document reviews – that combination of long context and high throughput is a big deal. You can keep asking follow-ups, layering in new sources, or pivoting the task without having to manually reset or rewrite massive prompts just to keep the model from forgetting what you said ten steps ago.

There is also a bigger strategic angle here: the open-model ecosystem is evolving from “here are some weights on GitHub” to “here is a full frontier-class stack shipping in real products on day one.” NVIDIA’s Nemotron 3 family was always framed as a three-tiered system – Nano, Super, and Ultra – where Nano handles lightweight, high-frequency jobs, Super powers collaborative and high-volume agents, and Ultra is the heavyweight reasoning and orchestration engine. Nemotron 3 Nano has already landed on platforms like Hugging Face and multiple inference providers; Ultra now arriving inside Perplexity closes the loop between open research, cloud deployment, and end-user applications.

For Perplexity, this deepens its role in what NVIDIA has called the “Nemotron coalition,” where different partners integrate these open models into their own products rather than treating them as side-grade options. For users, especially in markets like the US where AI tooling is quickly becoming part of everyday professional workflows, it means that the open-versus-proprietary debate is less about raw capability and more about which ecosystem fits your use case and values.

NVIDIA’s choice to release Nemotron 3 Ultra with open weights, data, and recipes under an open license gives developers and platforms a lot of room to customize, fine-tune, or self-host variants of the model for domain-specific workflows. That openness is part of why you are seeing day-zero integrations not just on Perplexity but also across orchestration platforms and inference providers that are building their own agent stacks on top.

So what does all of this mean if you are just a person with a Perplexity Pro or Max subscription, logging in from a laptop or phone somewhere in the US and wondering if this actually changes your day-to-day?

In the near term, the shift is mostly experiential rather than flashy. Long Computer sessions feel less fragile and more “confident,” especially when they involve writing, refactoring, or analyzing large bodies of text and code. Multi-step research tasks – think: “analyze these reports, cross-check with current news, and draft something that ties it all together” – become more viable as a single continuous session instead of a series of disconnected prompts.

Over time, as Perplexity leans harder into agent-based features, Nemotron 3 Ultra gives the platform more headroom to experiment with richer, more autonomous behaviors without wrecking latency or cost. And because the model itself is open, it nudges the ecosystem toward a world where the frontier capabilities you get inside a polished consumer product are not fundamentally different from the tools independent developers and researchers can access and modify.

In other words, this is one of those platform updates that doesn’t demand a big marketing explainer in your inbox but quietly shifts the ceiling on what your AI assistant can handle. If you are on Pro or Max, you do not have to do anything fancy: just run the kinds of long, complex, multi-step tasks you already wish your AI could handle better, especially inside Computer, and see how far you can push it now that Nemotron 3 Ultra is doing a lot of the heavy lifting behind the scenes.


Discover more from GadgetBond

Subscribe to get the latest posts sent to your email.

Leave a Comment

Leave a ReplyCancel reply

Most Popular

Apple starts age verification in Texas

Perplexity’s AI “Personal Computer” steps onto Windows desktops

iOS 27 rumored to skip four older iPhone models

Here are all the winners of Apple’s 2026 Design Awards

Anthropic opens Project Glasswing to 150 new global defenders

Also Read
Screenshot of a ChatGPT interface displaying a drafted email in a document-style editor. The email is addressed to a repair service regarding a dishwasher leak and resulting cabinet damage, requesting a repair appointment. Editing and sharing controls appear at the top of the document, including a prominent pink “Send” button. The interface features a sidebar with navigation icons, a prompt input field at the bottom, and a blue-green gradient background surrounding the application window, illustrating AI-assisted email drafting and communication.

Draft it, tweak it, send it: ChatGPT adds native email sending

ChatGPT Memory summary modal showing a personalized overview of a user’s work, hobbies, travel interests, and community involvement, with options to correct or dismiss specific details.

OpenAI’s “Dreaming” update makes ChatGPT actually remember you

Logo featuring a stylized orange asterisk-like symbol followed by the word 'Claude' in bold black serif font on a light beige background.

Claude Cowork usage limits doubled on all paid plans for the next month

Illustration of two abstract hands on a pink background holding a cluster of white geometric shapes — a triangle, square, circle, and diamond.

Anthropic tightens its Claude Partner Network with tiers and a hub

Technology-themed illustration showing a glowing Earth emerging from a black background, surrounded by radiant golden data-like light trails extending outward. In the foreground, a series of floating interface panels display icons representing databases, task management, data analysis, artificial intelligence, and interconnected neural networks. A luminous green cube with connected nodes sits at the center, symbolizing AI infrastructure, large-scale computing, and global data ecosystems. The image conveys themes of machine learning, enterprise AI, cloud computing, and worldwide digital connectivity.

NVIDIA’s Nemotron 3 Ultra targets faster, cheaper long-running agents

Promotional graphic showing an AI chat prompt interface against a blue gradient background. The prompt asks: “Use my Function Health lab results to analyze changes in my vitamin D levels over time and build a dashboard showing trends, progress, and how each result compares to optimal ranges.” Tool chips labeled “Computer” and “Function Health” appear below the prompt, alongside an “Orchestrator” label, microphone icon, and send button, illustrating AI-assisted health data analysis and personalized wellness insights.

Perplexity’s health push connects Apple Health, Function labs, and other sources into Computer

Conceptual illustration showing a person seated in an armchair within a dark, dreamlike landscape, watching a glowing upward-trending financial chart projected across the scene. Above the chart floats a transparent sphere containing a computer icon, illuminated by beams of light from above. The image combines elements of technology, artificial intelligence, investing, and market analysis, symbolizing the use of AI-powered tools to monitor trends, research data, and support financial decision-making.

Perplexity’s Main Street AI push arrives with $250 credits per business

Conceptual technology-themed illustration featuring an open book on a desk beneath floating translucent digital panels and a glass sphere containing a computer icon. Blurred light streaks, code-filled interface windows, and layered geometric frames hover above the book against a dark background, symbolizing the intersection of knowledge, artificial intelligence, computing, and digital research. A pen rests beside the book, reinforcing themes of learning, analysis, and human-computer collaboration.

Perplexity Computer now decides what runs local vs cloud

Company Info
  • Homepage
  • Support my work
  • Latest stories
  • Company updates
  • GDB Recommends
  • Daily newsletters
  • About us
  • Contact us
  • Write for us
  • Editorial guidelines
Legal
  • Privacy Policy
  • Cookies Policy
  • Terms & Conditions
  • DMCA
  • Disclaimer
  • Accessibility Policy
  • Security Policy
  • Do Not Sell or Share My Personal Information
Socials
Follow US

Disclosure: We love the products we feature and hope you’ll love them too. If you purchase through a link on our site, we may receive compensation at no additional cost to you. Read our ethics statement. Please note that pricing and availability are subject to change.

Copyright © 2026 GadgetBond. All Rights Reserved. Use of this site constitutes acceptance of our Terms of Use and Privacy Policy | Do Not Sell/Share My Personal Information.