By using this site, you agree to the Privacy Policy and Terms of Use.
Accept

GadgetBond

  • Latest
  • How-to
  • Tech
    • AI
    • Amazon
    • Apple
    • CES
    • Computing
    • Creators
    • Google
    • Meta
    • Microsoft
    • Mobile
    • Samsung
    • Security
    • Xbox
  • Transportation
    • Audi
    • BMW
    • Cadillac
    • E-Bike
    • Ferrari
    • Ford
    • Honda Prelude
    • Lamborghini
    • McLaren W1
    • Mercedes
    • Porsche
    • Rivian
    • Tesla
  • Culture
    • Apple TV
    • Disney
    • Gaming
    • Hulu
    • Marvel
    • HBO Max
    • Netflix
    • Paramount
    • SHOWTIME
    • Star Wars
    • Streaming
Add GadgetBond as a preferred source to see more of our stories on Google.
Font ResizerAa
GadgetBondGadgetBond
  • Latest
  • Tech
  • AI
  • Deals
  • How-to
  • Apps
  • Mobile
  • Gaming
  • Streaming
  • Transportation
Search
  • Latest
  • Deals
  • How-to
  • Tech
    • Amazon
    • Apple
    • CES
    • Computing
    • Creators
    • Google
    • Meta
    • Microsoft
    • Mobile
    • Samsung
    • Security
    • Xbox
  • AI
    • Anthropic
    • ChatGPT
    • ChatGPT Atlas
    • Gemini AI (formerly Bard)
    • Google DeepMind
    • Grok AI
    • Meta AI
    • Microsoft Copilot
    • OpenAI
    • Perplexity
    • xAI
  • Transportation
    • Audi
    • BMW
    • Cadillac
    • E-Bike
    • Ferrari
    • Ford
    • Honda Prelude
    • Lamborghini
    • McLaren W1
    • Mercedes
    • Porsche
    • Rivian
    • Tesla
  • Culture
    • Apple TV
    • Disney
    • Gaming
    • Hulu
    • Marvel
    • HBO Max
    • Netflix
    • Paramount
    • SHOWTIME
    • Star Wars
    • Streaming
Follow US
AIAnthropicTech

Run smarter, pay less: Sonnet and Haiku tap Opus as a hidden advisor

This Anthropic trick turns cheap models into Opus-guided power agents.

By
Shubham Sawarkar
Shubham Sawarkar's avatar
ByShubham Sawarkar
Editor-in-Chief
I’m a tech enthusiast who loves exploring gadgets, trends, and innovations. With certifications in CISCO Routing & Switching and Windows Server Administration, I bring a sharp...
Follow:
- Editor-in-Chief
Apr 10, 2026, 2:56 AM EDT
Share
We may get a commission from retail offers. Learn more
Anthropic 'The Advisor Strategy' illustration featuring a geometric triangle network diagram with three interconnected white circles and black nodes on a warm coral background, symbolizing connection and strategic relationships.
Image: Anthropic
SHARE

Anthropic is quietly changing how we think about AI agents, and it’s doing it with a deceptively simple idea: let a cheaper model drive, and bring in the genius only when you really need it.

Instead of running Claude Opus end-to-end for every task — which gets expensive fast — Anthropic is now pushing what it calls the “advisor strategy”: pair Opus as a behind-the-scenes advisor with Sonnet or Haiku as the executor, and you get near Opus-level intelligence at Sonnet-like prices.

Here’s how it actually works in practice. Your agent runs on Claude Sonnet or Haiku, which does everything the user sees: calling tools, browsing, reading results, iterating, and writing the final answer. But when that smaller model hits a genuinely hard decision — think tricky reasoning, complex planning, or ambiguous context — it silently “escalates” to Opus via a new advisor tool built into the Claude Platform. Opus doesn’t talk to your user, doesn’t call tools, and doesn’t try to solve the entire task; it just reads the shared context and sends back a plan, correction, or stop signal so the executor can continue.

What Anthropic is doing here is flipping the usual pattern on its head. Traditionally, teams set up a big orchestrator model that decomposes a task and hands chunks to smaller worker models. Anthropic’s advisor strategy does the opposite: the smaller, cheaper model is in charge, and it only pulls in frontier-level reasoning from Opus when absolutely necessary. That means your default cost profile is Sonnet or Haiku, and only a sliver of tokens are billed at Opus rates — just enough to unlock its decision-making and planning.

Under the hood, this is all wired through a new server-side advisor tool that Sonnet and Haiku “know” how to use. In a typical API call, you declare advisor_20260301 as a tool, specify Opus as the advisor model, and let the executor decide when to invoke it. The entire exchange — executor calling the advisor, Opus reading curated context, sending back a plan, and Sonnet continuing — happens within a single /v1/messages request, so you’re not juggling extra round-trips or complex context management yourself.

Anthropic is backing this up with numbers. In its internal evaluations, Sonnet with an Opus advisor shows a 2.7 percentage point gain on SWE-bench Multilingual compared to Sonnet alone, while actually reducing cost per agentic task by about 11.9%. On BrowseComp and Terminal-Bench 2.0, the same pattern holds: higher scores with lower cost per task than running Sonnet solo. In other words, you’re not paying extra just for a bit more quality — in some workloads, you’re actually saving money and gaining accuracy at the same time.

The story gets even more interesting with Haiku. On BrowseComp, Haiku with an Opus advisor jumps to 41.2%, more than double Haiku’s solo score of 19.7%. It still trails Sonnet solo in raw performance, but Anthropic says it costs around 85% less per task, which puts it squarely in the “high-volume, good-enough-but-smart” sweet spot. For large workloads — think customer support triage, large-scale content operations, or bulk research agents — that trade-off is extremely attractive: frontier-flavored intelligence at a fraction of the price.

Crucially, Anthropic has wired in cost controls from the start. You can set max_uses to cap how many times the advisor can be called per request, and advisor tokens are reported separately in the usage block, so teams can watch exactly how much they’re spending on Opus versus the executor. Because Opus typically only produces a short plan of ~400–700 tokens, and the heavier output is generated by Sonnet or Haiku at their lower rates, the total bill lands well below what you’d pay for Opus end-to-end.

For developers already using tool-augmented agents, Anthropic is trying hard not to break anything. The advisor tool is “just another tool” in the Messages API: you can combine it with web search, code execution, and other tools in the same loop. The executor can browse the web, run code, call custom tools, and consult Opus — all inside a single, coherent agent run. That makes the advisor strategy feel less like some niche feature and more like a new default pattern for building serious agents on top of Claude.

Anthropic is also leaning on early customer feedback to sell the idea. Genspark reports “clear improvements in agent turns, tool calls, and overall score — better than a planning tool we built ourselves.” Eve Legal says Haiku 4.5 with an Opus 4.6 advisor can match frontier-model quality at around 5× lower cost on structured document extraction tasks. And at Bolt, the team notes that the advisor setup “makes better architectural decisions on complex tasks while adding no overhead on simple ones” — the plans and trajectories, they say, are “night and day different.” Taken together, these quotes paint a picture of teams that experimented with their own planning and orchestration layers, then quietly retired them once the advisor tool started outperforming their custom logic.

From a developer’s point of view, getting started is fairly lightweight. The feature is available in beta on the Claude Platform, and Anthropic outlines a three-step flow: add the beta header anthropic-beta: advisor-tool-2026-03-01, declare advisor_20260301 in your Messages API request, and adjust your system prompt for your specific use case (such as coding agents). Anthropic even recommends a simple evaluation recipe: run your existing eval suite against Sonnet solo, Sonnet + Opus advisor, and Opus solo to see the quality–cost trade-offs in your own environment.

Strategically, this move says a lot about where Anthropic thinks the market is going. Instead of forcing teams to choose between “cheap but dumb” and “smart but expensive,” the advisor strategy creates a middle lane: “smart when needed, cheap by default.” For many organizations that are watching their token bills but still want frontier-level reasoning in critical paths, that’s exactly the sort of lever they’ve been asking for.

It also marks a subtle shift in how we talk about AI capabilities. The question is no longer just “Which model is the smartest?” but “How do you route the right intelligence to the right part of the task at the right price?” Anthropic’s answer is to make that routing automatic and model-native: Sonnet and Haiku know when they’re stuck and when it’s time to ask Opus for help, instead of relying on brittle, hand-coded orchestration trees.

For teams building production agents, this could become a new default architecture: pick Sonnet when you want strong general performance, bolt Opus on as an advisor to cover the hardest reasoning, and drop down to Haiku + Opus when scale and unit cost are your main constraints. You get Opus-level guidance in the moments that matter, without paying Opus-level prices for every token.


Discover more from GadgetBond

Subscribe to get the latest posts sent to your email.

Topic:Claude AI
Leave a Comment

Leave a ReplyCancel reply

Most Popular

MacBook Neo is so popular that it’s now a massive problem for Apple

Perplexity’s Billion Dollar Build is a stress test for AI-native startup ideas

OpenAI Codex loses six older models in spring cleanup

Perplexity and Plaid unite to bring all your money data into one smart view

Anthropic’s Project Glasswing could reshape how software is secured

Also Read
A dark-mode Google Finance dashboard interface. On the left, a vertical watchlist shows several stocks, each with a miniature line graph and percentage change indicator. The center features a candlestick chart tracking price movements, currently at 100.00 with a +2.25% gain. Below it are color-coded bar graphs and a search field for stocks, ETFs, and more. On the right, a “Research” panel poses the question “What’s going on with the markets today?” above a small trend graph, creating an organized, data-focused layout.

Google Finance’s AI upgrade goes global in 100+ countries

Gemini NotebookLM interface showing a light blue background with the Gemini logo and NotebookLM branding at the top. Left sidebar displays navigation options including New chat, My stuff, Scheduled actions, Gems, Notebooks section with Job search and Biology finals items, New notebook button, and Chats section listing various items like travel essays, meal prep, and recipes. Main content area prompts to "Give your notebook a name" with "Grad school application" entered as an example, accompanied by a blue submit arrow button.

Google launches Gemini Notebooks to keep chats, files and NotebookLM in sync

Simtheory and Ortto acquisition by Canva announcement featuring two founders on the left against a green background, with a purple-to-teal gradient backdrop. Right side displays Canva's integrated product interfaces including a lead sources breakdown chart, sales reporting dashboard, content creation panel with messaging options, and a mobile notification mockup showing a "New Feature Alert" on an iPhone lock screen

Canva buys Simtheory and Ortto to supercharge AI marketing stack

Split-screen PayPal Payment Links showcase. Left side displays a payment card on a taupe background with patterned mugs, showing a bowl icon in a white circle, a blue "Buy Now" button, and PayPal branding. Right side shows a whimsical illustration of a green pear shape with "NICE" and "PEAR" handwritten text, featuring a QR code circle and "$250 Received money" label on a blue background

Creators can now add PayPal, Venmo, and Pay Later inside Canva designs

Anthropic

Anthropic and PayPal talk scaling Claude Cowork

Claude Cowork logo and text on a light grey background, featuring a coral-colored starburst icon next to the product name in black serif font.

Claude Cowork rolls out to all paid plans with enterprise superpowers

ChatGPT logo and wordmark in white on a soft blue and orange gradient background, representing OpenAI’s ChatGPT platform.

OpenAI launches mid-tier $100 ChatGPT Pro plan with higher Codex limits

Screenshot of an Evernote desktop note titled “YouTube Link Preview” showing a large embedded YouTube video player in the center of the note editor, with Evernote’s left sidebar navigation (Home, Shortcuts, Notes, Tasks, Files, Calendar, etc.) and top formatting toolbar visible, demonstrating how a YouTube video appears inline inside an Evernote note; the presenter’s face in the video is blurred for privacy.

Evernote adds inline YouTube playback in notes

Company Info
  • Homepage
  • Support my work
  • Latest stories
  • Company updates
  • GDB Recommends
  • Daily newsletters
  • About us
  • Contact us
  • Write for us
  • Editorial guidelines
Legal
  • Privacy Policy
  • Cookies Policy
  • Terms & Conditions
  • DMCA
  • Disclaimer
  • Accessibility Policy
  • Security Policy
  • Do Not Sell or Share My Personal Information
Socials
Follow US

Disclosure: We love the products we feature and hope you’ll love them too. If you purchase through a link on our site, we may receive compensation at no additional cost to you. Read our ethics statement. Please note that pricing and availability are subject to change.

Copyright © 2026 GadgetBond. All Rights Reserved. Use of this site constitutes acceptance of our Terms of Use and Privacy Policy | Do Not Sell/Share My Personal Information.