By using this site, you agree to the Privacy Policy and Terms of Use.
Accept

GadgetBond

  • Latest
  • How-to
  • Tech
    • AI
    • Amazon
    • Apple
    • CES
    • Computing
    • Creators
    • Google
    • Meta
    • Microsoft
    • Mobile
    • Samsung
    • Security
    • Xbox
  • Transportation
    • Audi
    • BMW
    • Cadillac
    • E-Bike
    • Ferrari
    • Ford
    • Honda Prelude
    • Lamborghini
    • McLaren W1
    • Mercedes
    • Porsche
    • Rivian
    • Tesla
  • Culture
    • Apple TV
    • Disney
    • Gaming
    • Hulu
    • Marvel
    • HBO Max
    • Netflix
    • Paramount
    • SHOWTIME
    • Star Wars
    • Streaming
Add GadgetBond as a preferred source to see more of our stories on Google.
Font ResizerAa
GadgetBondGadgetBond
  • Latest
  • Tech
  • AI
  • Deals
  • How-to
  • Apps
  • Mobile
  • Gaming
  • Streaming
  • Transportation
Search
  • Latest
  • Deals
  • How-to
  • Tech
    • Amazon
    • Apple
    • CES
    • Computing
    • Creators
    • Google
    • Meta
    • Microsoft
    • Mobile
    • Samsung
    • Security
    • Xbox
  • AI
    • Anthropic
    • ChatGPT
    • ChatGPT Atlas
    • Gemini AI (formerly Bard)
    • Google DeepMind
    • Grok AI
    • Meta AI
    • Microsoft Copilot
    • OpenAI
    • Perplexity
    • xAI
  • Transportation
    • Audi
    • BMW
    • Cadillac
    • E-Bike
    • Ferrari
    • Ford
    • Honda Prelude
    • Lamborghini
    • McLaren W1
    • Mercedes
    • Porsche
    • Rivian
    • Tesla
  • Culture
    • Apple TV
    • Disney
    • Gaming
    • Hulu
    • Marvel
    • HBO Max
    • Netflix
    • Paramount
    • SHOWTIME
    • Star Wars
    • Streaming
Follow US
AITech

Moonshot AI says its new model beats GPT-5 — and it’s open for everyone

Moonshot’s new Kimi K2 AI model says it outperforms OpenAI and Anthropic’s best systems.

By
Shubham Sawarkar
Shubham Sawarkar's avatar
ByShubham Sawarkar
Editor-in-Chief
I’m a tech enthusiast who loves exploring gadgets, trends, and innovations. With certifications in CISCO Routing & Switching and Windows Server Administration, I bring a sharp...
Follow:
- Editor-in-Chief
Nov 8, 2025, 8:28 AM EST
Share
We may get a commission from retail offers. Learn more
Illustrated image of artificial intelligence (AI)
Illustration by Kasia Bojanowska / Dribbble
SHARE

Moonshot AI, a Beijing-based lab that’s quietly been building an open-weight stack for the past two years, dropped a new model this week called Kimi K2 Thinking. The company says the model beats OpenAI’s GPT-5 and Anthropic’s Claude Sonnet 4.5 on a handful of the hardest agent-style benchmarks currently used to judge “thinking” systems — and, crucially, the code and weights are available for anyone to use on Hugging Face. If the numbers hold up, the result would be a jolt to the industry narrative that top-tier capabilities must be locked behind expensive, proprietary APIs.

Moonshot describes Kimi K2 Thinking as a mixture-of-experts (MoE) reasoning model that activates about 32 billion parameters at inference but has roughly 1 trillion parameters total across its expert pool. It’s explicitly built to think with tools: the model can call web browsers and other utilities, chain hundreds of steps of tool use, and (Moonshot says) continually refine hypotheses as it goes. That combination — deep multi-step reasoning plus stable tool use — is the headline feature Moonshot is pushing.

On Moonshot’s own benchmark pages, K2 Thinking posts scores such as 44.9% on Humanity’s Last Exam (HLE) with tools, 60.2% on BrowseComp (a web-search + synthesis test), and strong marks on coding suites like SWE-Bench Verified. Those are the numbers being used to argue that it outperforms GPT-5 and Claude Sonnet 4.5 on certain agentic tasks.

Why the “open and free” bit matters

You don’t usually see headline-grade model releases that are genuinely open. Moonshot has published the model materials on Hugging Face and released technical write-ups and code, meaning developers can download weights, run experiments locally, fork the code — or embed the model in commercial products, subject to the model’s modified license. That availability matters: it lowers the barrier for startups and researchers who want to experiment with agentic workflows without paying steep per-token fees to a closed provider.

Moonshot also published a technical narrative arguing K2’s MoE design plus quantization and serving tricks let the lab train and run a trillion-parameter system far more cheaply than conventional dense models — a claim that, if true, challenges the idea that building frontier models requires “scale and burn” budgets in the hundreds of millions or billions.

The eyebrow-raising price tag (and the caveats)

Several outlets reporting on the launch cited a figure — $4.6 million — as the training bill for K2 Thinking. That number has been widely repeated, but it’s important to stress it isn’t independently verified: outlets that repeated the figure note it came via a “source familiar with the matter” and that CNBC, which reported it, could not independently confirm the number. If the figure is accurate, it would be staggeringly low compared with the public spending numbers we’ve seen from leading Silicon Valley labs — but the number should be treated with caution until more independent accounting appears.

What Kimi K2 actually looks like under the hood

The broad technical story is familiar to anyone following modern LLM engineering:

  • MoE architecture: many expert sub-modules; only a subset activate per token, so you can get a huge “parameter count” but relatively modest inference costs. Moonshot says K2 activates ~32B params per call while the model houses ~1T total.
  • Tool-first training: the model was post-trained and fine-tuned to plan, call tools (search, browse, APIs), verify results, and iterate — not just to produce a single text reply. Moonshot emphasizes long sequences of tool calls (200–300) as a core capability.
  • Long context: the model family has been pushed to enormous context windows (Moonshot advertises hundreds of thousands of tokens in some versions), which helps for sustained, multi-step tasks.

Those design choices match a broader industry trend: instead of trying to encode everything into a single “dense” network, engineers are stitching together specialist modules and cheap tool calls to get emergent agentic behavior without astronomical inference bills.

Why investors and businesses should pay attention

For businesses that have been sold pricey enterprise models under the argument “you get what you pay for,” a high-performing free alternative is a strategic headache. If an open model can match or beat a proprietary one on productivity tasks, the economic moat that supported subscription pricing narrows.

Investors will be watching two things: (1) whether K2’s real-world performance (outside of company-published benchmarks) matches the launch claims, and (2) whether Moonshot’s serving economics and license let startups build profitable services around the model without re-creating the expensive infrastructure stacks that firms like OpenAI and Anthropic run. Early signs — including the Hugging Face release and bench numbers — have already stirred conversation in trading desks and boardrooms.

The geopolitical and security angle

The new release also re-ignites familiar geopolitical anxieties. Western policymakers have tended to view advanced models from Chinese labs through lenses of control, censorship, or national advantage — sometimes rightly, sometimes not. Moonshot’s choice to open weights complicates the usual narrative: these models are now not just national trophies but engineering artifacts anyone can inspect, run, and reuse. That has pros (transparency, rapid innovation) and cons (easier proliferation of risky capabilities).

Security-conscious organizations will ask tougher questions: who trained the data, what red-teaming was done, how does the model handle disallowed content, and could it be adapted to misuse at scale? Those are not rhetorical — several governments reacted fast to similar releases earlier in the year, and security reviews will matter more now that agentic capabilities are becoming affordable.

So, is this a Sputnik moment or a flash in the pan?

There’s a reasonable middle ground. The Kimi K2 launch is an important data point: it shows that MoE + careful tool integration can push open models into territory that used to belong only to deep-pocketed proprietary teams. But the history of AI hype is long — claims need independent verification, third-party audits, and time in production to prove robustness.

If Kimi K2’s performance and cost story hold under scrutiny, expect three things to happen quickly: (1) more aggressive open-weight launches from other firms, (2) renewed pressure on proprietary vendors to justify their pricing or open parts of their stacks, and (3) faster conversations about how to regulate or audit agentic systems that can interact with the web and other services autonomously. If the numbers don’t hold, the launch will still have shifted perceptions — at minimum, it forces incumbents to explain why closed models remain worth paying for.

The practical takeaway

If you build software or manage AI procurement:

  • Try the model yourself (it’s available on Hugging Face and Moonshot’s platforms) and run the tasks you care about; benchmarks are a starting point, not a guarantee.
  • Treat the $4.6M training figure as unverified until clearer accounting appears. Don’t make budget decisions based purely on press numbers.
  • Keep an eye on license terms: “open” doesn’t always mean “no strings” — Moonshot’s release uses a modified MIT license with some commercial restrictions at scale. Read the legal fine print before embedding the model in revenue-generating products.

Final note

The AI landscape is changing fast. Kimi K2 Thinking is the latest example of that velocity: it raises hard, useful questions about cost, openness, and what “frontier” AI really means. Whether it ultimately reshapes the market or ends up as an overhyped milestone depends not on launch tweets, but on repeated, independent testing and real-world use. For now, the industry has a new model to probe — and a new argument to settle about whether the next big leap will be proprietary or open.


Discover more from GadgetBond

Subscribe to get the latest posts sent to your email.

Leave a Comment

Leave a ReplyCancel reply

Most Popular

Amazon Prime still offers free trials in 2026 — if you know where to look

Windows 11 needs 4x the RAM for the same work and MacBook Neo proves it

Stop rebooting: grab 35% off Parallels Desktop and run Windows on your Mac the easy way

Google Doodle stitches up a shamrock logo for St. Patrick’s Day 2026

iOS 27 could be the Snow Leopard of the iPhone

Also Read
Bright abstract background in soft orange, pink, and yellow gradients with two rounded white buttons centered, labeled “5.4 mini” and “5.4 nano” in gray text, representing OpenAI’s small GPT-5.4 models.

OpenAI launches GPT-5.4 mini and nano for faster, cheaper AI

An illustration of a lone person sitting at a desk with a laptop on a surreal, softly lit landscape under a starry night sky, with sweeping teal and gold bands across the sky and the white text “comet enterprise” prominently centered in the middle.

Perplexity unveils Comet Enterprise with granular admin and security controls

Wide banner showing the Perplexity logo and text on the left and the NVIDIA logo on the right against a dark background, above a stylized green landscape made of mossy hills overlaid with glowing white data points.

Perplexity enters NVIDIA Nemotron Coalition as a founding partner

Minimal diagram showing ten labeled cognitive abilities arranged in a circle around the words “Cognitive Abilities,” including perception, generation, attention, learning, memory, reasoning, metacognition, executive functions, problem solving, and social cognition, each with a small blue icon.

Google DeepMind maps a new way to score AI systems on the road to AGI

Bright lime‑green and black Nike Powerbeats Pro 2 wireless workout earbuds with over‑ear hooks are shown floating in front of their open charging case, which features a speckled Volt pattern on the base and the “JUST DO IT.” slogan inside the lid.

Special-edition Nike Powerbeats Pro 2 land with Volt design and ANC

Centered FIFA World Cup 2026 logo on a black background, featuring the golden World Cup trophy inside a bold white “26” with the word “FIFA” below and “World Cup 2026” in white text.

YouTube is now a preferred platform for the FIFA World Cup 2026

Black background graphic with the word “colab” in bold orange lowercase letters on the left, an orange heart emoji in the center, and the white Model Context Protocol logo with the text “Model Context Protocol” on the right.

Google’s Colab MCP server lets any AI agent run your notebooks

Mobile screenshot showing two Amazon app checkout screens side by side on an orange background, with the left phone displaying a cart containing Huggies Size 3 Little Snugglers diapers for 23.17 dollars and options to proceed to checkout, change quantity, delete, or save for later, and the right phone showing delivery choices highlighting a paid “Arriving in 1 hour” option for 9.99 dollars, a “In 3 hours” option for 4.99 dollars, and a free Same-Day delivery window later in the day.

Amazon launches ultra-fast 1-hour and 3-hour delivery in more US cities

Company Info
  • Homepage
  • Support my work
  • Latest stories
  • Company updates
  • GDB Recommends
  • Daily newsletters
  • About us
  • Contact us
  • Write for us
  • Editorial guidelines
Legal
  • Privacy Policy
  • Cookies Policy
  • Terms & Conditions
  • DMCA
  • Disclaimer
  • Accessibility Policy
  • Security Policy
  • Do Not Sell or Share My Personal Information
Socials
Follow US

Disclosure: We love the products we feature and hope you’ll love them too. If you purchase through a link on our site, we may receive compensation at no additional cost to you. Read our ethics statement. Please note that pricing and availability are subject to change.

Copyright © 2026 GadgetBond. All Rights Reserved. Use of this site constitutes acceptance of our Terms of Use and Privacy Policy | Do Not Sell/Share My Personal Information.