Google’s Genie 3 AI can now create interactive 3D video game worlds in real time

DeepMind’s Genie 3 can now generate immersive, interactive environments that remember object placement and respond to user inputs.

By Shubham Sawarkar, Editor-in-Chief
Aug 6, 2025, 1:08 PM EDT
Google Genie 3 AI model
Image: Google DeepMind

On August 5, 2025, Google DeepMind unveiled Genie 3, the latest iteration of its “world” model, which generates richly detailed, interactive 3D environments from a simple text prompt. Its predecessor, Genie 2, offered only 10–20 seconds of navigable content per generation; Genie 3 delivers several minutes of continuous, real-time interaction at 720p resolution and 24 frames per second. More impressively, it “remembers” where objects are placed, so painted walls stay painted and chalkboards keep their writing even if you look away and return moments later.

World models simulate digital spaces much like handcrafted video games, but instead rely entirely on neural networks to conjure every rock, tree, and rainstorm. In December 2024, DeepMind introduced Genie 2, which could generate short, interactive sequences based on a given image. Though groundbreaking, its impact was limited by its brief playtime and inconsistent memory: objects might shift or vanish if you revisited an earlier location. Seeking to break through these constraints, DeepMind’s researchers spent the past eight months enhancing consistency, immersion, and duration—pushing the boundary from tens of seconds to multiple minutes of play.

What Genie 3 brings to the table

  1. Extended interaction horizons
    Users can wander, experiment, and explore for several consecutive minutes—up from just seconds—opening up possibilities for in-depth educational simulations, longer-form game prototypes, and more robust AI-agent training scenarios.
  2. Persistent world memory
    Genie 3 retains the state of surfaces and objects for about a minute. Paint, graffiti, or rubble you create or move will persist if you loop back within that window, mimicking the spatial coherence we expect in traditional game engines.
  3. Promptable world events
    Through additional text inputs, users can dynamically alter weather conditions, spawn non-player characters, or trigger environmental effects like earthquakes or snowfall—without retraining or restarting the environment.
  4. Real-time performance
    Running at 24 fps at 720p, Genie 3 strikes a balance between visual fidelity and computational feasibility. The output is not photorealistic, but the roughly 0.9 MP frame size (1280×720) helps keep generation fast enough for smooth, real-time interaction.
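
To put the real-time numbers in perspective, here is a quick back-of-the-envelope calculation. It assumes “720p” means the standard 1280×720 frame and takes the one-minute memory figure quoted above at face value; the function names are illustrative only and do not come from DeepMind.

```python
# Rough numbers for a 720p, 24 fps stream.
# Assumption: "720p" is the standard 1280x720 frame size.
WIDTH, HEIGHT = 1280, 720
FPS = 24


def frame_megapixels(width: int, height: int) -> float:
    """Pixels per frame, in millions."""
    return width * height / 1_000_000


def frame_budget_ms(fps: int) -> float:
    """Wall-clock time available to produce one frame, in milliseconds."""
    return 1000 / fps


def frames_in(seconds: float, fps: int) -> int:
    """Number of frames generated over the given time span."""
    return round(seconds * fps)


if __name__ == "__main__":
    print(f"{frame_megapixels(WIDTH, HEIGHT):.2f} MP per frame")  # 0.92 MP per frame
    print(f"{frame_budget_ms(FPS):.1f} ms per frame")             # 41.7 ms per frame
    print(f"{frames_in(60, FPS)} frames per minute")              # 1440 frames per minute
```

In other words, the model has a little over 40 milliseconds to produce each frame, and a minute of visual memory corresponds to roughly 1,440 frames of history.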

DeepMind’s engineers built Genie 3 atop two key advances: the video-generation prowess of Veo 3 (which learned physics through self-supervised video training) and the short-term spatial memory innovations tested in Genie 2. By combining a large transformer backbone with a novel “attentive memory” mechanism, the model reasons over past frames to uphold consistency—even though no explicit caching or hard-coded physics engine is used. According to DeepMind, Genie 3 learned these rules implicitly, absorbing patterns of object permanence and physical interaction as part of its training regime.
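
The idea of consistency coming from reasoning over past frames, rather than from a stored world state, can be illustrated with a toy. The sketch below is emphatically not DeepMind's mechanism: `ToyWorld`, its methods, and the string-based actions are hypothetical, and a real model works on learned representations of video frames. It only shows why a bounded history implies a bounded memory, using the "about a minute" figure at 24 fps.

```python
from collections import deque

FPS = 24
MEMORY_SECONDS = 60             # "about a minute", per the article
WINDOW = FPS * MEMORY_SECONDS   # 1440 steps of retained history


class ToyWorld:
    """Hypothetical toy: answers questions by scanning recent history only.

    There is no table of world state. Whether a wall looks painted is
    decided solely from the steps still inside the history window.
    """

    def __init__(self, window: int = WINDOW) -> None:
        # deque(maxlen=...) silently drops the oldest step when full.
        self.history = deque(maxlen=window)

    def step(self, action: str, location: str) -> None:
        """Record one frame's worth of interaction."""
        self.history.append((action, location))

    def is_painted(self, location: str) -> bool:
        """True if a paint action at this location is still in the window."""
        return any(a == "paint" and loc == location for a, loc in self.history)


if __name__ == "__main__":
    world = ToyWorld(window=5)       # tiny window to make the effect visible
    world.step("paint", "wall_a")
    for _ in range(3):
        world.step("look", "elsewhere")
    print(world.is_painted("wall_a"))  # True: the paint step is still in the window
    for _ in range(5):
        world.step("look", "elsewhere")
    print(world.is_painted("wall_a"))  # False: the paint step has aged out
```

The second check is the toy analogue of looking away for longer than the memory horizon: once the relevant step leaves the window, nothing is left to keep the wall painted.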

Real-world applications

  • Education & training: Imagine medical students exploring a virtual anatomy lab where instruments remain on the table as you step away and return, or history classes wandering through a dynamically reconstructed ancient city.
  • Game development: Indie studios could prototype level designs on the fly, spawning new NPCs or environmental hazards without writing a single line of code.
  • AI agent research: For researchers in embodied AI, world models offer scalable, safe testbeds. Agents can learn navigation, object manipulation, and multi-step problem solving in minutes rather than hours of costly real-world trials.

Current limitations

Despite its strides, Genie 3 remains a research preview, accessible only to a small cohort of academics and creators under strict safeguards. Some notable constraints include:

  • Limited interaction duration: Genie 3 supports only “a few minutes” of continuous interaction, and its memory of earlier changes reaches back about a minute, well short of the multi-hour sessions needed for comprehensive agent training.
  • Resolution ceiling: While 720p is sufficient for prototypes, serious game studios will want 1080p or higher for commercial releases.
  • Text legibility: On-screen text is usually crisp only when it is spelled out in the prompt, which limits dynamic signage or UI elements generated on the fly.
  • Complex multi-agent dynamics: Simulating multiple independent actors in one space can lead to unpredictable behaviors, as the model’s “agency” remains rudimentary.

DeepMind is treating Genie 3’s rollout with caution. By restricting early access, the team hopes to study misuse scenarios—such as generating disorienting or dangerous virtual environments—and build robust mitigation strategies. Google’s “responsibility & safety” framework emphasizes continuous monitoring, red-teaming, and bias evaluation, signaling that full public release may hinge on satisfying stringent ethical benchmarks.

Looking forward, DeepMind plans to explore higher-resolution outputs, longer memory spans, and more naturalistic physics interactions. There’s also talk of integrating Genie 3 into the Gemini ecosystem, potentially allowing users to summon 3D worlds alongside text and images, all within one unified AI assistant. Whether for training the next generation of AI or delighting gamers with procedurally generated adventures, Genie 3 underscores DeepMind’s vision: that world models are a pivotal stepping stone toward truly general intelligence.

Genie 3 marks a significant leap in AI-driven simulation, extending playtime, bolstering memory, and opening doors to dynamic world events—all in real time. While still in its research infancy, the model’s capabilities hint at a future where creating and exploring vast digital realms might be as simple as typing a sentence. As access widens and technology matures, we’re likely to see these virtual universes become mainstays in education, entertainment, and AI research—reshaping how we build and interact with digital worlds.

