GadgetBond

  • Latest
  • How-to
  • Tech
    • AI
    • Amazon
    • Apple
    • CES
    • Computing
    • Creators
    • Google
    • Meta
    • Microsoft
    • Mobile
    • Samsung
    • Security
    • Xbox
  • Transportation
    • Audi
    • BMW
    • Cadillac
    • E-Bike
    • Ferrari
    • Ford
    • Honda Prelude
    • Lamborghini
    • McLaren W1
    • Mercedes
    • Porsche
    • Rivian
    • Tesla
  • Culture
    • Apple TV
    • Disney
    • Gaming
    • Hulu
    • Marvel
    • HBO Max
    • Netflix
    • Paramount
    • SHOWTIME
    • Star Wars
    • Streaming
Add GadgetBond as a preferred source to see more of our stories on Google.
Font ResizerAa
GadgetBondGadgetBond
  • Latest
  • Tech
  • AI
  • Deals
  • How-to
  • Apps
  • Mobile
  • Gaming
  • Streaming
  • Transportation
Search
  • Latest
  • Deals
  • How-to
  • Tech
    • Amazon
    • Apple
    • CES
    • Computing
    • Creators
    • Google
    • Meta
    • Microsoft
    • Mobile
    • Samsung
    • Security
    • Xbox
  • AI
    • Anthropic
    • ChatGPT
    • ChatGPT Atlas
    • Gemini AI (formerly Bard)
    • Google DeepMind
    • Grok AI
    • Meta AI
    • Microsoft Copilot
    • OpenAI
    • Perplexity
    • xAI
  • Transportation
    • Audi
    • BMW
    • Cadillac
    • E-Bike
    • Ferrari
    • Ford
    • Honda Prelude
    • Lamborghini
    • McLaren W1
    • Mercedes
    • Porsche
    • Rivian
    • Tesla
  • Culture
    • Apple TV
    • Disney
    • Gaming
    • Hulu
    • Marvel
    • HBO Max
    • Netflix
    • Paramount
    • SHOWTIME
    • Star Wars
    • Streaming
Follow US
AIAndroidGoogleGoogle PixelMobile

Google’s Gemini Live AI can now highlight objects on your screen in real time

New Gemini Live update rolls out to Pixel 10 first, expanding to Android and iOS.

By
Shubham Sawarkar
Shubham Sawarkar's avatar
ByShubham Sawarkar
Editor-in-Chief
I’m a tech enthusiast who loves exploring gadgets, trends, and innovations. With certifications in CISCO Routing & Switching and Windows Server Administration, I bring a sharp...
Follow:
- Editor-in-Chief
Aug 22, 2025, 11:24 AM EDT
Share
Smartphone displaying live video of a hand holding a yellow flower; 'Gemini Live' below.
Image: Google
SHARE

Google’s Gemini Live — the real-time, talking version of Gemini that you can have conversations with while sharing pictures, video, or your screen — is getting a practical, slightly sci-fi upgrade: it will now point to things for you. Not with words alone, but by literally highlighting items on your phone’s display while you share your camera or screen, so the assistant can show as well as tell.

Think of it as a GPS arrow for the physical world. If you hold your phone up to a messy toolbox and ask which screwdriver fits a particular screw, Gemini Live can now draw attention to the correct tool on your live camera feed. If you’re comparing two coats you’re holding up to the camera, it can mark the one that best matches your “warm, water-resistant” criteria. Google says the new “visual guidance” is meant to help faster and less ambiguous — especially for tasks that live in a mix of the digital and physical world.

The visual-guidance feature will debut on Google’s new Pixel 10 phones at launch (Google’s hardware event confirmed the Pixel 10 release for August 28, 2025), and Google says it will begin rolling the capability out to other Android devices “at the same time,” with iPhone users getting access in the “coming weeks.” That staged rollout is typical for features that tie closely to device sensors and on-device AI.

The changes aren’t only visual. Google is expanding Gemini Live’s ability to interact with other apps on your phone. The assistant will be able to take actions like drafting texts in Messages, placing calls via Phone, or scheduling things in Clock and Calendar during an ongoing live conversation. Importantly, Gemini Live keeps the conversation “live” in the literal sense: you can interrupt it with a real request — for example, while it’s describing a route, you can say, “This route looks good. Now, send a message to Alex that I’m running about 10 minutes late,” and Gemini will draft or send that message for you. That smoother handoff between conversation and action is the kind of UX Google hopes will make conversational AI feel less like a Q&A bot and more like a helpful companion.

Google is also rolling out an updated audio model for Gemini Live that it says better captures the “key elements of human speech” — intonation, rhythm, pitch — so the assistant’s voice sounds more natural and expressive. You’ll be able to tweak speaking speed, and the assistant may adopt different tones or even accents for storytelling or role-based narration. The goal is less robotic recitation and more expressive speech that matches the context of what you’re asking about.

This feels like a pragmatic move. People already use their phones to get help with hands-on tasks — fixing things, cooking, shopping, or comparing objects. Adding a simple visual overlay turns an otherwise messy descriptive exchange (“Which one is the bigger bolt?”) into a single, glanceable interaction. For field work, remote troubleshooting, or accessibility scenarios where a sighted helper needs to point something out for someone else, the feature could be genuinely useful.

As useful as it sounds, visual guidance raises the expected set of concerns. First, it requires you to actively share your camera or screen during a live conversation, which is an important privacy switch — you must opt in. Second, computer vision isn’t perfect: highlights could be wrong, distract you, or offer false confidence in safety-critical situations (imagine a misidentified wire in a DIY electrical job). Finally, device compatibility and latency will matter: a smooth experience depends on sensor quality, network or on-device processing, and how quickly Gemini can analyze the frame and draw an overlay. Google’s posts and demos emphasize opt-in controls and a staged rollout, but real-world testing will tell how well those promises pan out.


Discover more from GadgetBond

Subscribe to get the latest posts sent to your email.

Topic:Gemini AI (formerly Bard)
Most Popular

OpenAI expands GPT-Rosalind access with new Rosalind Biodefense program

Codex computer use comes to Windows, with mobile in the loop

Anthropic raises $65 billion, nears trillion-dollar status

Claude Opus 4.8 now powers Perplexity Max and Computer

Qualcomm’s new Snapdragon C is the budget laptop chip nobody knew they were waiting for

Also Read
Grocery, gardening, and household items from a Walmart delivery are arranged on a front doorstep outside a brick home. A blue Walmart shopping bag, a bag of Miracle-Gro potting mix, bread, and potted flowers sit on a welcome mat, surrounded by decorative planters and colorful blooming plants near a wooden front door.

Walmart’s 30-minute delivery is now live in 33 U.S. cities

Acer Aspire Go 15 (AG15-Q31P) powered by Qualcomm Snapdragon C chip

Acer Aspire Go 15 is the first laptop ever built on Qualcomm’s new Snapdragon C chip

Acer Swift Spin 14 AI (SFSP14-Q51T) laptop

Acer’s Swift Spin 14 AI is the convertible laptop that finally gets Snapdragon right

Split-panel graphic featuring a torn sheet of grid paper with black hand-drawn scribbles on a light blue background on the left, and a minimalist illustration of an open hand holding a connected node network symbol on a terracotta-orange background on the right, representing creativity, ideas, and collaborative intelligence.

Claude Opus 4.8 launches with sharper judgment and new controls

Minimal hand-drawn illustration of a hanging presentation screen displaying a coding symbol (“”), suspended above a stylized script-like “pm” mark on a solid terracotta-orange background, representing programming, development workflows, or coding education.

Claude Code now orchestrates its own dynamic workflows

Perplexity and Microsoft logos displayed side by side against a night sky with circular star trails above a dark mountain landscape, symbolizing a partnership or collaboration between the two companies.

Perplexity Computer now works natively in Microsoft’s core productivity apps

Minimal flat illustration of code review: an orange background with two large black curly braces framing the center, where a white octagonal icon containing a simple code symbol “” is examined by a black magnifying glass.

Anthropic’s security-guidance plugin makes Claude Code less reckless

Perplexity illustration. The image depicts a dark, abstract interior space with vertical columns and beams of light streaming through, creating a play of shadows and light. In the center, there is a white geometric Perplexity logo resembling a stylized star or snowflake. The light beams display a spectrum of colors, adding a surreal and intriguing atmosphere to the scene.

Perplexity open-sources its blazing-fast Unigram tokenizer

Company Info
  • Homepage
  • Support my work
  • Latest stories
  • Company updates
  • GDB Recommends
  • Daily newsletters
  • About us
  • Contact us
  • Write for us
  • Editorial guidelines
Legal
  • Privacy Policy
  • Cookies Policy
  • Terms & Conditions
  • DMCA
  • Disclaimer
  • Accessibility Policy
  • Security Policy
  • Do Not Sell or Share My Personal Information
Socials
Follow US

Disclosure: We love the products we feature and hope you’ll love them too. If you purchase through a link on our site, we may receive compensation at no additional cost to you. Read our ethics statement. Please note that pricing and availability are subject to change.

Copyright © 2026 GadgetBond. All Rights Reserved. Use of this site constitutes acceptance of our Terms of Use and Privacy Policy | Do Not Sell/Share My Personal Information.