GadgetBond

  • Latest
  • How-to
  • Tech
    • AI
    • Amazon
    • Apple
    • CES
    • Computing
    • Creators
    • Google
    • Meta
    • Microsoft
    • Mobile
    • Samsung
    • Security
    • Xbox
  • Transportation
    • Audi
    • BMW
    • Cadillac
    • E-Bike
    • Ferrari
    • Ford
    • Honda Prelude
    • Lamborghini
    • McLaren W1
    • Mercedes
    • Porsche
    • Rivian
    • Tesla
  • Culture
    • Apple TV
    • Disney
    • Gaming
    • Hulu
    • Marvel
    • HBO Max
    • Netflix
    • Paramount
    • SHOWTIME
    • Star Wars
    • Streaming
Add GadgetBond as a preferred source to see more of our stories on Google.
Font ResizerAa
GadgetBondGadgetBond
  • Latest
  • Tech
  • AI
  • Deals
  • How-to
  • Apps
  • Mobile
  • Gaming
  • Streaming
  • Transportation
Search
  • Latest
  • Deals
  • How-to
  • Tech
    • Amazon
    • Apple
    • CES
    • Computing
    • Creators
    • Google
    • Meta
    • Microsoft
    • Mobile
    • Samsung
    • Security
    • Xbox
  • AI
    • Anthropic
    • ChatGPT
    • ChatGPT Atlas
    • Gemini AI (formerly Bard)
    • Google DeepMind
    • Grok AI
    • Meta AI
    • Microsoft Copilot
    • OpenAI
    • Perplexity
    • xAI
  • Transportation
    • Audi
    • BMW
    • Cadillac
    • E-Bike
    • Ferrari
    • Ford
    • Honda Prelude
    • Lamborghini
    • McLaren W1
    • Mercedes
    • Porsche
    • Rivian
    • Tesla
  • Culture
    • Apple TV
    • Disney
    • Gaming
    • Hulu
    • Marvel
    • HBO Max
    • Netflix
    • Paramount
    • SHOWTIME
    • Star Wars
    • Streaming
Follow US
AIAppleTech

Apple wants its AI to help the blind navigate cities with just audio

SceneScout, Apple’s latest AI research project, aims to give visually impaired users detailed street-level information before they travel.

By
Shubham Sawarkar
Shubham Sawarkar's avatar
ByShubham Sawarkar
Editor-in-Chief
I’m a tech enthusiast who loves exploring gadgets, trends, and innovations. With certifications in CISCO Routing & Switching and Windows Server Administration, I bring a sharp...
Follow:
- Editor-in-Chief
Jul 8, 2025, 10:01 AM EDT
Share
Apple logo
Image: Sutterstock
SHARE

Imagine standing at a busy Parisian intersection for the first time, the Eiffel Tower looming somewhere beyond the rooftops, but you can’t see any of it. For blind and low‑vision (BLV) travelers, that uncertainty about what’s around the corner can be a real barrier to independent exploration. Apple’s latest research, however, aims to change that. In a new paper from Apple Machine Learning Research, engineers detail SceneScout, a multimodal AI agent powered by large language models that can “look” at Street View–style imagery and narrate what it sees—before you ever step outside.

Current pre‑travel tools for BLV users—think turn‑by‑turn navigation or simple landmark callouts—offer route instructions but little in the way of landscape context. You might know you’ll pass a coffee shop or a bus stop, but what about the row of shady trees at that last turn, or the wide open plaza before the museum steps? These visual cues, so obvious to sighted folks browsing Google Street View or Apple Maps’ Look Around, remain hidden if you can’t see them. Researchers Leah Findlater and Cole Gleason from Apple, along with Columbia University’s Gaurav Jain, argue that richer landscape descriptions could boost confidence and safety for BLV travelers.

At its core, SceneScout is an AI “agent”: it ingests a series of panoramic images (via Apple Maps APIs), reasons over them with an LLM (OpenAI’s GPT‑4o in Apple’s tests), and then generates natural‑language narrations. The interface is web‑based and fully compliant with W3C accessibility guidelines, ensuring that screen readers like VoiceOver can relay everything smoothly. SceneScout offers two distinct interaction modes:

  1. Route Preview: You supply a start and end point, and SceneScout steps through block after block, describing each segment in sequence. It might tell you: “At the corner of 5th and Main, you’ll see a row of mature oak trees lining the sidewalk, followed by a tactile crosswalk with raised bumps.” These commentary snippets help you build a mental map of tactile and visual landmarks before you set out.
  2. Virtual Exploration: More like free‑roaming Street View, this mode lets you “move” through the imagery however you please. As you navigate, SceneScout reports on whatever enters its frame—shop awnings, lamppost styles, or even subtle curb cuts for wheelchair access. It’s a choose‑your‑own‑adventure for mapping the unseen.

In a user study with ten BLV participants (the paper’s N=10), SceneScout uncovered environmental details that existing apps simply don’t surface. The majority of its descriptions were deemed accurate 72% of the time, and a whopping 95% of the stable elements (think permanent fixtures like buildings or trees) remained correct even when Street View data was a bit out of date.

That said, the system isn’t perfect. Some descriptions included “subtle and plausible errors”—a sign misread as a café logo, or a construction zone described as a bike rack—missteps that BLV users can’t verify without sight. These errors, while infrequent, underscore the need for cautious design and user trust modeling.

Participants had plenty of ideas for enhancing SceneScout:

  • Personalized narration: Over multiple sessions, SceneScout could learn what you care about—perhaps you prefer architectural details over street art, or tactile information on sidewalk textures—and tailor its commentary accordingly.
  • Pedestrian viewpoint: Shifting from the “car‑mounted camera” angle of Street View to a ground‑level perspective would align descriptions with what you actually feel underfoot.
  • Real‑time in situ mode: Rather than previewing routes in advance, imagine wearing bone‑conduction headphones or using a “transparency” overlay on Apple Glass that narrates your surroundings live, synced via gyroscope and compass rather than forcing you to line up a camera exactly right.

These suggestions point toward a hybrid future: part pre‑travel planning, part live guidance—each amplifying the other.

While SceneScout today relies on pre‑captured panorama data, it hints at broader possibilities for Apple’s upcoming hardware. Rumors swirl around AirPods equipped with forward‑facing cameras and Apple Glass smart glasses, both potentially streaming live video feeds into Apple’s “Intelligence” ecosystem. In that scenario, SceneScout–style narration could happen in real time, with freshly captured frames instead of months‑old Street View images.

Picture strolling down an unfamiliar block: your AirPods whispering, “The crosswalk ahead has raised tactile strips; a café patio with metal chairs is on your right,” all without glancing at your phone. That’s the kind of assisted autonomy Apple seems to be exploring.

As with any Apple Machine Learning Research paper, SceneScout shows what could be possible, not necessarily what’s coming to your next iOS update. There’s no guarantee this exact feature will ship, but the research illuminates Apple’s thinking around accessibility, computer vision, and LLM‑driven agents. For BLV travelers craving more context about the world around them—whether pre‑trip or on the move—the future looks a little more navigable.


Discover more from GadgetBond

Subscribe to get the latest posts sent to your email.

Topic:Apple Intelligence
Most Popular

How to stream all five seasons of The Boys right now

Claude for Microsoft 365 is now generally available

Anthropic launches full Claude Platform on AWS with native integration

OpenAI upgrades its Realtime API with three new voice AI models

AI-powered Google Finance launches across Europe now

Also Read
Apple App Store logo

Apple rebalances South Korea App Store pricing to keep global tiers in line

Close-up mockup of an iPhone displaying an RCS text conversation in the Messages app. The chat is with a contact named “Grace,” shown with a profile photo at the top. Below the contact name, the interface displays “Text Message • RCS” and “Encrypted,” indicating secure RCS messaging support. A green message bubble asks, “How are you doing?” and the reply says, “I’m good thanks. Just got back from a camping trip in Yosemite!” The screen uses Apple’s clean light-mode Messages interface with the Dynamic Island visible at the top.

iOS 26.5 update adds secure RCS messaging for iPhone users

Modern kitchen interior featuring a Samsung Bespoke AI Refrigerator Family Hub in a soft green-themed space. The large white refrigerator has a built-in display panel on the upper door showing abstract artwork. Surrounding the refrigerator are matching pastel green cabinets, a kitchen island with open shelving, and a dark countertop with a gold-tone faucet. Natural light enters through a large window beside the minimalist kitchen setup, highlighting the clean and modern design.

Gemini AI comes to Samsung’s Bespoke AI refrigerator Family Hub screen

Screenshot of the Windows 11 touchpad “Scroll & zoom” settings page in dark mode. The panel shows multiple enabled touchpad options with blue checkmarks, including “Drag two fingers to scroll,” “Automatic scrolling at edge,” “Automatic scrolling with pressure,” “Accelerated scrolling,” and “Pinch to zoom.” A “Single-finger scrolling” option is set to “Right Side.” The interface also includes sliders for “Scroll speed” and “Zoom speed,” along with a dropdown menu for “Scrolling direction” set to “Down motion scrolls up.”

Windows 11 adds custom scroll sliders to Settings

Illustration comparing Gmail writing suggestions before and after personalization. On the left, under the heading “Today,” a generic email draft to “Alex Liu” uses formal, template-style language with placeholder text. On the right, under “With personalization,” the same draft is rewritten in a more natural and conversational tone with specific influencer campaign details, highlighted text snippets, and a personalized sign-off. Along the right side are three colored labels reading “Personalized tone and style,” “Based on past emails,” and “Based on Drive files,” emphasizing how Gmail uses user context to improve writing suggestions.

Help me write in Gmail gets smarter with personalization

Three smartphone mockups displaying a ChatGPT trusted contact safety feature. The first screen explains how adding a trusted contact can help someone receive support during serious mental health or safety concerns. The second screen shows a form for inviting a trusted contact with fields for name, phone, email, and consent confirmation. The third screen confirms that the invitation was sent and offers an option to send a personal note.

OpenAI adds an emergency-style Trusted Contact option inside ChatGPT settings

Futuristic digital artwork showing a glowing computer face icon inside a translucent glass-like sphere resting on a soft grassy surface. Floating reflective droplets surround the sphere against a dark black background, creating a surreal and minimalist sci-fi atmosphere.

The new Perplexity Mac app ships with Personal Computer

Icon of Apple App Store mobile application on iPhone.

Apple now allows gambling apps on Brazil App Store with license requirements

Company Info
  • Homepage
  • Support my work
  • Latest stories
  • Company updates
  • GDB Recommends
  • Daily newsletters
  • About us
  • Contact us
  • Write for us
  • Editorial guidelines
Legal
  • Privacy Policy
  • Cookies Policy
  • Terms & Conditions
  • DMCA
  • Disclaimer
  • Accessibility Policy
  • Security Policy
  • Do Not Sell or Share My Personal Information
Socials
Follow US

Disclosure: We love the products we feature and hope you’ll love them too. If you purchase through a link on our site, we may receive compensation at no additional cost to you. Read our ethics statement. Please note that pricing and availability are subject to change.

Copyright © 2026 GadgetBond. All Rights Reserved. Use of this site constitutes acceptance of our Terms of Use and Privacy Policy | Do Not Sell/Share My Personal Information.