By using this site, you agree to the Privacy Policy and Terms of Use.
Accept

GadgetBond

  • Latest
  • How-to
  • Tech
    • AI
    • Amazon
    • Apple
    • CES
    • Computing
    • Creators
    • Google
    • Meta
    • Microsoft
    • Mobile
    • Samsung
    • Security
    • Xbox
  • Transportation
    • Audi
    • BMW
    • Cadillac
    • E-Bike
    • Ferrari
    • Ford
    • Honda Prelude
    • Lamborghini
    • McLaren W1
    • Mercedes
    • Porsche
    • Rivian
    • Tesla
  • Culture
    • Apple TV
    • Disney
    • Gaming
    • Hulu
    • Marvel
    • HBO Max
    • Netflix
    • Paramount
    • SHOWTIME
    • Star Wars
    • Streaming
Add GadgetBond as a preferred source to see more of our stories on Google.
Font ResizerAa
GadgetBondGadgetBond
  • Latest
  • Tech
  • AI
  • Deals
  • How-to
  • Apps
  • Mobile
  • Gaming
  • Streaming
  • Transportation
Search
  • Latest
  • Deals
  • How-to
  • Tech
    • Amazon
    • Apple
    • CES
    • Computing
    • Creators
    • Google
    • Meta
    • Microsoft
    • Mobile
    • Samsung
    • Security
    • Xbox
  • AI
    • Anthropic
    • ChatGPT
    • ChatGPT Atlas
    • Gemini AI (formerly Bard)
    • Google DeepMind
    • Grok AI
    • Meta AI
    • Microsoft Copilot
    • OpenAI
    • Perplexity
    • xAI
  • Transportation
    • Audi
    • BMW
    • Cadillac
    • E-Bike
    • Ferrari
    • Ford
    • Honda Prelude
    • Lamborghini
    • McLaren W1
    • Mercedes
    • Porsche
    • Rivian
    • Tesla
  • Culture
    • Apple TV
    • Disney
    • Gaming
    • Hulu
    • Marvel
    • HBO Max
    • Netflix
    • Paramount
    • SHOWTIME
    • Star Wars
    • Streaming
Follow US
AIGoogleTech

Gemini app audio support allows uploads up to three hours

The Gemini app now lets users upload audio files up to three hours in length while free accounts are capped at ten minutes and five daily prompts.

By
Shubham Sawarkar
Shubham Sawarkar's avatar
ByShubham Sawarkar
Editor-in-Chief
I’m a tech enthusiast who loves exploring gadgets, trends, and innovations. With certifications in CISCO Routing & Switching and Windows Server Administration, I bring a sharp...
Follow:
- Editor-in-Chief
Sep 8, 2025, 3:30 PM EDT
Share
Google Gemini AI. The image shows the word "Gemini" written in a modern, sans-serif font on a black background. The letters "G" and "e" are in a gradient blue color, while the letters "m," "i," "n," and "i" transition from a light blue to a light beige color. Above the second "i" in "Gemini," there is a stylized star or sparkle symbol, adding a celestial or futuristic touch to the design.
Image: Google
SHARE

Google spent much of the past year stuffing new powers into Gemini — its multimodal AI stack — and on Monday it added one of the most practical: you can now drop audio files into the Gemini app and ask the model to listen, transcribe, summarize, and pull out the bits that matter. It’s the kind of quality-of-life change that looks small on a keynote slide but can transform how people turn conversations into work.

What’s new

  • The Gemini app now accepts uploaded audio files across web and mobile.
  • Free users can upload up to 10 minutes of audio and get five prompts per day; paid tiers (AI Pro / AI Ultra) increase that allowance — Google lists uploads of up to three hours for paid subscribers. All users can attach up to 10 files per prompt, including ZIP archives.
  • At the same time, Google expanded Search (AI Mode) to five additional languages — Hindi, Indonesian, Japanese, Korean and Brazilian Portuguese — thanks to the integration of Gemini 2.5.
  • NotebookLM, Google’s document-driven research tool, got new report styles — study guides, blog-post formats, briefing docs, and even flashcards and quizzes — in over 80 languages. The feature will let you shape tone and structure and generate study/teaching materials from uploaded files.

Why this matters

If you’ve ever ended a meeting with a tangle of voice notes, a stack of lecture recordings, or interview files and promised yourself you’ll “go back and summarize,” this removes a friction point. Instead of exporting audio, firing up a separate transcription service, and pasting a transcript into a prompt, you can hand Gemini the file and ask it to do everything in one place: identify speakers, summarize action items, produce a short excerpt for social, or generate study flashcards. That’s useful for students, podcasters, reporters, researchers, and busy product teams.

The details (limits, formats, and the fine print)

Google’s official help pages and coverage of the rollout confirm the tiered limits: free accounts have modest caps (10 minutes total audio, five daily prompts), while AI Pro and AI Ultra subscribers get significantly more headroom — up to three hours of audio uploads and priority access to features. The app supports multiple audio file formats and accepts up to 10 files in a single prompt (including ZIPs), which makes batch uploads possible.

Android Central and other hands-on pieces add practical color: the app transcribes and highlights key moments, and surfaces speaker separation and actionable insights — the things people actually want from audio analysis. That’s also why Google’s Josh Woodward (VP of Google Labs and Gemini) framed audio support as the “#1 request.”

What Google shipped alongside audio

This isn’t an isolated tweak. The audio update landed in a larger bundle:

  • Search (AI Mode) — Now speaks five more languages (Hindi, Indonesian, Japanese, Korean, Brazilian Portuguese), widening access to Gemini-powered, conversational search in non-English markets. That’s a notable step for people who prefer to ask complex, follow-up questions in their native tongues.
  • NotebookLM — Think of it as a “research assistant that writes” — you can now ask NotebookLM to output finished artifacts (blog posts, study guides, briefings, quizzes, flashcards) from your uploads and set tone/structure. Google says the expanded report tools will appear in over 80 languages. That makes NotebookLM less of a note-taking toy and more of a content production/workflow tool.

A quick note on privacy and memory

This summer, Google also pushed Gemini toward more persistent memory — letting the assistant remember preferences and details from prior conversations by default in some versions — and introduced controls (opt-outs, temporary chats) so users could limit what’s stored. If you plan to upload sensitive interviews or private meetings, be sure to check your Gemini app’s Personal Context / Keep Activity settings before you hand over recordings. Google has positioned those controls as the trade-off for a more personal assistant; the defaults and opt-outs matter.

What this means for competing workflows

Tools that previously sat between audio and action — standalone transcribers, separate summarizers, or manual workflows — now compete with an integrated path inside Google’s ecosystem. That’s convenient, but it also nudges more content into Google’s services (where it can be used to improve models, depending on your settings). For organizations that need strict chain-of-custody or certified transcripts, dedicated providers will still be the safer bet; for fast summaries, ideation, and classroom use, this is a major timesaver.

Takeaway

This update is less about headline-grabbing new model benchmarks and more about smoothing a real workflow: audio → insight. By folding audio into the same Gemini canvas you use for text and images, Google is nudging the assistant toward being an honest-to-God everyday tool — not just a research demo. The trade-offs are familiar: convenience versus control. If you’re curious, try a short audio clip first (free users are intentionally capped at 10 minutes), poke the app’s privacy settings, and see whether Gemini’s summaries match what you need.


Discover more from GadgetBond

Subscribe to get the latest posts sent to your email.

Topic:Gemini AI (formerly Bard)
Most Popular

DJI’s FC200 and T200 drones push industrial delivery and agriculture into the 200kg era

DJI Osmo Mobile 8P debuts with detachable remote and smarter tracking

ChatGPT for Clinicians is now free for verified US doctors

GoPro Mission 1 series is powerful, pricey, and not for casual users

Opera GX Playground bundles panic button, Fake My History and Grass Touching Corner

Also Read
Tesla humanoid robot Optimus standing outdoors near a building entrance, raising one hand in a waving gesture. The robot has a sleek black-and-gold design with a reflective black face panel and “TESLA” branding on its chest. Part of a Tesla Cybercab vehicle is visible in the foreground, with trees, landscaping, and people walking in the background.

Elon Musk blames copycats for delayed Tesla Optimus reveal

Abstract 3D composition of colorful geometric shapes balanced on a horizontal red beam against a black background. The arrangement includes a blue half-sphere, a red half-bowl shape, an orange cube, a green rectangular block, a blue trapezoid, a yellow sphere, and a red triangular prism, creating a minimalist modern design.

Decoupled DiLoCo brings chaos-resilient AI pre-training to Google’s global fleet

Promotional poster for Apple TV series “Star City” featuring a close-up of a person’s face partially revealed through a torn paper-like red and white graphic on a dark background. The Apple TV logo appears above the bold white title “STAR CITY” on the right side, creating a dramatic sci-fi thriller visual style.

Apple TV shares Star City trailer previewing its next premium sci-fi drama after For All Mankind

Anthropic

Investors chase Anthropic as its secondary value tops $1 trillion

ChatGPT Workspace Agents Library

OpenAI’s new workspace agents let ChatGPT run end-to-end team processes

Claude Cowork logo and text on a light grey background, featuring a coral-colored starburst icon next to the product name in black serif font.

Anthropic adds interactive charts and diagrams to Claude Cowork

Screenshot of an AI chat interface showing the model selection dropdown menu open. “Kimi K2.6 Thinking” is selected at the top, with options including Best, Kimi K2.6 (marked New), Claude Sonnet 4.6, Claude Opus 4.7 (marked Max), and Nemotron 3 Super. A tooltip on the right says “Moonshot AI’s latest model,” highlighting Kimi K2.6.

Perplexity Pro and Max just got Kimi K2.6 support

Kimi K2.6 hero image

Kimi K2.6 is Moonshot’s new engine for autonomous coding and research

Company Info
  • Homepage
  • Support my work
  • Latest stories
  • Company updates
  • GDB Recommends
  • Daily newsletters
  • About us
  • Contact us
  • Write for us
  • Editorial guidelines
Legal
  • Privacy Policy
  • Cookies Policy
  • Terms & Conditions
  • DMCA
  • Disclaimer
  • Accessibility Policy
  • Security Policy
  • Do Not Sell or Share My Personal Information
Socials
Follow US

Disclosure: We love the products we feature and hope you’ll love them too. If you purchase through a link on our site, we may receive compensation at no additional cost to you. Read our ethics statement. Please note that pricing and availability are subject to change.

Copyright © 2026 GadgetBond. All Rights Reserved. Use of this site constitutes acceptance of our Terms of Use and Privacy Policy | Do Not Sell/Share My Personal Information.