GadgetBond

  • Latest
  • How-to
  • Tech
    • AI
    • Amazon
    • Apple
    • CES
    • Computing
    • Creators
    • Google
    • Meta
    • Microsoft
    • Mobile
    • Samsung
    • Security
    • Xbox
  • Transportation
    • Audi
    • BMW
    • Cadillac
    • E-Bike
    • Ferrari
    • Ford
    • Honda Prelude
    • Lamborghini
    • McLaren W1
    • Mercedes
    • Porsche
    • Rivian
    • Tesla
  • Culture
    • Apple TV
    • Disney
    • Gaming
    • Hulu
    • Marvel
    • HBO Max
    • Netflix
    • Paramount
    • SHOWTIME
    • Star Wars
    • Streaming
Add GadgetBond as a preferred source to see more of our stories on Google.
Font ResizerAa
GadgetBondGadgetBond
  • Latest
  • Tech
  • AI
  • Deals
  • How-to
  • Apps
  • Mobile
  • Gaming
  • Streaming
  • Transportation
Search
  • Latest
  • Deals
  • How-to
  • Tech
    • Amazon
    • Apple
    • CES
    • Computing
    • Creators
    • Google
    • Meta
    • Microsoft
    • Mobile
    • Samsung
    • Security
    • Xbox
  • AI
    • Anthropic
    • ChatGPT
    • ChatGPT Atlas
    • Gemini AI (formerly Bard)
    • Google DeepMind
    • Grok AI
    • Meta AI
    • Microsoft Copilot
    • OpenAI
    • Perplexity
    • xAI
  • Transportation
    • Audi
    • BMW
    • Cadillac
    • E-Bike
    • Ferrari
    • Ford
    • Honda Prelude
    • Lamborghini
    • McLaren W1
    • Mercedes
    • Porsche
    • Rivian
    • Tesla
  • Culture
    • Apple TV
    • Disney
    • Gaming
    • Hulu
    • Marvel
    • HBO Max
    • Netflix
    • Paramount
    • SHOWTIME
    • Star Wars
    • Streaming
Follow US
AIGoogleTech

Gemini app audio support allows uploads up to three hours

The Gemini app now lets users upload audio files up to three hours in length while free accounts are capped at ten minutes and five daily prompts.

By
Shubham Sawarkar
Shubham Sawarkar's avatar
ByShubham Sawarkar
Editor-in-Chief
I’m a tech enthusiast who loves exploring gadgets, trends, and innovations. With certifications in CISCO Routing & Switching and Windows Server Administration, I bring a sharp...
Follow:
- Editor-in-Chief
Sep 8, 2025, 3:30 PM EDT
Share
Google Gemini AI. The image shows the word "Gemini" written in a modern, sans-serif font on a black background. The letters "G" and "e" are in a gradient blue color, while the letters "m," "i," "n," and "i" transition from a light blue to a light beige color. Above the second "i" in "Gemini," there is a stylized star or sparkle symbol, adding a celestial or futuristic touch to the design.
Image: Google
SHARE

Google spent much of the past year stuffing new powers into Gemini — its multimodal AI stack — and on Monday it added one of the most practical: you can now drop audio files into the Gemini app and ask the model to listen, transcribe, summarize, and pull out the bits that matter. It’s the kind of quality-of-life change that looks small on a keynote slide but can transform how people turn conversations into work.

What’s new

  • The Gemini app now accepts uploaded audio files across web and mobile.
  • Free users can upload up to 10 minutes of audio and get five prompts per day; paid tiers (AI Pro / AI Ultra) increase that allowance — Google lists uploads of up to three hours for paid subscribers. All users can attach up to 10 files per prompt, including ZIP archives.
  • At the same time, Google expanded Search (AI Mode) to five additional languages — Hindi, Indonesian, Japanese, Korean and Brazilian Portuguese — thanks to the integration of Gemini 2.5.
  • NotebookLM, Google’s document-driven research tool, got new report styles — study guides, blog-post formats, briefing docs, and even flashcards and quizzes — in over 80 languages. The feature will let you shape tone and structure and generate study/teaching materials from uploaded files.

Why this matters

If you’ve ever ended a meeting with a tangle of voice notes, a stack of lecture recordings, or interview files and promised yourself you’ll “go back and summarize,” this removes a friction point. Instead of exporting audio, firing up a separate transcription service, and pasting a transcript into a prompt, you can hand Gemini the file and ask it to do everything in one place: identify speakers, summarize action items, produce a short excerpt for social, or generate study flashcards. That’s useful for students, podcasters, reporters, researchers, and busy product teams.

The details (limits, formats, and the fine print)

Google’s official help pages and coverage of the rollout confirm the tiered limits: free accounts have modest caps (10 minutes total audio, five daily prompts), while AI Pro and AI Ultra subscribers get significantly more headroom — up to three hours of audio uploads and priority access to features. The app supports multiple audio file formats and accepts up to 10 files in a single prompt (including ZIPs), which makes batch uploads possible.

Android Central and other hands-on pieces add practical color: the app transcribes and highlights key moments, and surfaces speaker separation and actionable insights — the things people actually want from audio analysis. That’s also why Google’s Josh Woodward (VP of Google Labs and Gemini) framed audio support as the “#1 request.”

What Google shipped alongside audio

This isn’t an isolated tweak. The audio update landed in a larger bundle:

  • Search (AI Mode) — Now speaks five more languages (Hindi, Indonesian, Japanese, Korean, Brazilian Portuguese), widening access to Gemini-powered, conversational search in non-English markets. That’s a notable step for people who prefer to ask complex, follow-up questions in their native tongues.
  • NotebookLM — Think of it as a “research assistant that writes” — you can now ask NotebookLM to output finished artifacts (blog posts, study guides, briefings, quizzes, flashcards) from your uploads and set tone/structure. Google says the expanded report tools will appear in over 80 languages. That makes NotebookLM less of a note-taking toy and more of a content production/workflow tool.

A quick note on privacy and memory

This summer, Google also pushed Gemini toward more persistent memory — letting the assistant remember preferences and details from prior conversations by default in some versions — and introduced controls (opt-outs, temporary chats) so users could limit what’s stored. If you plan to upload sensitive interviews or private meetings, be sure to check your Gemini app’s Personal Context / Keep Activity settings before you hand over recordings. Google has positioned those controls as the trade-off for a more personal assistant; the defaults and opt-outs matter.

What this means for competing workflows

Tools that previously sat between audio and action — standalone transcribers, separate summarizers, or manual workflows — now compete with an integrated path inside Google’s ecosystem. That’s convenient, but it also nudges more content into Google’s services (where it can be used to improve models, depending on your settings). For organizations that need strict chain-of-custody or certified transcripts, dedicated providers will still be the safer bet; for fast summaries, ideation, and classroom use, this is a major timesaver.

Takeaway

This update is less about headline-grabbing new model benchmarks and more about smoothing a real workflow: audio → insight. By folding audio into the same Gemini canvas you use for text and images, Google is nudging the assistant toward being an honest-to-God everyday tool — not just a research demo. The trade-offs are familiar: convenience versus control. If you’re curious, try a short audio clip first (free users are intentionally capped at 10 minutes), poke the app’s privacy settings, and see whether Gemini’s summaries match what you need.


Discover more from GadgetBond

Subscribe to get the latest posts sent to your email.

Topic:Gemini AI (formerly Bard)
Most Popular

OpenAI expands GPT-Rosalind access with new Rosalind Biodefense program

Codex computer use comes to Windows, with mobile in the loop

Anthropic raises $65 billion, nears trillion-dollar status

Claude Opus 4.8 now powers Perplexity Max and Computer

Qualcomm’s new Snapdragon C is the budget laptop chip nobody knew they were waiting for

Also Read
Grocery, gardening, and household items from a Walmart delivery are arranged on a front doorstep outside a brick home. A blue Walmart shopping bag, a bag of Miracle-Gro potting mix, bread, and potted flowers sit on a welcome mat, surrounded by decorative planters and colorful blooming plants near a wooden front door.

Walmart’s 30-minute delivery is now live in 33 U.S. cities

Acer Aspire Go 15 (AG15-Q31P) powered by Qualcomm Snapdragon C chip

Acer Aspire Go 15 is the first laptop ever built on Qualcomm’s new Snapdragon C chip

Acer Swift Spin 14 AI (SFSP14-Q51T) laptop

Acer’s Swift Spin 14 AI is the convertible laptop that finally gets Snapdragon right

Split-panel graphic featuring a torn sheet of grid paper with black hand-drawn scribbles on a light blue background on the left, and a minimalist illustration of an open hand holding a connected node network symbol on a terracotta-orange background on the right, representing creativity, ideas, and collaborative intelligence.

Claude Opus 4.8 launches with sharper judgment and new controls

Minimal hand-drawn illustration of a hanging presentation screen displaying a coding symbol (“”), suspended above a stylized script-like “pm” mark on a solid terracotta-orange background, representing programming, development workflows, or coding education.

Claude Code now orchestrates its own dynamic workflows

Perplexity and Microsoft logos displayed side by side against a night sky with circular star trails above a dark mountain landscape, symbolizing a partnership or collaboration between the two companies.

Perplexity Computer now works natively in Microsoft’s core productivity apps

Minimal flat illustration of code review: an orange background with two large black curly braces framing the center, where a white octagonal icon containing a simple code symbol “” is examined by a black magnifying glass.

Anthropic’s security-guidance plugin makes Claude Code less reckless

Perplexity illustration. The image depicts a dark, abstract interior space with vertical columns and beams of light streaming through, creating a play of shadows and light. In the center, there is a white geometric Perplexity logo resembling a stylized star or snowflake. The light beams display a spectrum of colors, adding a surreal and intriguing atmosphere to the scene.

Perplexity open-sources its blazing-fast Unigram tokenizer

Company Info
  • Homepage
  • Support my work
  • Latest stories
  • Company updates
  • GDB Recommends
  • Daily newsletters
  • About us
  • Contact us
  • Write for us
  • Editorial guidelines
Legal
  • Privacy Policy
  • Cookies Policy
  • Terms & Conditions
  • DMCA
  • Disclaimer
  • Accessibility Policy
  • Security Policy
  • Do Not Sell or Share My Personal Information
Socials
Follow US

Disclosure: We love the products we feature and hope you’ll love them too. If you purchase through a link on our site, we may receive compensation at no additional cost to you. Read our ethics statement. Please note that pricing and availability are subject to change.

Copyright © 2026 GadgetBond. All Rights Reserved. Use of this site constitutes acceptance of our Terms of Use and Privacy Policy | Do Not Sell/Share My Personal Information.