By using this site, you agree to the Privacy Policy and Terms of Use.
Accept

GadgetBond

  • Latest
  • How-to
  • Tech
    • AI
    • Amazon
    • Apple
    • CES
    • Computing
    • Creators
    • Google
    • Meta
    • Microsoft
    • Mobile
    • Samsung
    • Security
    • Xbox
  • Transportation
    • Audi
    • BMW
    • Cadillac
    • E-Bike
    • Ferrari
    • Ford
    • Honda Prelude
    • Lamborghini
    • McLaren W1
    • Mercedes
    • Porsche
    • Rivian
    • Tesla
  • Culture
    • Apple TV
    • Disney
    • Gaming
    • Hulu
    • Marvel
    • HBO Max
    • Netflix
    • Paramount
    • SHOWTIME
    • Star Wars
    • Streaming
Add GadgetBond as a preferred source to see more of our stories on Google.
Font ResizerAa
GadgetBondGadgetBond
  • Latest
  • Tech
  • AI
  • Deals
  • How-to
  • Apps
  • Mobile
  • Gaming
  • Streaming
  • Transportation
Search
  • Latest
  • Deals
  • How-to
  • Tech
    • Amazon
    • Apple
    • CES
    • Computing
    • Creators
    • Google
    • Meta
    • Microsoft
    • Mobile
    • Samsung
    • Security
    • Xbox
  • AI
    • Anthropic
    • ChatGPT
    • ChatGPT Atlas
    • Gemini AI (formerly Bard)
    • Google DeepMind
    • Grok AI
    • Meta AI
    • Microsoft Copilot
    • OpenAI
    • Perplexity
    • xAI
  • Transportation
    • Audi
    • BMW
    • Cadillac
    • E-Bike
    • Ferrari
    • Ford
    • Honda Prelude
    • Lamborghini
    • McLaren W1
    • Mercedes
    • Porsche
    • Rivian
    • Tesla
  • Culture
    • Apple TV
    • Disney
    • Gaming
    • Hulu
    • Marvel
    • HBO Max
    • Netflix
    • Paramount
    • SHOWTIME
    • Star Wars
    • Streaming
Follow US
AIMicrosoftTech

Edge Copilot video summaries rely on subtitles, transcripts

Edge Copilot struggles to summarize raw YouTube videos without subtitles or transcripts. Its AI relies on text, not actual video comprehension.

By
Shubham Sawarkar
Shubham Sawarkar's avatar
ByShubham Sawarkar
Editor-in-Chief
I’m a tech enthusiast who loves exploring gadgets, trends, and innovations. With certifications in CISCO Routing & Switching and Windows Server Administration, I bring a sharp...
Follow:
- Editor-in-Chief
Dec 8, 2023, 9:25 PM EST
Share
We may get a commission from retail offers. Learn more
Microsoft Edge Copilot video summaries rely on subtitles, transcripts
Image: Microsoft
SHARE

When Microsoft unveiled its new AI helper called Edge Copilot for its Edge web browser this week, one feature that grabbed headlines was the ability to automatically generate text summaries of online videos. However, as impressive as this capability may sound at first, there are some key caveats to what Edge Copilot can accomplish.

As explained by Mikhail Parakhin, Microsoft’s CEO of advertising and web services, for Edge Copilot to summarize a video, that video first needs to either have subtitles or needs to have been “pre-processed” by Microsoft’s AI systems ahead of time. The assistant cannot watch and comprehend raw video in real-time the way a human can.

“In order for it to work, we need to pre-process the video. If the video has subtitles – we can always fallback on that, if it does not and we didn’t preprocess it yet – then it won’t work,” Parakhin wrote in response to questions about the feature.

In essence, rather than truly summarizing video content, what Edge Copilot does is summarize the text transcript of a video, whether that transcript was added manually via subtitles or auto-generated by Microsoft’s speech recognition software. So while the result to the user may appear like an AI-powered video summary, the underlying technique is more text-based than video-based in nature.

This nuance became apparent when designer Pietro Schirano posted a demonstration of Edge Copilot summarizing a YouTube video about the trailer for the forthcoming Grand Theft Auto VI video game. While Copilot quickly generated a coherent text summary, in this case, the video already included both machine-generated subtitles from YouTube as well as a user-created transcript. It was unclear whether Copilot could have achieved the same feat with a video lacking subtitles.

When asked whether Edge Copilot could summarize most publicly available YouTube videos without pre-processing, Parakhin’s response suggested that while it may work on many videos, performance would be unreliable compared to videos containing subtitles. “Should work for most videos,” he stated tentatively.

The subtleties around Edge Copilot’s video summarization capabilities underscore how AI systems that may seem intelligent on the surface can still have significant underlying constraints in terms of the data they require. It also highlights the machine learning arms race unfolding between Microsoft and leading rivals like Google. Just last month, Google announced enhancements to YouTube summarization in its own AI chatbot called Bard.

As for Edge Copilot, Parakhin readily admits the tool remains a work in progress, posting from an airplane this week that the team continues “adding ability for Edge Copilot to use information in videos.” So while Copilot’s video smarts face limitations today, Microsoft is invested in enhancing them over time. For now, though, viewers hoping to leverage AI for digesting video content may need to lower their expectations around what Copilot can realistically deliver absent manually added subtitles.


Discover more from GadgetBond

Subscribe to get the latest posts sent to your email.

Topic:Microsoft Edge browser
Leave a Comment

Leave a ReplyCancel reply

Most Popular

Gemini 3.1 Flash TTS is Google’s new powerhouse text-to-speech model

Google app for desktop rolls out globally on Windows

Google debuts Gemini app for Mac with instant shortcut access

Google Chrome’s new Skills feature makes AI workflows one tap away

Anthropic’s revamped Claude Code desktop app is all about parallel coding workflows

Also Read
OpenAI Codex app logo featuring a stylized terminal symbol inside a cloud icon on a blue and purple gradient background, with the word “Codex” displayed below.

Codex desktop app now handles nearly your whole stack

A graphic design featuring the text “GPT Rosalind” in bold black letters on a light green background. Behind the text are overlapping translucent green rectangles. In the bottom left corner, part of a chemical structure diagram is visible with labels such as “CH₃,” “CH₂,” “H,” “N,” and the Roman numeral “II.” The right side of the background shows a blurred turquoise and green abstract pattern, evoking a scientific or natural theme.

OpenAI launches GPT-Rosalind to accelerate biopharma research

Perplexity interface showing a model selection menu with options for advanced AI models. The default choice, “Claude Opus 4.7 Thinking,” is highlighted as a powerful model for complex tasks. Other options include “GPT-5.4 New” for complex tasks and “Claude Sonnet 4.6” for everyday tasks using fewer credits. A toggle for “Thinking” is switched on, and a tooltip on the right reads “Computer powered by Claude 4.7 Opus.”

Perplexity Max users now get Claude Opus 4.7 in Computer by default

Anthropic brand illustration divided into two halves: On the left, an orange-coral background displays a stylized network or molecule diagram with white circular nodes connected by white lines, enclosed within a black wavy border outline representing a head or mind. On the right, a light teal background features an abstract line drawing of a figure or person with curved black lines and black dots, sketched over a white grid on transparent checkered background, suggesting data points and analytical thinking. The composition symbolizes the intersection of artificial intelligence and human cognition.

Claude Opus 4.7 is Anthropic’s new powerhouse for serious software work

Illustration of Claude Code routines concept: An orange-coral background with a stylized design featuring two black curly braces (code brackets) flanking a white speech bubble containing a handwritten lowercase 'u' symbol. The image represents code execution and automated routines within Claude Code.

Anthropic gives Claude Code cloud routines that work while you sleep

Gemini interface showing a NEET Mock Exam Practice Session. On the left side, a chat message from the user says 'I want to take a NEET mock exam.' Below it is Gemini's response explaining a complete NEET mock exam designed to test concepts in Physics, Chemistry, and Biology, with a 'Show thinking' option expanded. The response includes an embedded card for 'NEET UG Practice Test' dated Apr 11, 7:10 PM, with options to 'Try again without interactive quiz' and encouragement message. On the right side is a panel titled 'NEET UG Practice Test' displaying three subject sections: Physics (45 Questions with a yellow icon and blue Start button), Chemistry (45 Questions with a purple icon and blue Start button), and Biology (90 Questions with a green icon). Each section includes a brief description of question topics covered.

Google Gemini now lets you take full NEET mock exams for free

AI Mode in Chrome showing AI-powered shopping assistant panel alongside a Ninja coffee machine product page with pricing and details

Chrome’s AI Mode puts search and pages side by side

Google Gemini AI

Google Gemini can now craft images from your personal photos

Company Info
  • Homepage
  • Support my work
  • Latest stories
  • Company updates
  • GDB Recommends
  • Daily newsletters
  • About us
  • Contact us
  • Write for us
  • Editorial guidelines
Legal
  • Privacy Policy
  • Cookies Policy
  • Terms & Conditions
  • DMCA
  • Disclaimer
  • Accessibility Policy
  • Security Policy
  • Do Not Sell or Share My Personal Information
Socials
Follow US

Disclosure: We love the products we feature and hope you’ll love them too. If you purchase through a link on our site, we may receive compensation at no additional cost to you. Read our ethics statement. Please note that pricing and availability are subject to change.

Copyright © 2026 GadgetBond. All Rights Reserved. Use of this site constitutes acceptance of our Terms of Use and Privacy Policy | Do Not Sell/Share My Personal Information.