Remember the days when a company-wide update meant scrolling through an agonizingly long email or sitting through a dry, static slide deck? Those days are rapidly fading in the rearview mirror, and Google is pressing hard on the accelerator.
Since its introduction, Google Vids has been quietly reshaping how teams communicate, turning everyday workers into amateur video producers without requiring a degree in film editing. Now, according to a recent update from the Google Workspace team, the platform is getting a massive injection of new capabilities that push its AI-generated avatars out of the uncanny valley and right into the director’s chair.
At the heart of this update is the integration of Gemini 3.1 Flash Text-To-Speech (TTS) and the much-anticipated Veo 3.1 video generation model. If you’ve toyed around with earlier iterations of AI avatars, you know they could sometimes feel a bit stiff—like animatronics doing a corporate read. This new underlying architecture changes the game, making the digital spokespeople significantly more realistic, conversational, and expressive.
Google is rolling out 53 default preset avatars, a massive jump from the previous 23. But what’s particularly interesting isn’t just the sheer number; it’s the stylistic variety. Alongside photorealistic digital humans, users can now choose from 3D cartoon and graphic novel styles. Whether you need a highly professional avatar named Eleanor to walk through Q3 financials, or a more stylized, animated character like Finley for a lighthearted onboarding video, the platform is giving creators the flexibility to match the medium to the message.
But a video is only as good as its reach, which brings us to one of the most practical upgrades in this rollout: language accessibility. Google has added support for 16 new languages, bringing the total up to 24. For multinational companies, this is a massive friction-reducer. You can generate a training video in English, and seamlessly create localized versions in Hindi, Telugu, Arabic, or Polish with the same expressive vocal delivery, all without hiring a global team of voice actors.
The most fascinating part of the update, however, is what Google is doing with custom avatars. Through a new integration with design tool Nano Banana Pro, users can now build highly personalized digital spokespeople and pair them with over 30 steering-controlled voices powered by Gemini Audio.
In the past, you were mostly stuck having your custom avatar sit or stand there, reading a script. Now, you’re basically acting as a director. By simply typing a text prompt, you can instruct your custom avatar to walk across the screen, use hand gestures, or interact with objects. You can even upload image references to drop them into customized locations or surround them with your company’s specific branding. The AI preserves your custom character’s exact appearance and voice across different generations, so your bespoke brand ambassador stays consistent from video to video.
Ultimately, this update underscores a massive shift in how we work. We are moving from an era where video production was a specialized, expensive endeavor siloed in marketing departments, to an era where rich, directed, multi-lingual video content can be generated by anyone with a keyboard and a Workspace account. As these tools become more fluid and less robotic, the standard for what we consider “acceptable” corporate communication is going to rise—and the standard email blast is going to look a whole lot more boring in comparison.
Discover more from GadgetBond
Subscribe to get the latest posts sent to your email.