Google upgrades text-to-speech with native audio output

Onstage, Google announced new text-to-speech previews that allow developers to take advantage of “native audio output” for improved customization. Google says that native audio output, driven by its latest Gemini models, enables more expressive, natural speech — voices that capture subtle nuances and that can seamlessly switch to a whisper.

Native audio output works in over 24 languages and can change languages on the fly, according to Google. It’s available in the Gemini API starting today.

May 20, 2025 – May 20, 2025

From the Storyline: Google I/O 2025 live coverage: Google AI Ultra, Project Mariner, Gemini app updates, and more

Google I/O, Google’s biggest developer conference of the year, is here. I/O will showcase product announcements from across Google’s portfolio….

Latest in AI