Models
April 16, 2026

Gemini 3.1 Flash TTS brings expressive AI speech to developers

Google introduces Gemini 3.1 Flash TTS, enabling highly controllable, expressive AI speech with audio tags, multilingual support, and enterprise-ready deployment via Google AI Studio and Vertex AI.

Google Cloud has launched Gemini 3.1 Flash TTS, a next-generation text-to-speech model designed for high-quality, expressive, and controllable AI audio. The model supports over 70 languages and offers more than 200 audio tags, allowing developers to fine-tune tone, pacing, and emotion using natural language prompts.

It also includes 30+ prebuilt voices and enables detailed customization of accents and speaking styles.

Available through Google AI Studio and Vertex AI, the model is built for scalable enterprise use cases such as accessibility tools, audiobooks, gaming, and customer interactions. Additionally, SynthID watermarking helps identify AI-generated audio, improving transparency and trust.

#
Google

Read Our Content

See All Blogs
AI system implementation

Rogue Agent Impact Visualizer

Sarankumar S

May 28, 2026
Read more
AI system implementation

Reinforcement learning for LLMs: SDAR's for multi-turn agent training

Deveshi Dabbawala

May 21, 2026
Read more