Models
April 16, 2026

Gemini 3.1 Flash TTS brings expressive AI speech to developers

Google introduces Gemini 3.1 Flash TTS, enabling highly controllable, expressive AI speech with audio tags, multilingual support, and enterprise-ready deployment via Google AI Studio and Vertex AI.

Google Cloud has launched Gemini 3.1 Flash TTS, a next-generation text-to-speech model designed for high-quality, expressive, and controllable AI audio. The model supports over 70 languages and offers more than 200 audio tags, allowing developers to fine-tune tone, pacing, and emotion using natural language prompts.

It also includes 30+ prebuilt voices and enables detailed customization of accents and speaking styles.

Available through Google AI Studio and Vertex AI, the model is built for scalable enterprise use cases such as accessibility tools, audiobooks, gaming, and customer interactions. Additionally, SynthID watermarking helps identify AI-generated audio, improving transparency and trust.

#
Google

Read Our Content

See All Blogs
LLM Models

Open Weight Models: The GoML Point of View

Rishabh Sood

April 21, 2026
Read more
Gen AI

How 700 million users are redefining AI adoption trends through ChatGPT

Deveshi Dabbawala

April 20, 2026
Read more