Streaming Gemini 3.1's expressive new TTS model in Java
Google just released Gemini 3.1 Flash Text-to-Speech (TTS), a new expressive TTS model that you can steer with audio tags and scene descriptions.
I wanted to see how it worked with the Gemini Interactions SDK for Java.
Expressive control
The model sounds natural out of the box, but its real strength is how much you can direct the delivery. By defining “Audio Profiles”, “Scene Details”, and “Director’s Notes” in your prompt, you can shape the character’s pacing, tone, and environment.
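To make the idea concrete, here is a minimal sketch of how such a prompt might be assembled in plain Java. It only builds the prompt text and calls no SDK. The class name `TtsPromptSketch`, the `buildPrompt` helper, the `#` section headers, and the `# Transcript` label are all assumptions made for illustration, not a format the model is documented to require.

```java
import java.util.List;

// Hypothetical helper: assembles a steering prompt as plain text.
// The section layout below is an assumed convention, not an official format.
final class TtsPromptSketch {

    static String buildPrompt(String audioProfile,
                              String sceneDetails,
                              List<String> directorsNotes,
                              String transcript) {
        StringBuilder prompt = new StringBuilder();

        // Who is speaking: voice character, age, accent, and so on.
        prompt.append("# Audio Profile\n").append(audioProfile).append("\n\n");

        // Where the speech takes place.
        prompt.append("# Scene Details\n").append(sceneDetails).append("\n\n");

        // How the lines should be delivered: one bullet per note.
        prompt.append("# Director's Notes\n");
        for (String note : directorsNotes) {
            prompt.append("- ").append(note).append('\n');
        }

        // The text to be spoken comes last.
        prompt.append("\n# Transcript\n").append(transcript);
        return prompt.toString();
    }

    public static void main(String[] args) {
        String prompt = buildPrompt(
                "Warm narrator",
                "Quiet library at night",
                List.of("Slow pacing", "Hushed tone"),
                "Welcome back.");
        System.out.println(prompt);
    }
}
```

The resulting string would then be passed as the text input of whatever TTS request the SDK exposes. Keeping the prompt assembly separate from the API call makes it easy to experiment with different profiles and notes.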