Gemini 3.1 Flash TTS – with directed prompts

(simonwillison.net)

17 points | by aanet 2 days ago ago

5 comments

  • roscas 2 days ago ago

    Hope they release an offline model for Ollama, a small one easy to work with for TTS in other languages.

  • Insensitivity 2 days ago ago

    No matter what I wrote in the audio profile, AI Studio never followed it, regardless of scene or context.

    For example, I tried to get a male voice and kept getting female ones. Not sure if it's an AI Studio bug or I was doing something wrong.

    • voxic11 2 days ago ago

      voice is determined by the voice parameter, you can't control it via the prompt, the prompt only directs how the chosen voices delivers the lines.

  • aanet 2 days ago ago

    The 3 examples, with three distinct styles, are fascinating.

    I'd like to see one with cockney accent, just for lulz