Gemini TTS
Google's Gemini TTS converts text to realistic audio. 30 voice presets, multi-speaker synthesis (up to 10 speakers), 24+ languages, and inline style markers for expressive control.
By Google
Pricing: $10 per 1m-tokens
Overview
Google's Gemini TTS converts text to realistic audio. 30 voice presets, multi-speaker synthesis (up to 10 speakers), 24+ languages, and inline style markers for expressive control.
Frequently asked questions
- What is Gemini TTS?
- Gemini TTS by Google, available via Runflow's unified API.
- How much does Gemini TTS cost on Runflow?
- Contact us for current pricing. Runflow offers predictable, usage-based pricing with no GPU management fees.
- Do I need to manage GPUs?
- No. Runflow handles all infrastructure, including GPU provisioning, scaling, and failover.
Related models
- ElevenLabs TTS v3, Generate text-to-speech audio using Eleven-v3 from ElevenLabs.
Discoverable surfaces
- Dispatch endpoint:
POST https://api.runflow.io/v1/models/gemini-tts/runs - Per-model spec (markdown): https://app.runflow.io/models/gemini-tts/llms.txt
- Docs page: https://docs.runflow.io/models/gemini-tts
- Public OpenAPI spec: https://docs.runflow.io/api/openapi.public.json
- Agent skill (start here): https://www.runflow.io/.well-known/agent-skills/runflow/SKILL.md