AudioGooglevflash· Released 2025-06-01
Gemini Flash TTS
Fast, affordable multi-speaker dialogue generation — cheapest TTS on Oakgen.
Gemini Flash TTS is Google's fast, affordable text-to-speech model optimized for multi-speaker dialogue. It supports up to 2 speakers and style presets at the lowest TTS pricing on Oakgen. A 1-minute narration costs about 3 credits (~$0.01).
Capabilities at a glance
- Cheapest TTS on Oakgen at ~$0.01 per minute
- Multi-speaker dialogue with up to 2 voices
- Style presets for different delivery types
- Fast generation (2–4 seconds)
- Gemini Pro variant for higher quality
Specs
- Starting price
- $0.01 / generation
- Generation time
- 2–4 seconds
- Max resolution
- 44.1 kHz stereo
- Inputs → outputs
- text → audio
How to use Gemini Flash TTS
- 1Use for podcast-style dialogueAssign two speakers to alternate lines — Gemini Flash TTS handles natural turn-taking and pacing between voices.
- 2Apply style presetsChoose from presets like 'news', 'casual', 'narrative' to set the overall delivery style.
API access
curl -X POST https://api.oakgen.ai/v1/generate/speech \
-H "Authorization: Bearer $OAKGEN_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "gemini-flash-tts",
"text": "Welcome to our weekly tech roundup.",
"style": "news"
}'Compared to other models
vs. minimax-speech-hd — cost
Gemini Flash TTS is ~50% cheaper than MiniMax Speech HD. MiniMax wins on voice cloning and language support (29 languages vs Gemini's presets).
License & commercial use
Licensed through Google's commercial terms.
Permitted on all paid Oakgen plans.
FAQs
How much does Gemini Flash TTS cost on Oakgen?
Gemini Flash TTS starts at $0.01 per generation on Oakgen. Most generations complete in 2–4 seconds. The $19/month Pro plan includes 5,000 credits, covering roughly 1923 generations per month.
Can I use Gemini Flash TTS commercially?
Permitted on all paid Oakgen plans.
What is the maximum output resolution?
Gemini Flash TTS supports up to 44.1 kHz stereo.
Does Oakgen provide API access to Gemini Flash TTS?
Yes. Oakgen's REST API exposes Gemini Flash TTS under the model slug 'gemini-flash-tts'. See the API snippet below for an example request.