AudioGooglevflash· Released 2025-06-01

Gemini Flash TTS

Fast, affordable multi-speaker dialogue generation — cheapest TTS on Oakgen.

Gemini Flash TTS is Google's fast, affordable text-to-speech model optimized for multi-speaker dialogue. It supports up to 2 speakers and style presets at the lowest TTS pricing on Oakgen. A 1-minute narration costs about 3 credits (~$0.01).

Capabilities at a glance

  • Cheapest TTS on Oakgen at ~$0.01 per minute
  • Multi-speaker dialogue with up to 2 voices
  • Style presets for different delivery types
  • Fast generation (2–4 seconds)
  • Gemini Pro variant for higher quality

Specs

Starting price
$0.01 / generation
Generation time
2–4 seconds
Max resolution
44.1 kHz stereo
Inputs → outputs
textaudio

How to use Gemini Flash TTS

  1. 1
    Use for podcast-style dialogue
    Assign two speakers to alternate lines — Gemini Flash TTS handles natural turn-taking and pacing between voices.
  2. 2
    Apply style presets
    Choose from presets like 'news', 'casual', 'narrative' to set the overall delivery style.

API access

curl -X POST https://api.oakgen.ai/v1/generate/speech \
  -H "Authorization: Bearer $OAKGEN_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gemini-flash-tts",
    "text": "Welcome to our weekly tech roundup.",
    "style": "news"
  }'

Compared to other models

vs. minimax-speech-hdcost

Gemini Flash TTS is ~50% cheaper than MiniMax Speech HD. MiniMax wins on voice cloning and language support (29 languages vs Gemini's presets).

License & commercial use

Licensed through Google's commercial terms.

Permitted on all paid Oakgen plans.

FAQs

How much does Gemini Flash TTS cost on Oakgen?
Gemini Flash TTS starts at $0.01 per generation on Oakgen. Most generations complete in 2–4 seconds. The $19/month Pro plan includes 5,000 credits, covering roughly 1923 generations per month.
Can I use Gemini Flash TTS commercially?
Permitted on all paid Oakgen plans.
What is the maximum output resolution?
Gemini Flash TTS supports up to 44.1 kHz stereo.
Does Oakgen provide API access to Gemini Flash TTS?
Yes. Oakgen's REST API exposes Gemini Flash TTS under the model slug 'gemini-flash-tts'. See the API snippet below for an example request.

Related models

Gemini Flash TTS on Oakgen — Affordable Multi-Speaker TTS | Oakgen.ai