This section lists models for generating speech from text, along with their pricing structures.

Available Speech Generation Models

Model NamePrompt LengthCost per Media
Kokoro 82M4K tokens$0.00038 per token