AI Voice & Text-to-Speech Models
Compare 11 text-to-speech and voice synthesis models across pricing, latency, and quality. Click any column header to sort.
Last updated: 21 Apr 2026 · Prices per 1M characters
11 models
| ModelModel | ProviderProvider | Cost / 1M chars$/1M ch | LatencyLatency | QualityQual. |
|---|---|---|---|---|
| ElevenLabs | $24.00 | 1s | 94 | |
| ElevenLabs | $18.00 | 500ms | 90 | |
| Cartesia | $15.00 | 300ms | 88 | |
| OpenAI | $12.00 | 1s | 86 | |
| ElevenLabs | $12.00 | 300ms | 85 | |
| OpenAI | $30.00 | 2s | 83 | |
| Resemble AI | $20.00 | 500ms | 83 | |
| Deepgram | $15.00 | 500ms | 82 | |
| PlayHT | $18.00 | 500ms | 82 | |
| Cartesia | $8.00 | 200ms | 80 | |
| OpenAI | $15.00 | 1s | 78 |
Cost per 1M characters (USD) (standard tier)Quality: composite score (0-100)