AI Speech-to-Text Models
Compare 7 speech recognition models across pricing, accuracy, processing speed, and maximum audio duration. Click any column header to sort.
Last updated: 21 Apr 2026 · Prices per minute of audio
7 models
| ModelModel | ProviderProvider | Cost / Minute$/min | Max DurationMax Dur. | SpeedSpeed | QualityQual. |
|---|---|---|---|---|---|
| AssemblyAI | $0.0065 | 10.0hr | 2x RT | 93 | |
| OpenAI | $0.0060 | 2.0hr | 5x RT | 92 | |
| Deepgram | $0.0043 | 10.0hr | 3x RT | 91 | |
| $0.0030 | 9.5hr | 3x RT | 90 | ||
| OpenAI | $0.0060 | 2.0hr | 10x RT | 88 | |
| Deepgram | $0.0036 | 10.0hr | 3x RT | 87 | |
| AssemblyAI | $0.0020 | 10.0hr | 5x RT | 85 |
Cost per minute of audio (USD) (standard tier)Quality: composite score (0-100)