The open speech-to-text / ASR models you can run yourself — Whisper, Distil-Whisper, NVIDIA Parakeet & Canary, Moonshine and more. Ranked by recognition, each with the honest VRAM-to-run, license, and runner. We list the recognized open transcription models with transparent costs (some run real-time on CPU); accuracy on your audio is yours to judge.
How this is ranked: Objective task filter, ordered by notability/recognition. We don't quote word-error-rate benchmarks we didn't run — we surface the recognized open ASR models with honest run-costs. 'Whisper alternative' is a real high-intent query; we present alternatives, the user judges accuracy for their language/audio.
More: all "best" lists · cost calculator · all models
Open the free Spanvero advisor → · Honest, $0-markup. © 2026 Cynosure LLC.