ElevenLabs Scribe
Scribe is a speech-to-text model that ElevenLabs bills as the world's most accurate, supporting 99 languages.
Editor Recommendation, Productivity, Speech Recognition, Multilingual
Scribe is a speech-to-text model developed by ElevenLabs and designed to handle the unpredictability of real-world audio. It supports 99 languages and provides word-level timestamps, speaker diarization, and audio event labeling. According to ElevenLabs, Scribe outperforms leading models such as Gemini 2.0 Flash, Whisper Large V3, and Deepgram Nova-3 on the FLEURS and Common Voice benchmarks. It significantly reduces error rates for traditionally underserved languages such as Serbian, Cantonese, and Malayalam, where competing models often exceed 40% error rates. Scribe is available through an API for developer integration, and ElevenLabs plans to launch a low-latency version for real-time applications.
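To illustrate how word-level timestamps, speaker diarization, and audio event labels might be combined downstream, here is a minimal Python sketch that merges per-word entries into speaker turns. It makes no network call. The entry fields used here (`text`, `start`, `end`, `type`, `speaker_id`) and the `speaker_turns` helper are assumptions made for illustration, not a confirmed description of the Scribe API response; check the official API reference for the actual schema.

```python
from itertools import groupby


def speaker_turns(words):
    """Merge consecutive entries from the same speaker into turns.

    `words` is a list of dicts in an ASSUMED shape:
    {"text": str, "start": float, "end": float, "type": str, "speaker_id": str}
    where "type" is "word", "spacing", or "audio_event" (hypothetical values).
    """
    # Drop whitespace-only entries; keep spoken words and audio events.
    spoken = [w for w in words if w.get("type") != "spacing"]

    turns = []
    # groupby only merges adjacent entries, which is what a "turn" means here.
    for speaker, group in groupby(spoken, key=lambda w: w.get("speaker_id")):
        items = list(group)
        turns.append({
            "speaker": speaker,
            "start": items[0]["start"],
            "end": items[-1]["end"],
            "text": " ".join(w["text"] for w in items),
        })
    return turns


# Hand-written sample data in the assumed shape (not real model output).
sample_words = [
    {"text": "Hello", "start": 0.0, "end": 0.4, "type": "word", "speaker_id": "speaker_0"},
    {"text": " ", "start": 0.4, "end": 0.5, "type": "spacing", "speaker_id": "speaker_0"},
    {"text": "there", "start": 0.5, "end": 0.9, "type": "word", "speaker_id": "speaker_0"},
    {"text": "(laughter)", "start": 1.0, "end": 1.6, "type": "audio_event", "speaker_id": "speaker_1"},
    {"text": "Hi", "start": 1.7, "end": 1.9, "type": "word", "speaker_id": "speaker_1"},
]

turns = speaker_turns(sample_words)
for turn in turns:
    print(f'[{turn["start"]:.1f}-{turn["end"]:.1f}] {turn["speaker"]}: {turn["text"]}')
```

Grouping only adjacent entries keeps the turn order of the conversation intact, so a speaker who talks twice produces two separate turns rather than one merged block.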
ElevenLabs Scribe Visits Over Time
Monthly Visits: 16,796,078
Bounce Rate: 38.63%
Pages per Visit: 5.2
Visit Duration: 00:05:41