ElevenLabs Scribe

Scribe is ElevenLabs' speech-to-text model, billed as the world's most accurate, with support for 99 languages.

Tags: Editor Recommendation, Productivity, Speech Recognition, Multilingual
Scribe is a high-accuracy speech-to-text model developed by ElevenLabs and designed to handle the unpredictability of real-world audio. It supports 99 languages and provides word-level timestamps, speaker diarization, and audio event labeling. On the FLEURS and Common Voice benchmarks, Scribe is reported to outperform leading models such as Gemini 2.0 Flash, Whisper Large V3, and Deepgram Nova-3. It also significantly reduces error rates for traditionally underserved languages such as Serbian, Cantonese, and Malayalam, where competing models often have error rates above 40%. Scribe is available through an API for developer integration, and a low-latency version for real-time applications is planned.
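
To make the combination of word-level timestamps, speaker diarization, and audio event labeling more concrete, here is a minimal sketch of how such output might be post-processed into speaker turns. It makes no network call. The payload shape and the field names (`text`, `start`, `end`, `type`, `speaker_id`) are assumptions chosen for illustration and are not confirmed by this page, and `Turn` and `group_by_speaker` are hypothetical helpers, not part of any ElevenLabs SDK.

```python
from dataclasses import dataclass


@dataclass
class Turn:
    """One continuous stretch of speech by a single speaker."""
    speaker: str
    start: float  # seconds
    end: float    # seconds
    text: str


def group_by_speaker(words):
    """Merge consecutive word entries from the same speaker into turns.

    `words` is a list of dicts with the assumed keys
    text, start, end, type, and speaker_id.
    """
    turns = []
    for w in words:
        if w.get("type") == "audio_event":
            continue  # skip event labels such as "(laughter)" in this sketch
        if turns and turns[-1].speaker == w["speaker_id"]:
            # Same speaker as the previous word: extend the current turn.
            turns[-1].end = w["end"]
            turns[-1].text += " " + w["text"]
        else:
            # Speaker changed (or first word): start a new turn.
            turns.append(Turn(w["speaker_id"], w["start"], w["end"], w["text"]))
    return turns


# Hypothetical sample data, not real API output.
sample_words = [
    {"text": "Hello", "start": 0.0, "end": 0.4, "type": "word", "speaker_id": "speaker_0"},
    {"text": "there", "start": 0.5, "end": 0.9, "type": "word", "speaker_id": "speaker_0"},
    {"text": "(laughter)", "start": 1.0, "end": 1.6, "type": "audio_event", "speaker_id": "speaker_0"},
    {"text": "Hi", "start": 1.8, "end": 2.0, "type": "word", "speaker_id": "speaker_1"},
]

turns = group_by_speaker(sample_words)
for t in turns:
    print(f"[{t.start:.1f}-{t.end:.1f}] {t.speaker}: {t.text}")
```

Grouping on speaker changes keeps the sketch short. A real integration would take the field names from the official API reference and might also split turns on long pauses.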

ElevenLabs Scribe Visits Over Time

Monthly Visits: 16,796,078
Bounce Rate: 38.63%
Pages per Visit: 5.2
Visit Duration: 00:05:41


ElevenLabs Scribe Alternatives