AI News

Don't miss any moment of global AI innovation

AI Daily

Daily three-minute AI industry trends

AI Timeline

AI industry milestones

Al Hardware

Lists all AI hardware products.

AI Monetization Guide

Latest Cases

AI monetization case sharing

Image Collection

AI image creation monetization cases

Video Collection

AI video creation monetization cases

Audio Collection

AI audio creation monetization cases

Content Collection

AI content writing monetization cases

AI Tutorials

Latest Tutorials

Free sharing of the latest AI tutorials

AI Product Rankings

AI Product Ranking

Shows total visits ranking of AI websites

AI Traffic Growth Ranking

Track fastest growing AI websites by traffic

AI Traffic Decline Ranking

Focus on AI websites with significant traffic drops

AI Weekly Ranking

Shows weekly visits ranking of AI websites

Popular Country Rankings

United States

AI websites most popular with US users

China

AI websites most popular with Chinese users

India

AI websites most popular with Indian users

Brazil

AI websites most popular with Brazilian users

Popular Category Rankings

Image Generation

Total visits ranking of AI image generation websites

Personal Assistant

Total visits ranking of AI personal assistant websites

Character Generation

Total visits ranking of AI character generation websites

Video Generation

Total visits ranking of AI video generation websites

Popular Open Source Data Rankings

AI Project Ranking

GitHub popular AI projects by total stars

AI Project Growth Ranking

GitHub popular AI projects by growth rate

AI Developer Ranking

GitHub popular AI developer ranking

AI Organization Ranking

GitHub popular AI organization ranking

Popular Open Source Categories

Deepseek

GitHub popular deepseek open source projects

TTS

GitHub popular TTS open source projects

LLM

GitHub popular LLM open source projects

ChatGPT

GitHub popular ChatGPT open source projects

AI Open Source Project Library

Overview

Overview of GitHub popular AI open source projects

Product Library Tool Navigation

ElevenLabs Releases Scribe Speech-to-Text Model, Achieving Record Accuracy with 96.7% in English

AIbase基地

Published inAI News · 5 min read · Feb 27, 2025

ElevenLabs, a prominent AI voice cloning and generation startup, recently launched its latest speech-to-text model – Scribe v1. This model claims to achieve the highest accuracy across multiple languages, and users can experience it via the company's website.

According to ElevenLabs' benchmarks, Scribe surpasses Google's Gemini 2.0 Flash, OpenAI's Whisper v3, and Deepgram Nova-3 in accurately converting spoken language to text, achieving an unprecedentedly low error rate. The company states that Scribe supports high-precision transcription in 99 languages, including previously underserved languages like Serbian, Cantonese, and Malayalam.

ElevenLabs' Chief Research Scientist, Flavio Schneider, announced on X (formerly Twitter) that Scribe is the company's "most intelligent audio understanding model" to date. He emphasized that Scribe is more than just a transcription tool; it understands audio content, detecting non-speech events (like laughter, sound effects, music, and background noise), and accurately distinguishing speakers in long audio content within complex environments. Notably, Scribe can identify and isolate up to 32 different speakers within a single audio file.

ElevenLabs advises that Scribe is "best suited for scenarios requiring high-accuracy transcription, rather than real-time transcription." The company also plans to release a low-latency version to expand its use in real-time applications.

Based on benchmarks from FLEURS and Common Voice, Scribe excels at handling real-world audio challenges, achieving the lowest word error rates, particularly in Italian (98.7% accuracy) and English (96.7% accuracy).

Scribe is now available via the ElevenLabs website and API, priced at $0.40 per hour of input audio, with a 50% discount for the next six weeks. A low-latency version for real-time applications is also under development.

For enterprise decision-makers, Scribe provides a scalable tool for high-accuracy transcription, suitable for industries needing automated documentation, meeting transcription, and content accessibility. Its high-precision handling of multiple languages will also benefit multinational corporations, media companies, and customer support applications.

It's noteworthy that Scribe's release coincided with the launch of Hume's text-to-speech model, Octave. Octave, a large language model-based text-to-speech tool, allows users to customize AI-generated voices based on emotional needs, intended for content creation such as audiobooks, podcasts, and video game voiceovers. While Scribe and Octave have different functionalities, their simultaneous release reflects the increasingly fierce competition in AI-driven audio models.

Product Link: https://elevenlabs.io/blog/meet-scribe

Key Highlights:
🌟 Scribe v1 is ElevenLabs' latest speech-to-text model, achieving record-high accuracy across multiple languages.
🗣️ Supports 99 languages, can distinguish up to 32 different speakers, and adapts to complex audio environments.
💰 Currently priced at $0.40 per hour, with a 50% discount for the next six weeks; a low-latency version is under development.

ElevenLabs Scribe v1 Speech-to-text Model AI Speech Recognition

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team

AI News Recommendations

London AI Creative Studio Wonder Secures $3 Million in Funding with Participation from ElevenLabs and OpenAI Leadership

Wonder, a London-based AI creative studio, has announced a $3 million funding round. The investment round included participation from key figures at ElevenLabs and OpenAI.

Apr 14, 2025

1.0k

ElevenLabs Launches MCP Server: Seamless AI Voice Integration for Smart Assistants

Apr 8, 2025

1.5k

ElevenLabs Launches World's First AI Text-to-Bark Model

ElevenLabs, a pioneer in AI audio technology, recently announced the launch of Text To Bark, the world's first AI text-to-speech model designed specifically for dogs. This innovative technology has garnered significant attention from the tech industry and pet lovers alike. It purportedly converts human-input text into highly realistic dog barks, with a claimed accuracy so high that 95% of dogs can't distinguish them from real canine vocalizations. This is considered a bold attempt to facilitate communication between humans and their pets.

Apr 2, 2025

460

OpenAI Unveils New Speech-to-Text Model: gpt-4o-transcribe, Boasting Significantly Improved Accuracy

Following previous attention in the speech AI field, OpenAI continues its exploration, releasing three new self-developed speech models: gpt-4o-transcribe, gpt-4o-mini-transcribe, and gpt-4o-mini-tts. The most notable is gpt-4o-transcribe. These new models are currently...

Mar 21, 2025

1.6k

ElevenLabs Opens AI Audiobook Publishing Program to All Authors, Challenging Audible

According to TechCrunch, voice AI company ElevenLabs now allows authors to publish AI-generated audiobooks on its reader app, following a previous partnership with Spotify for AI-narrated audiobooks. ElevenLabs, which raised a massive $190 million last month, began inviting authors to pilot the publishing program last year and has now opened it to all authors. The company aims to provide affordable and easy-to-use audiobook creation tools to reduce production costs and compete with Audible.

Feb 26, 2025

810

Spotify Partners with ElevenLabs to Launch AI Narrated Audiobook Option

On Thursday, Spotify announced its partnership with ElevenLabs to officially introduce audiobooks narrated using the company's AI voice technology. As one of the most recognized AI audio providers, this new collaboration is expected to significantly increase the number of AI-narrated audiobooks on the platform. According to the new process, authors need to first download audio file packages from ElevenLabs, then access Spotify's audiobook distribution service, Findaway Voices.

Feb 21, 2025

1.7k

AI Voice Unicorn ElevenLabs Completes $250 Million Series C Funding, Valuation Surpasses $3 Billion

Artificial intelligence voice company ElevenLabs has once again captured attention, having just completed a Series C funding round totaling $250 million, with a valuation between $3 billion and $3.3 billion. This round was led by ICONIQ Growth, demonstrating strong market confidence in AI voice technology. Just a year ago, ElevenLabs completed an $80 million Series B funding, when its valuation was only a third of what it is now, highlighting the company's rapid growth. ElevenLabs was founded by Mat

Jan 25, 2025

2.1k

Fast! ElevenLabs Launches Flash Voice Conversation Model: Only 75 Milliseconds Delay Supporting 32 Voices

Dec 20, 2024

3.1k

ElevenLabs Launches New Conversational AI Platform to Empower Rapid Development of Intelligent Voice Agents

Dec 4, 2024

2.6k

Supernatural AI Voice! ElevenLabs Launches GenFM Feature to Compete with NotebookLM

Voice AI startup ElevenLabs has launched the GenFM feature, which is integrated into its mobile app ElevenLabs Reader. This feature allows the creation of multi-character podcast audio content from various content types such as PDFs, articles, and eBooks. GenFM supports 32 languages, reaching a global audience, and can automatically select suitable voices to generate podcast content. The ElevenReader app is available for both iOS and Android platforms.

Nov 29, 2024

1.3k