Xiaohongshu's Open-Source Speech Recognition Model FireRedASR Delivers Superior Accuracy in Chinese Recognition

AIbase基地

Published in AI News · 4 min read · Feb 12, 2025

Chinese speech recognition has long attracted significant attention. Recently, the FireRed team at Xiaohongshu released a new open-source speech recognition model, FireRedASR. This large-model-based speech recognition system has achieved outstanding results on multiple standard test sets, marking a notable advance in Chinese speech recognition technology.

The core metric reported for FireRedASR is the character error rate (CER): the lower the CER, the better the model's recognition performance. In recent public tests, FireRedASR achieved a CER of 3.05%, an 8.4% relative reduction compared with the previous best model, Seed-ASR. This result demonstrates the FireRed team's capacity for innovation in speech recognition technology.
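For readers unfamiliar with the metric, CER is conventionally computed as the character-level edit distance between the reference transcript and the model output, divided by the reference length. The sketch below illustrates that standard definition only; it is not the FireRed team's evaluation script, the function names are hypothetical, and the sample sentences are made up for illustration.

```python
def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance: minimum substitutions, insertions, and deletions."""
    # prev[j] holds the distance between the processed prefix of ref and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]  # turning ref[:i] into an empty string takes i deletions
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # delete r from the reference
                curr[j - 1] + 1,     # insert h into the reference
                prev[j - 1] + cost,  # match or substitute
            ))
        prev = curr
    return prev[-1]


def cer(ref: str, hyp: str) -> float:
    """Character error rate: edit distance divided by reference length."""
    if not ref:
        raise ValueError("reference transcript must not be empty")
    return edit_distance(ref, hyp) / len(ref)


# Made-up example: one wrong character in a six-character reference.
reference = "今天天气很好"
hypothesis = "今天天汽很好"
print(f"CER = {cer(reference, hypothesis):.2%}")  # CER = 16.67%
```

Real evaluations usually normalize the text first (punctuation, casing, number formats), so published CER figures depend on those choices as well as on the model.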

FireRedASR comes in two variants: FireRedASR-LLM and FireRedASR-AED. The former targets the highest possible recognition accuracy, while the latter balances accuracy against inference efficiency. The team has released models of several sizes along with inference code to suit different application scenarios.

FireRedASR has also performed strongly in everyday application scenarios. On a test set drawn from varied sources, including short videos, live broadcasts, and voice input, FireRedASR-LLM reduced the CER by 23.7% to 40% relative to leading service providers in the industry. In lyric recognition in particular, the model achieved a relative CER reduction of 50.2% to 66.7%.
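The percentages quoted in this article are most naturally read as relative reductions, meaning the drop in CER divided by the baseline's CER. As a rough sketch of that arithmetic, the snippet below takes the article's 3.05% CER and 8.4% reduction and works out what baseline CER they would imply. The relative-reduction reading and the function names are assumptions; the article does not state the baseline figure.

```python
def relative_reduction(baseline_cer: float, new_cer: float) -> float:
    """Fraction by which new_cer is lower than baseline_cer."""
    return (baseline_cer - new_cer) / baseline_cer


def implied_baseline(new_cer: float, reduction: float) -> float:
    """Baseline CER implied by a new CER and a relative reduction."""
    return new_cer / (1.0 - reduction)


# Figures from the article: 3.05% CER, described as an 8.4% reduction.
baseline = implied_baseline(3.05, 0.084)
print(round(baseline, 2))  # 3.33 (percent CER, under the relative-reduction reading)

# Round trip: the implied baseline gives back the quoted reduction.
print(round(relative_reduction(baseline, 3.05), 3))  # 0.084
```

Under this reading, a 40% relative reduction means the new CER is 0.6 times the baseline CER, not 40 percentage points lower.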

FireRedASR also excels on Chinese dialects and English, achieving a significantly lower CER than previous open-source models on the KeSpeech and LibriSpeech test sets, which demonstrates its robustness and adaptability across language environments.

The FireRed team hopes to promote the development and application of speech recognition technology through the open-sourcing of this new model, contributing to the future of voice interaction. All models and code have been made public on GitHub, encouraging more developers and researchers to participate.

Hugging Face: https://huggingface.co/FireRedTeam

GitHub: https://github.com/FireRedTeam/FireRedASR

Highlights:
- 🎤 FireRedASR is the newly released open-source speech recognition model from the Xiaohongshu team, with excellent accuracy in recognizing Chinese.
- 🚀 The model is divided into FireRedASR-LLM and FireRedASR-AED, catering to accuracy and efficiency needs respectively.
- 🌍 FireRedASR performs excellently in various scenarios, suitable for Mandarin, Chinese dialects, and English among other language environments.

This article is from AIbase Daily

