SenseVoice

Multilingual speech understanding model providing high-precision speech recognition and sentiment analysis.

CommonProductOthersSpeech RecognitionSentiment Analysis
SenseVoice is a speech foundation model with multiple speech understanding capabilities, including Automatic Speech Recognition (ASR), Language Identification (LID), Speech Emotion Recognition (SER), and Audio Event Detection (AED). It focuses on high-precision multilingual speech recognition, speech emotion recognition, and audio event detection, supporting over 50 languages and exceeding the recognition performance of the Whisper model. The model uses an autoregressive end-to-end framework, resulting in extremely low inference latency, making it an ideal choice for real-time speech processing.
Visit

SenseVoice Visit Over Time

Monthly Visits

503747431

Bounce Rate

37.31%

Page per Visit

5.7

Visit Duration

00:06:44

SenseVoice Visit Trend

SenseVoice Visit Geography

SenseVoice Traffic Sources

SenseVoice Alternatives