SenseVoice
Multilingual speech understanding model providing high-precision speech recognition and sentiment analysis.
CommonProductOthersSpeech RecognitionSentiment Analysis
SenseVoice is a speech foundation model with multiple speech understanding capabilities, including Automatic Speech Recognition (ASR), Language Identification (LID), Speech Emotion Recognition (SER), and Audio Event Detection (AED). It focuses on high-precision multilingual speech recognition, speech emotion recognition, and audio event detection, supporting over 50 languages and exceeding the recognition performance of the Whisper model. The model uses an autoregressive end-to-end framework, resulting in extremely low inference latency, making it an ideal choice for real-time speech processing.
SenseVoice Visit Over Time
Monthly Visits
515580771
Bounce Rate
37.20%
Page per Visit
5.8
Visit Duration
00:06:42