Seed-ASR
Speech recognition technology based on large language models.
PremiumNewProductOthersSpeech RecognitionLarge Language Models
Seed-ASR is a speech recognition model developed by ByteDance that leverages large language models (LLMs). By inputting continuous speech representations and contextual information into the LLM, it significantly enhances performance in comprehensive evaluation sets across multiple fields, accents/dialects, and languages, guided by extensive training and context-awareness capabilities. Compared to recently released large ASR models, Seed-ASR achieves a 10%-40% reduction in word error rate on public test sets in both Chinese and English, further demonstrating its strong performance.
Seed-ASR Visit Over Time
Monthly Visits
10660
Bounce Rate
60.50%
Page per Visit
1.4
Visit Duration
00:00:46