Seed-ASR

Speech recognition technology based on large language models.

Premium | New Product | Others | Speech Recognition | Large Language Models
Seed-ASR is a speech recognition model developed by ByteDance and built on large language models (LLMs). It feeds continuous speech representations and contextual information into the LLM. Backed by large-scale training and context-aware capabilities, it significantly improves performance on comprehensive evaluation sets that span multiple domains, accents/dialects, and languages. Compared with recently released large ASR models, Seed-ASR achieves a 10%-40% reduction in word error rate on public Chinese and English test sets.
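To make the word error rate figure concrete, here is a minimal sketch of how WER is commonly computed (word-level edit distance divided by the number of reference words) and how a reduction between two systems can be expressed. The function names and the sample numbers are illustrative assumptions, not taken from Seed-ASR or its evaluation; the sketch also assumes the quoted 10%-40% is a relative reduction, which the description does not state.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / reference word count."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion (reference word missing)
                curr[j - 1] + 1,     # insertion (extra hypothesis word)
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


def relative_reduction(baseline_wer: float, new_wer: float) -> float:
    """Fraction by which new_wer improves on baseline_wer."""
    if baseline_wer <= 0:
        raise ValueError("baseline_wer must be positive")
    return (baseline_wer - new_wer) / baseline_wer


if __name__ == "__main__":
    # Made-up transcript pair: one substitution and one deletion.
    wer = word_error_rate("the cat sat on the mat", "the cat sit on mat")
    print(f"WER: {wer:.3f}")

    # Hypothetical WERs for a baseline and an improved system.
    print(f"Relative reduction: {relative_reduction(0.05, 0.04):.0%}")
```

Under this reading, a hypothetical baseline at 5.0% WER and a new system at 4.0% WER would count as a 20% reduction. For Chinese, the same calculation is usually applied to characters rather than whitespace-separated words.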

Seed-ASR Visit Over Time

Monthly Visits: 10,660
Bounce Rate: 60.50%
Pages per Visit: 1.4
Visit Duration: 00:00:46

Seed-ASR Alternatives