Emilia

Large-scale Multilingual Voice Generation Dataset

CommonProductOthersVoice DatasetMultilingual
Emilia is an open-source multilingual field voice dataset specifically designed for large-scale voice generation research. It includes over 10,100 hours of high-quality voice data in six languages with corresponding text transcriptions, covering a variety of speaking styles and content types such as stand-up comedy, interviews, debates, sports commentary, and audiobooks.
Visit

Emilia Visit Over Time

Monthly Visits

20899836

Bounce Rate

46.04%

Page per Visit

5.2

Visit Duration

00:04:57

Emilia Visit Trend

Emilia Visit Geography

Emilia Traffic Sources

Emilia Alternatives