seed-tts-eval

A testing dataset for evaluating a model's zero-shot speech generation capability

CommonProductOpenSourceSpeech SynthesisAutomatic Speech Recognition
seed-tts-eval is a testing dataset for evaluating a model's zero-shot speech generation capability. It provides an objective evaluation test set across diverse domains, containing samples extracted from both English and Mandarin public language repositories. This dataset is used to measure the model's performance across various objective metrics. It utilizes 1000 samples from the Common Voice dataset and 2000 samples from the DiDiSpeech-2 dataset.
Visit

seed-tts-eval Visit Over Time

Monthly Visits

488643166

Bounce Rate

37.28%

Page per Visit

5.7

Visit Duration

00:06:37

seed-tts-eval Visit Trend

seed-tts-eval Visit Geography

seed-tts-eval Traffic Sources

seed-tts-eval Alternatives