AI News

AI News

Don't miss any moment of global AI innovation

AI Daily

Daily three-minute AI industry trends

AI Timeline

AI industry milestones

Al Hardware

Lists all AI hardware products.

AI Monetization Guide

Latest Cases

AI monetization case sharing

Image Collection

AI image creation monetization cases

Video Collection

AI video creation monetization cases

Audio Collection

AI audio creation monetization cases

Content Collection

AI content writing monetization cases

AI Tutorials

Latest Tutorials

Free sharing of the latest AI tutorials

AI Product Rankings

AI Product Ranking

Shows total visits ranking of AI websites

AI Traffic Growth Ranking

Track fastest growing AI websites by traffic

AI Traffic Decline Ranking

Focus on AI websites with significant traffic drops

AI Weekly Ranking

Shows weekly visits ranking of AI websites

Popular Country Rankings

United States

AI websites most popular with US users

China

AI websites most popular with Chinese users

India

AI websites most popular with Indian users

Brazil

AI websites most popular with Brazilian users

Popular Category Rankings

Image Generation

Total visits ranking of AI image generation websites

Personal Assistant

Total visits ranking of AI personal assistant websites

Character Generation

Total visits ranking of AI character generation websites

Video Generation

Total visits ranking of AI video generation websites

Popular Open Source Data Rankings

AI Project Ranking

GitHub popular AI projects by total stars

AI Project Growth Ranking

GitHub popular AI projects by growth rate

AI Developer Ranking

GitHub popular AI developer ranking

AI Organization Ranking

GitHub popular AI organization ranking

Popular Open Source Categories

Deepseek

GitHub popular deepseek open source projects

TTS

GitHub popular TTS open source projects

LLM

GitHub popular LLM open source projects

ChatGPT

GitHub popular ChatGPT open source projects

AI Open Source Project Library

Overview

Overview of GitHub popular AI open source projects

Product Library Tool Navigation

seed-tts-eval

A testing dataset for evaluating a model's zero-shot speech generation capability

CommonProductOpenSourceSpeech SynthesisAutomatic Speech Recognition

seed-tts-eval is a testing dataset for evaluating a model's zero-shot speech generation capability. It provides an objective evaluation test set across diverse domains, containing samples extracted from both English and Mandarin public language repositories. This dataset is used to measure the model's performance across various objective metrics. It utilizes 1000 samples from the Common Voice dataset and 2000 samples from the DiDiSpeech-2 dataset.

seed-tts-eval

seed-tts-eval Visit Over Time

Monthly Visits

521149929

Bounce Rate

35.96%

Page per Visit

6.1

Visit Duration

00:06:29

seed-tts-eval Visit Trend

seed-tts-eval Visit Geography

seed-tts-eval Traffic Sources

seed-tts-eval Alternatives

seed-tts-eval — A testing dataset for evaluating a model's zero-shot speech generation capability

•Speech Synthesis•Automatic Speech Recognition

EaseVoice Trainer — A simple and easy-to-use speech cloning and speech model training tool.

•Speech Synthesis•Machine Learning

MegaTTS 3 — A highly efficient speech synthesis model that supports Chinese, English, and speech cloning.

•Speech Synthesis•Deep Learning

OpenAI.fm — Developers can interactively experience the new voice models gpt-4o-transcribe, gpt-4o-mini-transcribe, and gpt-4o-mini-tts in the OpenAI API.

•Speech Synthesis•Developer Tools

Orpheus TTS — An open-source text-to-speech system dedicated to achieving natural human speech.

•Text-to-Speech•Open Source

CSM 1B — CSM 1B is a text-to-speech generation model developed by Sesame, capable of generating high-quality audio.

•Speech Synthesis•Text-to-Speech

Sesame CSM — A model for generating conversational speech, supporting high-quality speech generation from text and audio input.

•Speech Synthesis•Artificial Intelligence

Sesame AI — Sesame AI is an advanced text-to-speech platform that generates natural conversational speech with emotional intelligence.

•Speech Synthesis•Artificial Intelligence

Spark-TTS — Spark-TTS is a highly efficient single-stream decoupled speech synthesis model based on large language models.

•Speech Synthesis•Large Language Model

Llasa — A TTS base model based on the Llama framework, compatible with 160,000 hours of tokenized speech data.

•Speech Synthesis•Artificial Intelligence

Octave TTS — Octave TTS is the first speech synthesis model capable of understanding the meaning of text, generating speech that is rich in emotion and style.

InternationalSelection

•Speech Synthesis•Artificial Intelligence

IndexTTS — An industrial-grade, controllable, and efficient zero-shot text-to-speech system

•Speech Synthesis•Artificial Intelligence

Xingsheng AI — Xingsheng AI is an AI podcast generator that can create AI podcasts from any content.

ChineseSelection

•Podcast•Content Creation

LLaSA_training — LLaSA: Extending training and inference computational requirements for LLaMA-based speech synthesis

•Speech Synthesis•Deep Learning

Llasa-1B — Llasa-1B is a text-to-speech (TTS) model based on the LLaMA architecture, supporting both Chinese and English speech synthesis.

•Text-to-Speech•Speech Synthesis

Llasa-3B — Llasa-3B is a text-to-speech synthesis model based on LLaMA that supports speech generation in both Chinese and English.

•Text-to-Speech•Speech Synthesis

Hailuo AI Audio — Hailuo AI Audio is an audio synthesis tool designed to create realistic speech.

•Speech synthesis•Audio production

kokoro-onnx — A text-to-speech (TTS) project based on Kokoro and ONNX runtime.

•TTS•Speech Synthesis

audiblez — A tool to convert eBooks into audiobooks.

•eBooks•audiobooks

Kokoro-82M — A cutting-edge text-to-speech (TTS) model with 82 million parameters.

•Text-to-Speech•Speech Synthesis

BetterWhisperX

BetterWhisperX — An automatic speech recognition tool providing word-level timestamps and speaker identification.

•Automatic Speech Recognition•Word-Level Timestamps

Voxdazz — AI Celebrity Voice Generator that transforms text into voice.

•Speech Synthesis•Celebrity Imitation

Gemini 2.0 Flash Experimental — A high-performance AI model developed by Google DeepMind

InternationalSelection

•Machine Learning•Natural Language Processing

Moonshine Web — Real-time browser-based speech recognition application

•Speech recognition•Automatic speech recognition

CosyVoice Speech Generation Model 2.0-0.5B — Efficient, multilingual speech synthesis model

•Speech Synthesis•Artificial Intelligence

GaussianSpeech — Audio-driven high-fidelity 3D head avatar synthesis technology

•3D Animation•Speech Synthesis

OuteTTS-0.2-500M

OuteTTS-0.2-500M — High-performance text-to-speech synthesis model

•Text-to-Speech•Speech Synthesis

whisper-ner-v1 — An advanced model for joint speech transcription and entity recognition.

•Speech Recognition•Entity Recognition

WhisperNER — Unified open-source named entity and speech recognition model

•Automatic Speech Recognition•Named Entity Recognition

OuteTTS — An experimental text-to-speech model.

•Text-to-Speech•Speech Synthesis