2024-07-26 09:26:21
AIbase
Wuhan University, China Mobile's Jiutian AI Team, and Duke Kunshan University Release Open-Source Audio-Visual Speaker Recognition Dataset VoxBlink2
Wuhan University, together with China Mobile's Jiutian AI team and Duke Kunshan University, has released VoxBlink2, an open-source audio-visual speaker recognition dataset built from YouTube data. It contains more than 110,000 hours of audio-visual recordings: 9,904,382 high-quality audio clips with their corresponding video segments, drawn from 111,284 YouTube users. This makes it the largest publicly available audio-visual speaker recognition dataset to date. The release is intended to enrich open-source speech corpora and to support the training of large voiceprint models.