AIbase
Product LibraryTool Navigation

Voice-synthesis

Public

This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real-time. SV2TTS is a three-stage deep learning framework that allows to create a numerical representation of a voice from a few seconds of audio, and to use it to condition a text-to-speech model trained to generalize to new voices.

Creat2020-05-04T04:59:45
Update2025-03-25T13:09:34
169
Stars
0
Stars Increase

Related projects