Llasa-3B

Llasa-3B is a text-to-speech synthesis model based on LLaMA that supports speech generation in both Chinese and English.

CommonProductOthersText-to-SpeechSpeech Synthesis
Llasa-3B is a powerful text-to-speech (TTS) model developed based on the LLaMA architecture, focused on Chinese and English speech synthesis. By integrating XCodec2's speech encoding technology, it efficiently converts text into natural and fluent speech. Its main advantages include high-quality speech output, support for multilingual synthesis, and flexible speech prompting capabilities. This model is suitable for various applications requiring speech synthesis, such as audiobook production and voice assistant development. Its open-source nature also allows developers to explore and expand its functionalities freely.
Visit

Llasa-3B Visit Over Time

Monthly Visits

21315886

Bounce Rate

45.50%

Page per Visit

5.2

Visit Duration

00:05:02

Llasa-3B Visit Trend

Llasa-3B Visit Geography

Llasa-3B Traffic Sources

Llasa-3B Alternatives