Llasa-1B
Llasa-1B is a text-to-speech (TTS) model based on the LLaMA architecture, supporting both Chinese and English speech synthesis.
CommonProductOthersText-to-SpeechSpeech Synthesis
Llasa-1B is a text-to-speech model developed by the Audio Lab at the Hong Kong University of Science and Technology. Based on the LLaMA architecture and integrated with speech tokens from the XCodec2 codec, it converts text into natural and fluent speech. The model has been trained on 250,000 hours of Chinese and English speech data and supports generating speech from plain text, as well as utilizing given voice prompts for synthesis. Its main advantage is the ability to produce high-quality multilingual speech, making it suitable for various applications such as audiobooks and voice assistants. The model is licensed under CC BY-NC-ND 4.0, prohibiting commercial use.
Llasa-1B Visit Over Time
Monthly Visits
21315886
Bounce Rate
45.50%
Page per Visit
5.2
Visit Duration
00:05:02