VideoChat
Real-time voice interaction digital human, supporting end-to-end voice solutions.
CommonProductVideoReal-time voice interactionDigital human
VideoChat is a real-time voice interaction digital human project that supports end-to-end voice solutions (GLM-4-Voice - THG) and cascading solutions (ASR-LLM-TTS-THG). Users can customize the appearance and voice of the digital human, with voice cloning capabilities that require no training, achieving initial package latency as low as 3 seconds. This project leverages the latest AI technologies, including Automatic Speech Recognition (ASR), Large Language Models (LLM), End-to-End Multimodal Large Language Models (MLLM), Text-to-Speech (TTS), and Talking Head Generation (THG), to provide users with a highly customizable and low-latency interaction experience.
VideoChat Visit Over Time
Monthly Visits
515580771
Bounce Rate
37.20%
Page per Visit
5.8
Visit Duration
00:06:42