TANGO Model
Co-speech Gesture Video Reproduction Technology
CommonProductVideoArtificial IntelligenceGesture Recognition
TANGO is a co-speech gesture video reproduction technology based on hierarchical audio-motion embedding and diffusion interpolation. It utilizes advanced artificial intelligence algorithms to convert voice signals into corresponding gesture animations, enabling the natural reproduction of gestures in videos. This technology has broad application prospects in video production, virtual reality, and augmented reality, significantly enhancing the interactivity and realism of video content. TANGO was jointly developed by the University of Tokyo and CyberAgent AI Lab, representing the cutting edge of artificial intelligence in gesture recognition and motion generation.
TANGO Model Visit Over Time
Monthly Visits
9085
Bounce Rate
59.56%
Page per Visit
1.1
Visit Duration
00:00:08