TANGO Model

Co-speech Gesture Video Reproduction Technology

CommonProductVideoArtificial IntelligenceGesture Recognition
TANGO is a co-speech gesture video reproduction technology based on hierarchical audio-motion embedding and diffusion interpolation. It utilizes advanced artificial intelligence algorithms to convert voice signals into corresponding gesture animations, enabling the natural reproduction of gestures in videos. This technology has broad application prospects in video production, virtual reality, and augmented reality, significantly enhancing the interactivity and realism of video content. TANGO was jointly developed by the University of Tokyo and CyberAgent AI Lab, representing the cutting edge of artificial intelligence in gesture recognition and motion generation.
Visit

TANGO Model Visit Over Time

Monthly Visits

9085

Bounce Rate

59.56%

Page per Visit

1.1

Visit Duration

00:00:08

TANGO Model Visit Trend

TANGO Model Visit Geography

TANGO Model Traffic Sources

TANGO Model Alternatives