InstructAvatar

Text-guided emotional and action control for generating vivid 2D avatars

Common Product, Image, Avatar Generation, Emotional Control
InstructAvatar is a text-guided method for generating emotionally expressive 2D avatars. Users control an avatar's emotions and facial motion through natural-language instructions, which gives finer-grained control, better interactivity, and better generalization in the generated videos. The training data is a set of instruction-video pairs built with an automated annotation pipeline. The generator is a dual-branch, diffusion-based model that predicts the avatar conditioned on audio and text instructions at the same time. According to the reported experiments, InstructAvatar outperforms existing methods in fine-grained emotional control, lip-sync quality, and naturalness.

InstructAvatar Visit Over Time

Monthly Visits: 463
Bounce Rate: 0.00%
Pages per Visit: 3.0
Visit Duration: 00:01:53


InstructAvatar Alternatives