InstructAvatar
Text-guided emotional and action control for generating vivid 2D avatars
CommonProductImageAvatar GenerationEmotional Control
InstructAvatar is an innovative text-guided method for generating 2D avatars with rich emotional expression. This model controls avatar emotions and facial expressions via a natural language interface, offering fine-grained control, improved interactivity, and generalization ability for generated videos. It utilizes an automated annotation process to construct a training dataset of instruction-video pairs and incorporates a novel dual-branch diffusion base generator capable of predicting avatars simultaneously based on audio and textual instructions. Experimental results demonstrate that InstructAvatar outperforms existing methods in fine-grained emotional control, lip-sync quality, and naturalness.
InstructAvatar Visit Over Time
Monthly Visits
235
Bounce Rate
39.76%
Page per Visit
1.0
Visit Duration
00:00:00