Meta Unveils MoCha AI System: Generating Character Animations with Synchronized Speech and Movement

AIbase基地

Published inAI News · 4 min read · Apr 2, 2025

703

Meta and researchers from the University of Waterloo have collaborated to develop MoCha, an AI system capable of generating full-body character animations with synchronized speech and natural movements. Unlike previous models focusing solely on facial animation, MoCha renders full-body motion from multiple camera angles, including lip-sync, gestures, and interactions between multiple characters.

Improving Lip-Sync Accuracy

MoCha's demonstration highlights the synchronized generation of upper body movements and gestures in close-up and medium shots. Its unique "audio-visual window attention" mechanism successfully addresses two long-standing challenges in AI video generation: maintaining full audio resolution during video compression and avoiding lip-sync mismatches during parallel video generation.

MoCha innovates by limiting each frame's access to a specific audio data window, mimicking human speech production – lip movements are closely tied to immediate sounds, while body language reflects broader textual patterns. By adding markers before and after each frame's audio, MoCha achieves smoother transitions and more precise lip synchronization.

MoCha generates realistic videos with facial expressions, gestures, and lip movements based on text descriptions.

To build the system, the research team used 300 hours of carefully curated video content and combined it with text-based video sequences to expand the possibilities of expression and interaction. MoCha excels particularly in multi-character scenes; users define characters once and easily recall them across different scenes using labels (e.g., "Character 1" or "Character 2") without repeated descriptions.

Managing Multiple Characters

In tests across 150 different scenarios, MoCha outperformed comparable systems in both lip-sync accuracy and the naturalness of its movements. Independent evaluators consistently rated the generated videos as highly realistic, exhibiting unprecedented precision and naturalness.

Researchers developed a prompt template allowing users to reference specific characters without repeated descriptions.

MoCha's development shows significant potential across various applications, particularly in digital assistants, virtual avatars, advertising, and educational content. While Meta hasn't revealed whether the system will be open-sourced or if it remains a research prototype, its introduction undoubtedly marks a new chapter in AI-driven video generation.

MoCha's release is particularly noteworthy in the increasingly competitive landscape of AI video technology. Meta recently launched the MovieGen system, while ByteDance, the parent company of TikTok, is developing its own AI animation tools, including INFP, OmniHuman-1, and Goku, highlighting the active involvement of social media companies in this field.

MoCha AI Character Animation Meta Full-Body Motion Capture

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team

AI News Recommendations

Meta Ray-Ban Smart Glasses Roll Out Real-Time Translation, Offline Support Included

Meta recently announced the global rollout of real-time translation for its Ray-Ban Meta smart glasses. Previously, this feature was limited to early testing users in select markets. This full launch allows users to enjoy more convenient language conversion across various scenarios, especially the ability to overcome language barriers offline. According to Meta, the real-time translation feature on Ray-Ban Meta smart glasses now covers global sales markets and supports English, French, and Italian (among other languages).

Apr 24, 2025

Meta Launches Real-Time Translation for Ray-Ban Smart Glasses

Meta has announced the rollout of several new features for its Ray-Ban smart glasses, including real-time translation, Instagram messaging, and calling. Initially available only to select users in a preview program, these features are now available to all Ray-Ban Stories users. The real-time translation feature, first revealed at Meta Connect 2024, underwent limited testing in select countries last December. Now, users can utilize this feature in supported markets.

Apr 24, 2025

Meta Uses AI to Identify Underage Users on Instagram, Triggering Protective Mode

Meta has announced it will use artificial intelligence (AI) to verify the age of teenage users on Instagram, preventing users from misrepresenting their age. This measure aims to enhance the online safety of teenagers, ensuring they use social media in a protected environment. Meta stated that once the system detects an account suspected of belonging to a teenager, even if the user has entered an adult birthday, the system will automatically place it in "teen account" mode. Instagram reportedly implemented this last year.

Apr 22, 2025

120

Apple Intelligence Feature Restricted on Meta Apps: Ban Sparks AI Competition Debate

According to foreign media reports, Apple's newly launched Apple Intelligence feature is disabled on Meta's apps (including Facebook, Instagram, WhatsApp, and Threads), preventing users from accessing core functionalities such as Writing Tools and the custom emoji generator (Genmoji). This move is believed to be related to Meta's strategy of promoting its own Meta AI tools, highlighting the intensifying competition between the two tech giants in the AI arena.

Apr 21, 2025

170

UK AI Copyright Regulations Could Lead to Biased Models and Reduced Creator Revenue

Policy experts have voiced concerns over proposed AI copyright regulations in the UK, arguing that a lack of comprehensive text and data mining exemptions could lead to lower-quality AI models and stifle innovation. They suggest that prohibiting companies like OpenAI, Google, and Meta from using copyrighted material to train AI in the UK could result in biased model outputs, diminishing their effectiveness. The UK government launched a consultation in December 2024 to explore how to protect creators while allowing the use of creative content in AI model training.

Apr 16, 2025

Meta's Plan to Use EU User Data for AI Training Raises Privacy Concerns

Meta Platforms, Inc. has announced plans to use user data from its European Union applications, including Facebook and Instagram, to train its artificial intelligence models. The company clarified that the training data will include users' public posts, comments, and interactions with Meta AI, but will exclude private messages with friends and family. Training will be limited to users aged 18 and over. Meta stated it will inform its EU users of this plan this week via in-app notifications and emails.

Apr 15, 2025

110

Meta Restarts AI Training Using Public Content from European Users

Meta recently announced it will resume training its AI models using publicly available content from European users. This decision follows a pause last year due to data privacy concerns. Meta stated that this AI training will primarily rely on publicly shared posts and comments from adult users across the 27 EU countries. Furthermore, interactions between users and Meta AI, such as questions and queries, will also be used to train and improve its AI models. Image attribution: Image generated by AI, image licensing provided by Midj

Apr 15, 2025

100

California Crosswalk Buttons Hacked to Mimic Musk and Zuckerberg's Voices

Apr 15, 2025

Meta's Llama-4-Maverick Plummets in Rankings, Raising Concerns of Benchmark Manipulation

Meta's open-source large language model, Llama-4-Maverick, has experienced a dramatic drop in LMArena rankings, plummeting from second place to 32nd. This significant shift has sparked widespread skepticism among developers, who suspect Meta may have manipulated the benchmark by submitting a specially optimized version. The issue stems from Meta's April 6th release of its latest large language model, Llama 4, encompassing three versions: Scout, Maverick, and Behemoth.

Apr 14, 2025

650

Llama 4 Arrives on Vertex AI: Deploy Meta's New Model with One Click

Google Cloud Platform recently announced that Meta's latest generation of open-source large language models, Llama 4, is now available in its Vertex AI Model Garden. The news has generated significant excitement in the global tech community. The Scout and Maverick models from the Llama 4 series are now integrated into Vertex AI and available to developers via fully managed Model-as-a-Service (MaaS) API endpoints in preview.

Apr 10, 2025

280

AI News

AI Daily

AI Timeline

Al Hardware

Latest Cases

Image Collection

Video Collection

Audio Collection

Content Collection

Latest Tutorials

AI Product Ranking

AI Traffic Growth Ranking

AI Traffic Decline Ranking

AI Weekly Ranking

United States

China

India

Brazil

Image Generation

Personal Assistant

Character Generation

Video Generation

AI Project Ranking

AI Project Growth Ranking

AI Developer Ranking

AI Organization Ranking

Deepseek

TTS

LLM

ChatGPT

Overview

Meta Unveils MoCha AI System: Generating Character Animations with Synchronized Speech and Movement

AIbase基地

Improving Lip-Sync Accuracy

Managing Multiple Characters

This article is from AIbase Daily

AI News Recommendations

Meta Ray-Ban Smart Glasses Roll Out Real-Time Translation, Offline Support Included

Meta Launches Real-Time Translation for Ray-Ban Smart Glasses

Meta Uses AI to Identify Underage Users on Instagram, Triggering Protective Mode

Apple Intelligence Feature Restricted on Meta Apps: Ban Sparks AI Competition Debate

UK AI Copyright Regulations Could Lead to Biased Models and Reduced Creator Revenue

Meta's Plan to Use EU User Data for AI Training Raises Privacy Concerns

Meta Restarts AI Training Using Public Content from European Users

California Crosswalk Buttons Hacked to Mimic Musk and Zuckerberg's Voices

Meta's Llama-4-Maverick Plummets in Rankings, Raising Concerns of Benchmark Manipulation

Llama 4 Arrives on Vertex AI: Deploy Meta's New Model with One Click