Alibaba Releases EMO, a Portrait Video Generation Framework

OSCHINA (开源中国)

Published in AI News · 2 min read · Feb 29, 2024

The Alibaba team has released EMO, a portrait video generation framework that produces audio-driven portrait videos with rich facial expressions and varied head poses. EMO uses a reference network to extract features from the reference image and motion frames, encodes the audio with a pre-trained audio encoder to obtain embeddings, and combines multi-frame noise with a facial region mask to generate the video. Experimental results show that EMO outperforms existing methods in expressiveness and realism. The model could advance digital media and virtual content generation, but it could also be misused for criminal activity.

EMO Portrait Video Generation AI

This article is from AIbase Daily

Welcome to the [AI Daily] column, your daily guide to the world of artificial intelligence. Every day we cover hot topics in the AI field with a focus on developers, helping you follow technical trends and learn about innovative AI product applications.

Created by the AIbase Daily Team


