Despite significant advances in the field of 3D digital humans, previous methods have struggled with multi-view consistency and limited emotional expressiveness. To address these challenges, a research team from Nanjing University, Fudan University, and Huawei Noah's Ark Lab has made a new breakthrough.


Project page: https://nju-3dv.github.io/projects/EmoTalk3D/

They have collected the EmoTalk3D dataset, which includes calibrated multi-view videos, emotional annotations, and frame-by-frame 3D geometry. They propose a new method for synthesizing 3D talking heads with controllable emotions, significantly improving lip synchronization and rendering quality.
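To make the dataset's contents concrete, below is a hypothetical layout for a single frame-level sample; the field names, shapes, and frame rate are illustrative assumptions rather than the released format.

```python
# Hypothetical per-frame sample layout for a multi-view, emotion-annotated
# talking-head dataset such as EmoTalk3D; all field names and shapes are assumptions.
import numpy as np

NUM_VIEWS = 8      # assumed number of calibrated cameras
H, W = 512, 512    # assumed image resolution
NUM_POINTS = 5000  # assumed size of the per-frame 3D geometry

sample = {
    "images": np.zeros((NUM_VIEWS, H, W, 3), dtype=np.uint8),  # synchronized multi-view frames
    "intrinsics": np.zeros((NUM_VIEWS, 3, 3)),                 # camera calibration (intrinsics)
    "extrinsics": np.zeros((NUM_VIEWS, 4, 4)),                 # camera poses (extrinsics)
    "audio": np.zeros(16000 // 25),                            # audio slice for this frame (16 kHz, 25 fps assumed)
    "emotion": "happy",                                        # sequence-level emotion annotation
    "geometry": np.zeros((NUM_POINTS, 3)),                     # frame-by-frame 3D facial shape
}
```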


By training on the EmoTalk3D dataset, the research team has built a mapping framework from "speech to geometry to appearance." It first predicts a faithful 3D geometry sequence from audio features, then synthesizes the appearance of the 3D talking head, represented by 4D Gaussians, from the predicted geometry. The appearance is further decomposed into canonical and dynamic Gaussians, which are learned from multi-view videos and fused to render free-viewpoint talking head animations.
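As a rough illustration of the canonical/dynamic decomposition described above, here is a minimal Python sketch that fuses a static canonical Gaussian set with per-frame dynamic residuals. The attribute layout (position, rotation, scale, opacity, color) and the additive/multiplicative fusion rules are common choices in Gaussian-splatting avatars and are assumptions here, not the paper's exact formulation.

```python
# Hypothetical fusion of canonical (static) and dynamic (per-frame) Gaussians.
from dataclasses import dataclass
import numpy as np

@dataclass
class GaussianSet:
    positions: np.ndarray   # (N, 3) Gaussian centers
    rotations: np.ndarray   # (N, 4) quaternions
    scales: np.ndarray      # (N, 3) per-axis scales
    opacities: np.ndarray   # (N, 1)
    colors: np.ndarray      # (N, 3) RGB (spherical harmonics in practice)

def fuse(canonical: GaussianSet, residual: GaussianSet) -> GaussianSet:
    """Fuse the learned canonical appearance with per-frame dynamic residuals."""
    return GaussianSet(
        positions=canonical.positions + residual.positions,
        rotations=canonical.rotations + residual.rotations,   # quaternion residual, renormalized in practice
        scales=canonical.scales * np.exp(residual.scales),    # multiplicative scale residual
        opacities=canonical.opacities + residual.opacities,
        colors=canonical.colors + residual.colors,
    )

# Toy usage: a canonical set plus zero residuals for one frame.
N = 4
zeros = lambda *s: np.zeros(s)
canonical = GaussianSet(zeros(N, 3), zeros(N, 4), np.ones((N, 3)), zeros(N, 1), zeros(N, 3))
residual = GaussianSet(zeros(N, 3), zeros(N, 4), zeros(N, 3), zeros(N, 1), zeros(N, 3))
fused = fuse(canonical, residual)
```

In this sketch a geometry-to-appearance network would predict one `residual` per frame from the audio-driven 3D point cloud, and the fused set would then be splatted into images.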

The model achieves controllable emotion in the generated talking heads and can render them over a wide range of viewpoints. It captures dynamic facial details such as wrinkles and subtle expressions while delivering improved rendering quality and more stable lip synchronization. In the example results, the generated 3D digital human accurately conveys happiness, anger, and frustration.

The overall process includes five modules:

1. An emotion-content decomposition encoder, which separates content and emotion features from the input speech.
2. A speech-to-geometry network, which predicts dynamic 3D point clouds from these features.
3. A Gaussian optimization and completion module, which establishes the canonical appearance.
4. A geometry-to-appearance network, which synthesizes the facial appearance from the dynamic 3D point clouds.
5. A rendering module, which renders the dynamic Gaussians into free-viewpoint animations.
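Putting the five modules together, the skeleton below sketches one plausible end-to-end data flow from input speech to rendered frames. Every function name, shape, and return value is a placeholder for illustration; this is not the authors' implementation.

```python
# Skeleton of the five-module data flow; every function is a stub standing in
# for a learned network or an optimization stage.
import numpy as np

T = 8            # toy number of video frames
N_GAUSS = 5000   # assumed number of Gaussians / points
ATTR = 14        # assumed per-Gaussian attribute size

def decompose_speech(audio):
    """1) Emotion-content decomposition encoder: split speech into content and emotion features."""
    content = np.zeros((T, 256))   # per-frame content features (assumed size)
    emotion = np.zeros(64)         # sequence-level emotion feature (assumed size)
    return content, emotion

def speech_to_geometry(content, emotion):
    """2) Speech-to-geometry network: predict a dynamic 3D point cloud for each frame."""
    return np.zeros((T, N_GAUSS, 3))

def optimize_canonical_gaussians(multiview_video):
    """3) Gaussian optimization and completion: build the canonical appearance from multi-view video."""
    return np.zeros((N_GAUSS, ATTR))

def geometry_to_appearance(point_clouds, canonical):
    """4) Geometry-to-appearance network: predict dynamic Gaussians and fuse them with the canonical set."""
    dynamic = np.zeros((T, N_GAUSS, ATTR))
    return canonical + dynamic

def render(frame_gaussians, camera):
    """5) Rendering module: splat one frame's fused Gaussians into a free-viewpoint image."""
    return np.zeros((512, 512, 3))

# End-to-end: audio -> (content, emotion) -> geometry -> fused 4D Gaussians -> rendered frames.
content, emotion = decompose_speech(audio=np.zeros(16000))
points = speech_to_geometry(content, emotion)
canonical = optimize_canonical_gaussians(multiview_video=None)
gaussians = geometry_to_appearance(points, canonical)
image = render(gaussians[0], camera=None)
```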


Additionally, they have established the EmoTalk3D dataset, a multi-view talking head dataset with frame-by-frame 3D facial shapes and emotional annotations, which will be made available to the public for non-commercial research purposes.

Key Points:

💥 Introduced a new method for synthesizing digital humans with controllable emotions.

🎯 Constructed a mapping framework from "speech to geometry to appearance."

👀 Established the EmoTalk3D dataset, to be released publicly for non-commercial research.