Byte Loopy's Lip Sync Feature Goes Live on J梦, Matching Expressions and Emotions Based on Context

AIbase基地

Published inAI News · 6 min read · Sep 23, 2024

2.3k

Remember the ByteDance project Loopy that left everyone in awe when it was first released earlier this month? This lip-syncing project, which perfectly matches the digital human's voice with the visuals, expressions, and emotions, has officially launched on Jimeng.

AIbase tested it out and the results are quite impressive, making it the best lip-syncing service currently available for Chinese language.

In the past, lip-sync videos often had a common flaw: the mouth movements seemed to match the audio, but the voice never quite felt like it belonged to the person speaking, creating a sense of disconnection when watching such videos.

ByteDance, in collaboration with Zhejiang University's research team, has developed a video diffusion model called LOOPY, based on audio-driven technology, which perfectly addresses this issue.

Unlike traditional lip-syncing where characters merely move their mouths, Loopy enables characters in lip-sync videos to express appropriate tones, emotions, and facial expressions in the context of speaking or singing. It can precisely "direct" every subtle movement of the virtual character, such as sighs, emotional eyebrow and eye movements, and natural head movements.

Currently, this feature has been integrated into Jimeng's video generation module:

AIbase uploaded a photo of a girl to test it out,

Jimeng currently offers two lip-syncing options:

1. Text-to-Speech

文本朗读.jpg

The operation on Jimeng is straightforward: simply upload the image or video of the character you want to lip-sync, input the text, and choose a voice. Here, AIbase selected a cool,御姐-style voice, and the test results are as follows:

As you can see, the character exhibits subtle facial expressions while speaking, and the dynamic details like nasolabial folds appear quite realistic.

2. Upload Local Audio

Moreover, you can not only make her speak but also upload a singing audio to make her sing:

对口型，图片+本地配音.jpg

Here, AIbase chose a popular excerpt from a recent Douyin video to see the results:

The results are truly impressive, not only are the lip movements accurate, but the voice doesn't feel disjointed, as if it's the girl's original voice.

However, there was a small issue: the girl in the photo chosen by AIbase wasn't looking at the viewer, which might not create a strong sense of immersion. Let's try a more direct angle:

Isn't that much better? And while the character is singing, she also exhibits very realistic actions like closing her eyes and shaking her head.

AIbase also tested a male version, and the results are as follows:

Isn't the effect stunning? What surprised AIbase the most is that it also considers very subtle details like the Adam's apple and eyebrows, making the overall video more realistic.

Feel free to experience it yourself~

Jimeng Product Entry: https://top.aibase.com/tool/jimeng

Loopy Lip Sync ByteDance AIbase

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team

AI News Recommendations

Tavus Releases SOTA Lip-Sync Model Hummingbird-0: Revolutionizing Zero-Shot Lip Synchronization

AI video research company Tavus recently released Hummingbird-0, its latest zero-shot lip synchronization model, hailed as the current State-of-the-Art (SOTA) in lip-sync technology. The model is now available for research preview on the Tavus platform, API, and FAL, generating significant interest within the AI content creation field. Hummingbird-0 represents a breakthrough in zero-shot lip synchronization, built upon Tavus' flagship model Phoenix-

Apr 25, 2025

ByteDance Launches Vidi, a Multimodal Model Leading the Trend in Ultra-Long Video Understanding and Editing

Apr 23, 2025

430

ByteDance Releases Efficient Pre-training Length Scaling Technology, Breaking Through Long Sequence Training Bottlenecks

Apr 23, 2025

210

ByteDance Restructures AI Product Line: Cat Box Leadership Change, Xinghui Merged into Doubao, Focusing on Growth

According to LatePost, ByteDance recently made significant adjustments to its AI product department, Flow. The social companionship AI product, Cat Box, has a new leader. The previous head, Liang Chenqi, has left the company, and has been replaced by Xi Yuan (codename), the former head of Xinghui. Meanwhile, the Xinghui team, which develops AI camera and image generation applications, is slated to merge into the Doubao App, under the unified management of Doubao App's head, Lu You (codename). The Flow department is headed by Zhu Jun and includes Doubao, Cat Box, Xinghui, Doubao Aixue, and G.

Apr 23, 2025

140

ByteDance Research Open-Sources ChatTS-14B: Native Understanding and Reasoning Over Time

ByteDance Research has announced the open-sourcing of ChatTS-14B, a 14-billion parameter large language model (LLM) specifically designed for understanding and reasoning with time series data. Released under the Apache2.0 license, ChatTS-14B's open-source release has garnered significant attention within the AI community, marking a substantial advancement in the intersection of time series analysis and generative AI. ChatTS-14B: An Intelligent Conversational Engine for Time Series. ChatTS-14B is based on Qwen2.5-1...

Apr 21, 2025

920

Coze Space Officially Opens Beta Testing, Supporting MCP Extension Integration

ByteDance's technology team announced that its new AI collaborative workspace, "Coze Space", is officially opening beta testing. Coze Space aims to be the optimal place for users to collaborate with AI Agents, providing comprehensive services ranging from answering questions to solving problems, helping users work more efficiently.

Apr 19, 2025

980

BMW Brilliance and ByteDance's Volcano Engine Partner to Drive AI-Powered Automotive Marketing

Recently, BMW Brilliance Lynk & Co Digital Information Technology Co., Ltd. (Lynk & Co) and ByteDance's Volcano Engine have partnered to innovate automotive marketing services with the help of Artificial Intelligence (AI) technology. This collaboration leverages AI to achieve precise product matching and purchase recommendations, optimize content guidance, and enhance the user car-buying experience and dealer operational efficiency. BMW Group President and CEO in Greater China, Gao Xiang, stated that AI is key to BMW's creation of smarter and more considerate mobility solutions, and is being rapidly integrated into R&D, production, supply chain, product, service, and operations.

Apr 18, 2025

250

ByteDance Releases UI-TARS-1.5: Open-Source Multimodal Agent Leading a New Wave in GUI Automation

ByteDance has officially released UI-TARS-1.5 on the Hugging Face platform, an open-source multimodal agent built upon a powerful vision-language model. This release marks another significant breakthrough for ByteDance in the field of AI automated interaction, providing developers and users with a highly efficient and intelligent cross-platform GUI (Graphical User Interface) automation solution. UI-TARS-1.5: A New Benchmark for Multimodal Agents. UI-TARS-1.5 is the latest in ByteDance's UI-TARS series...

Apr 18, 2025

1.2k

ByteDance Doubao Open-Source Seed Agent Model UI-TARS-1.5

The ByteDance Doubao large model team announced the open-sourcing of UI-TARS-1.5, an open-source multimodal agent built on a vision-language model capable of efficiently executing various tasks in a virtual world. The model achieved state-of-the-art (SOTA) performance on seven typical GUI (Graphical User Interface) benchmark evaluations and demonstrated, for the first time, its long-term reasoning capabilities in games and interactive capabilities in open spaces. This open-source project marks a significant advancement in multimodal agent technology for GUIs.

Apr 18, 2025

1.0k

AI Daily: ByteDance Releases Doubao 1.5 Deep Thinking Model; WeChat Launches Yuanbao, its First AI Assistant; OpenAI Releases o4-mini and a Full-Blooded o3

Welcome to the 【AI Daily】column! Your daily guide to exploring the world of artificial intelligence. We present you with the hottest AI news, focusing on developers and helping you understand technology trends and innovative AI product applications. Discover new AI products here: https://top.aibase.com/1、OpenAI released two multimodal reasoning models, o4-mini and a full-blooded o3. OpenAI showcased its latest multimodal models, o4-mini and a full-blooded o3, during a technical livestream.

Apr 17, 2025

770

AI News

AI Daily

AI Timeline

Al Hardware

Latest Cases

Image Collection

Video Collection

Audio Collection

Content Collection

Latest Tutorials

AI Product Ranking

AI Traffic Growth Ranking

AI Traffic Decline Ranking

AI Weekly Ranking

United States

China

India

Brazil

Image Generation

Personal Assistant

Character Generation

Video Generation

AI Project Ranking

AI Project Growth Ranking

AI Developer Ranking

AI Organization Ranking

Deepseek

TTS

LLM

ChatGPT

Overview

Byte Loopy's Lip Sync Feature Goes Live on J梦, Matching Expressions and Emotions Based on Context

AIbase基地

This article is from AIbase Daily

AI News Recommendations

Tavus Releases SOTA Lip-Sync Model Hummingbird-0: Revolutionizing Zero-Shot Lip Synchronization

ByteDance Launches Vidi, a Multimodal Model Leading the Trend in Ultra-Long Video Understanding and Editing

ByteDance Releases Efficient Pre-training Length Scaling Technology, Breaking Through Long Sequence Training Bottlenecks

ByteDance Restructures AI Product Line: Cat Box Leadership Change, Xinghui Merged into Doubao, Focusing on Growth

ByteDance Research Open-Sources ChatTS-14B: Native Understanding and Reasoning Over Time

Coze Space Officially Opens Beta Testing, Supporting MCP Extension Integration

BMW Brilliance and ByteDance's Volcano Engine Partner to Drive AI-Powered Automotive Marketing

ByteDance Releases UI-TARS-1.5: Open-Source Multimodal Agent Leading a New Wave in GUI Automation

ByteDance Doubao Open-Source Seed Agent Model UI-TARS-1.5

AI Daily: ByteDance Releases Doubao 1.5 Deep Thinking Model; WeChat Launches Yuanbao, its First AI Assistant; OpenAI Releases o4-mini and a Full-Blooded o3