Adobe Launches AI Sound Effect Generation System MultiFoley, Creating Synchronized Video Audio from Text Prompts

AIbase基地

Published in AI News · 5 min read · Dec 2, 2024

Adobe's research team recently collaborated with researchers from the University of Michigan to develop MultiFoley, an artificial intelligence system that generates sound effects for films and videos to aid post-production.

MultiFoley's innovation lies in letting users create sound effects from text prompts, reference audio, or video examples. In demonstrations, the system transforms a cat's meow into a lion's roar and converts the sound of a typewriter into piano notes, while keeping the audio synchronized with the video.

MultiFoley outputs full-bandwidth 48 kHz audio, which the researchers attribute to training on a combination of online videos and professional sound effect libraries. Unlike previous systems, it integrates multiple input methods (text, audio, and video references) into a single model for the first time. It analyzes visual features at 8 frames per second and upsamples them to 40 Hz to match the rate of the model's audio representation, so that the generated audio remains tightly synchronized with the video.
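The rate matching described above (8 frames per second brought up to 40 Hz) can be illustrated with a minimal sketch. The article does not say which interpolation MultiFoley uses, so the nearest-neighbor repetition here is an assumption, and the function name `upsample_features` and the toy feature values are hypothetical.

```python
from typing import List, Sequence

VIDEO_FPS = 8        # visual feature rate stated in the article
AUDIO_RATE_HZ = 40   # target rate stated in the article


def upsample_features(
    frames: Sequence[Sequence[float]],
    src_rate: int = VIDEO_FPS,
    dst_rate: int = AUDIO_RATE_HZ,
) -> List[List[float]]:
    """Repeat each per-frame feature vector so the sequence runs at dst_rate.

    Nearest-neighbor repetition is an assumption made for illustration;
    the article does not name the method MultiFoley actually uses.
    """
    if dst_rate % src_rate != 0:
        raise ValueError("dst_rate must be an integer multiple of src_rate")
    factor = dst_rate // src_rate  # 40 // 8 == 5 copies of each frame
    return [list(frame) for frame in frames for _ in range(factor)]


# Two seconds of hypothetical 2-dimensional features at 8 fps (16 frames).
feats = [[float(i), float(i) * 2] for i in range(16)]
aligned = upsample_features(feats)
print(len(aligned))  # 80 steps: two seconds at 40 Hz
```

After upsampling, every visual feature step lines up one-to-one with a 40 Hz audio step, which is what lets a model condition each audio step on the frame that is on screen at that moment.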

In testing, MultiFoley outperformed earlier systems at synchronizing audio with video and at matching sound effects to text descriptions. It achieved an average synchronization offset of 0.8 seconds, compared with typical offsets of more than one second for earlier systems. In user studies, 85.8% of participants judged MultiFoley better than the runner-up in semantic consistency, and 94.5% preferred its synchronization results.

Despite this potential, the research team pointed out some current limitations. The training dataset is relatively small, which restricts the variety of sound effects the system can produce, and the system struggles to generate multiple simultaneous sound effects. The team plans to release the source code and model soon.

Adobe has not yet announced plans to integrate MultiFoley into its products, but the technology fits well alongside the existing AI features in the Adobe Premiere Pro video editing software and could simplify sound design for individual creators and production companies.

Key Points:
🎬 MultiFoley is an AI sound effect generation system developed by Adobe in collaboration with the University of Michigan, capable of generating sound effects through various input methods.
🔊 The system outputs 48 kHz audio with an average synchronization offset of 0.8 seconds, surpassing earlier sound effect systems.
📈 User research indicates that MultiFoley received high ratings for both semantic consistency and synchronization of sound effects.

This article is from AIbase Daily

Created by the AIbase Daily Team
