Meta has recently unveiled Movie Gen, an AI video generation model dubbed the "Metaverse version of Sora." The model can create high-quality videos with a single click, add voiceovers, edit and splice videos, and even transform personal photos into personalized videos.
The announcement was accompanied by a 92-page technical report, and Movie Gen's capabilities and architecture have drawn widespread attention across the industry.
Movie Gen Video: A Revolution in High-Definition Video Generation
Movie Gen consists of two core models: Movie Gen Video and Movie Gen Audio. Movie Gen Video, a 30-billion-parameter Transformer model, generates high-definition 1080p videos up to 16 seconds long at 16 frames per second from text prompts.
Key Features:
Text-to-Video: Create high-quality customized videos with simple text input
Video Editing: Precisely modify the style and content of existing videos
Personalized Videos: Transform personal photos into dynamic videos
Audio Generation: Add voiceovers, sound effects, and background music to videos
The model borrows its architectural design from Llama 3 and is trained with "flow matching," which Meta reports surpasses traditional diffusion training in video accuracy and detail rendering.
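Flow matching trains the generator to predict a velocity field that transports noise to data along a simple path, rather than learning to denoise step by step. The snippet below is a minimal, generic sketch of that objective in PyTorch, not Meta's actual training code; the network `model`, the linear interpolation path, and the tensor shapes are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def flow_matching_loss(model, x1):
    """One generic conditional flow-matching training step (illustrative only).

    x1: a batch of data samples, e.g. video latents of shape (B, ...).
    model(xt, t): any network that predicts a velocity field for sample xt at time t.
    """
    x0 = torch.randn_like(x1)                        # pure-noise endpoint of the path
    t = torch.rand(x1.shape[0], device=x1.device)    # per-sample time in [0, 1]
    t_b = t.view(-1, *([1] * (x1.dim() - 1)))        # reshape t to broadcast over sample dims
    xt = (1.0 - t_b) * x0 + t_b * x1                 # point on the straight noise-to-data path
    target_v = x1 - x0                               # velocity of that path (constant in t)
    pred_v = model(xt, t)                            # network's predicted velocity field
    return F.mse_loss(pred_v, target_v)
```

At inference, a sample is produced by integrating the learned velocity field from a noise draw toward data with an ODE solver, which is what distinguishes this setup from the step-by-step denoising of typical diffusion samplers.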
In the demonstrations, the videos generated by Movie Gen hold to a very high standard of image quality, lighting, and motion smoothness. Character faces remain stable, animal fur looks realistic, and background detail is rich. The audio generation is equally strong, producing background music that matches each scene's atmosphere and syncs accurately with on-screen action.
Movie Gen Audio: A Breakthrough in Synchronized Audio Generation
Movie Gen Audio is a 13-billion-parameter model that generates high-quality 48 kHz audio for videos. It can produce synchronized sound effects, create background music that matches the scene's atmosphere, and extend audio coherently across videos several minutes long.
Personalized Videos: Creating Unique Content
In terms of functionality, Movie Gen demonstrates remarkable diversity and flexibility. Users can generate customized videos with simple text input, edit the style and content of existing videos, and even upload personal photos to create unique personalized videos. These features make Movie Gen one of the most advanced media foundation models currently available.
Meta's demonstration videos range from a stormy mountain scene to a little girl flying a kite on the beach to a sloth wearing pink sunglasses, and each clip holds up to close scrutiny.
Even more astonishing is its ability to transform ordinary photos into dynamic videos, such as turning a photo of Zuckerberg into a fitness video.
Technically, Movie Gen incorporates several innovations:
Transformer architecture based on Llama 3
Flow matching training method to enhance video quality
Multi-stage training process to optimize performance
Llama 3-assisted prompt rewriting to improve generation quality (a minimal sketch of this idea follows the list)
Innovative video editing and audio expansion techniques
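Prompt rewriting is conceptually simple: a language model expands a terse user prompt into a richly detailed one before it reaches the video model. The sketch below illustrates the idea only; the instruction text and the `rewrite_llm` callable are hypothetical stand-ins, not Meta's actual pipeline.

```python
from typing import Callable

# Hypothetical instruction for the rewriting model; the wording is illustrative.
REWRITE_INSTRUCTION = (
    "Rewrite the following short video prompt into a detailed description "
    "covering subject, motion, camera framing, lighting, and background, "
    "while preserving the original intent:\n\n"
)

def rewrite_prompt(user_prompt: str, rewrite_llm: Callable[[str], str]) -> str:
    """Expand a terse prompt with an instruction-following LLM before video generation."""
    return rewrite_llm(REWRITE_INSTRUCTION + user_prompt)
```

A common motivation for a step like this is to make inference-time prompts resemble the detailed captions a video model sees during training.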
Although Movie Gen is not yet publicly available, with a broader rollout expected next year, its announcement has already caused a significant stir in the industry. Some commentators believe the move not only gets ahead of OpenAI's Sora, which has likewise yet to launch publicly, but may also spur other companies to accelerate development of next-generation AI video technology.
Reference: https://x.com/AIatMeta/status/1842188252541043075
Official Website: https://ai.meta.com/research/movie-gen/