Stability AI Launches a Generative Model for Multi-View Video Conversion: Stable Video 4D

AIbase基地

Published inAI News · 3 min read · Jul 25, 2024

206

Recently, Stability AI announced the launch of a revolutionary video processing technology—Stable Video4D. This technology can transform a single-perspective video into new videos from eight different angles, providing creators with unprecedented flexibility and creativity.

Stable Video4D is built on the foundation of the company's previous Stable Video Diffusion model. Unlike converting images into videos, the new model can accept video inputs and generate multiple new-perspective video outputs, achieving a significant leap from image-based video generation to full 3D dynamic video synthesis.

When used, users only need to upload a video and specify the desired 3D camera positions, and Stable Video4D can generate eight new-perspective videos, providing a full range of multi-angle views. Currently, the model can generate eight perspectives of a 5-frame video in about 40 seconds, with the entire 4D optimization process taking approximately 20-25 minutes.

Compared to previous methods, Stable Video4D can simultaneously generate multiple new-perspective videos, significantly improving consistency in both spatial and temporal axes. This not only ensures consistency of objects across multiple perspectives and timestamps but also achieves a more lightweight 4D optimization framework.

Stability AI stated that Stable Video4D is currently in the research phase and is expected to be widely applied in areas such as game development, video editing, and virtual reality in the future. The company is actively optimizing the model to handle a broader range of real-world videos.

Stable Video4D is now available on the Hugging Face platform. Stability AI looks forward to further enhancing the potential of this technology to create realistic multi-angle videos through continuous research and development. The company will continue to collaborate with researchers, experts, and the community to drive technological innovation and continuously improve model performance.

Model URL: https://huggingface.co/stabilityai/sv4d

Black Forest Shocks Open Source FLUX.1 Kontext [dev]: Image Editing Comparable to GPT-4o

Black Forest Labs officially announced that its new image editing model FLUX.1Kontext [dev] is now open source, drawing widespread attention from the AI community. As the latest member of the FLUX.1 series, this model is praised as an open-source alternative comparable to GPT-4o, thanks to its powerful image editing capabilities and efficient performance. FLUX.1Kontext [dev] is based on a 1.2 billion parameter flow matching transformer architecture, specifically designed for image editing tasks, and supports consumer-grade hardware.

Google Launches Imagen4: Breaking the Text-to-Image Generation Bottleneck, Gemini API Empowers Text-to-Image

Recently, Google officially launched its latest text-to-image model **Imagen4** through the Gemini API, marking an important milestone in the field of generative AI (AIGC). According to Google's official blog and community feedback, Imagen4 has achieved breakthroughs in generating text within images, solving a long-standing technical bottleneck in AIGC, and providing developers with a tool for creating high-quality visual content. It is reported that the model comes in two versions: **Imagen4** and **Imagen4Ultra**, with respective pricing details yet to be fully disclosed.

Image Giant Getty Images Reverses Core Copyright Lawsuit Against Stability AI, UK Case Continues

Recently, Getty Images announced in the London High Court that it has withdrawn its main copyright infringement allegations against Stability AI, further narrowing the focus of this closely watched legal battle. The core of this lawsuit revolves around how AI companies use copyrighted content to train their models. Image source note: The image is AI-generated, and the image licensing service is Midjourney. Although Getty Images' dismissal of the case did not end it, the company is still pursuing other allegations.

AI News

AI Daily

AI Timeline

Al Hardware

Latest Cases

Image Collection

Video Collection

Audio Collection

Content Collection

Latest Tutorials

AI Product Ranking

AI Traffic Growth Ranking

AI Traffic Decline Ranking

AI Weekly Ranking

United States

China

India

Brazil

Image Generation

Personal Assistant

Character Generation

Video Generation

AI Project Ranking

AI Project Growth Ranking

AI Developer Ranking

AI Organization Ranking

Deepseek

TTS

LLM

ChatGPT

Overview

Stability AI Launches a Generative Model for Multi-View Video Conversion: Stable Video 4D

AIbase基地

This article is from AIbase Daily

AI News Recommendations

New Open Source AI System OmniGen 2: Integrates Image and Text Generation Like GPT-4o

Memory Optimization! NVIDIA DLSS 4 Makes Games Smoother, Reducing VRAM by 20% with Transformer Model

OpenAI Releases New Model for Deep Research API: o3/o4-mini-deep research

Black Forest Shocks Open Source FLUX.1 Kontext [dev]: Image Editing Comparable to GPT-4o

Open Source Magic is Here! FLUX.1 Kontext [dev] Challenges GPT-4o, Bringing Image Editing into a New Era

Gaokao Volunteer Application Brings Heat to Kuaishou Deep Search, Each Student Uses It an Average of 4 Times

Google Launches Imagen4: Breaking the Text-to-Image Generation Bottleneck, Gemini API Empowers Text-to-Image

Image Giant Getty Images Reverses Core Copyright Lawsuit Against Stability AI, UK Case Continues

Breaking the Bottleneck of 3D Reconstruction SuperDec Empowers Robots and Content Generation

4D-LRM Launches with a Shocking Impact! AI Reconstructs Time and Space, Instantly Restore Any Perspective and Any Moment