Stability AI Launches New AI Model Stable Fast 3D: Generate 3D Images in Half a Second with 1200 Times Speed Improvement

AIbase基地

Published inAI News · 5 min read · Aug 2, 2024

820

Once, generating 3D images was an extremely challenging task involving complex wireframes, software, and hardware. But now, the situation has changed dramatically. Stability AI recently announced a new generative AI technology called Stable Fast3D, which can quickly generate 3D images from a single image.

The most impressive part is, according to Stability AI, the new model can generate 3D images in just half a second. This processing speed is a significant leap compared to previous models, which might have taken minutes to produce similar results, while Stable Fast3D completes the same task at a staggering 1200 times the speed of its predecessors.

Back in March, Stability AI released Stable Video3D (SV3D), which took 10 minutes to generate 3D assets, and now Stable Fast3D has made significant progress.

Stability AI anticipates that this new model will have practical applications in multiple industries, including design, architecture, retail, virtual reality, and game development. Users can access the model through the Stable Assistant chatbot, the Stability AI API, and the community license Hugging Face.

Stable Fast3D Principles

Stable Fast3D did not start from scratch but evolved from the previous TripoSR model. In March, Stability AI partnered with 3D modeling supplier Trip AI to focus on creating rapid 3D asset generation technology.

In the research paper, researchers detailed the innovative working method of Stable Fast3D. At its core, Stable Fast3D uses an enhanced transformer network to generate high-resolution tri-planes, i.e., 3D volumetric representations, from input images. This network is designed to efficiently handle larger resolutions without significantly increasing computational complexity, thereby achieving finer detail capture and reducing aliasing artifacts.

Researchers also detailed an innovative method for material and lighting estimation. The material estimation network uses a novel probabilistic approach to predict global metalness and roughness values, resulting in improved image quality and consistency.

It is also noteworthy that the Stable Fast3D model can combine multiple elements required for 3D images (including meshes, textures, and material properties) into a compact, ready-to-use 3D asset.

Stability AI is perhaps best known for its Stable Diffusion text-to-image generation technology, but it has been researching 3D at least since November 2023. The March release of Stable Video3D improved the quality of 3D image generation and viewing experience. Moreover, last week the company announced Stable Video4D, adding a temporal dimension to short 3D video generation.

Technical Report: https://static1.squarespace.com/static/6213c340453c3f502425776e/t/66ab9814a3551056403508b4/1722521625313/SF3D-10.pdf

Key Points:
😃 Stability AI introduces Stable Fast3D technology, generating 3D images in half a second, far surpassing previous speeds.
👍 The new model has practical value in multiple industries, with various access methods available.
👏 Stability AI continues to lead the development of image generation technology, from 2D to 4D.

Founder of Neuracle Technologies Peng Lei Predicts Five Disruptive Trends in Brain-Computer Interface for the Next Five Years

At the 11th Innovation Annual Meeting of the 2025 Yabuli China Entrepreneurs Forum, Peng Lei, founder and chairman of Neuracle Technologies, deeply discussed the future development of brain-computer interface (BCI) technology and proposed five major new trends in this field over the next five years. These trends are expected to completely change people's lifestyles and the technological landscape. 1. Integration of Brain-Computer Interface and Spinal Cord: A Hope for Paralyzed Patients. Peng Lei pointed out that the integration of brain-computer interfaces with the spinal cord will be a major trend in the future. Since the brain and spinal cord are closely connected, spinal cord injuries in patients with high-level paralysis hinder the conduction of nerve signals. In the future,

E Ink Launches AI Touchpad: E-Paper Technology May Change the Way Laptops Are Interacted With

E Ink recently announced the development of a new touchpad for laptops, which uses the same e-paper technology as e-readers. This innovative product is not simply about increasing the size of the touchpad or adding secondary display features, but rather positioning it as a dedicated platform for AI applications and assistants, designed to run in parallel with mainstream operating systems. E Ink released a prototype image showing the upgraded touchpad, which is equipped with a color e-ink screen similar to the Amazon Kindle Color.

Open Source Revolution! Kyutai TTS Launches: Ultra-Low Latency Speech Synthesis, the New Era of AI Voice is Here!

Recently, the French AI laboratory Kyutai announced the official open source of its new text-to-speech model, Kyutai TTS, providing global developers and researchers with a high-performance, low-latency speech synthesis solution. This breakthrough release not only promotes the development of open-source AI technology but also opens up new possibilities for multilingual voice interaction applications. AIbase provides an exclusive analysis of this technological highlight and its potential impact. Ultra-low latency, a new experience in real-time interaction. Kyutai TTS has become an industry standout with its exceptional performance.

DeepMind introduces Crome: Enhancing the Alignment of Large Language Models with Human Feedback

In the field of artificial intelligence, reward models are a critical component for aligning large language models (LLMs) with human feedback, but existing models face the issue of "reward hacking." These models often focus on superficial features, such as the length or format of responses, rather than identifying genuine quality metrics, such as factual accuracy and relevance. The root cause lies in standard training objectives failing to distinguish between spurious associations and true causal drivers present in the training data. This failure leads to fragile reward models (RMs), which generate misaligned policies.

CoreWeave Launches NVIDIA's Latest AI Chip, Driving Innovation in Cloud Computing

Recently, NVIDIA and CoreWeave announced that NVIDIA's latest artificial intelligence graphics processor, Blackwell Ultra, has been commercially deployed on CoreWeave. This news undoubtedly injects new vitality into AI cloud computing services. Dell also stated that CoreWeave has received customized devices based on the NVIDIA GB300NVL72AI system, marking CoreWeave as the first to install a Blackwe

Former OpenAI Researcher Reveals: Signing with Meta Did Not Bring $100 Million Bonus

Recently, a former OpenAI researcher's remarks have sparked widespread attention. He stated that although Meta claimed to offer up to $100 million in signing bonuses when poaching research talent from OpenAI, he and his colleagues did not receive this bonus. This news has undoubtedly raised questions about Meta's hiring practices. Image source note: The image was generated by AI, and the image licensing service provider is Midjourney. This researcher is named Lucas Beyer, and he and his colleague are'

AI News

AI Daily

AI Timeline

Al Hardware

Latest Cases

Image Collection

Video Collection

Audio Collection

Content Collection

Latest Tutorials

AI Product Ranking

AI Traffic Growth Ranking

AI Traffic Decline Ranking

AI Weekly Ranking

United States

China

India

Brazil

Image Generation

Personal Assistant

Character Generation

Video Generation

AI Project Ranking

AI Project Growth Ranking

AI Developer Ranking

AI Organization Ranking

Deepseek

TTS

LLM

ChatGPT

Overview

Stability AI Launches New AI Model Stable Fast 3D: Generate 3D Images in Half a Second with 1200 Times Speed Improvement

AIbase基地

This article is from AIbase Daily

AI News Recommendations

Founder of Neuracle Technologies Peng Lei Predicts Five Disruptive Trends in Brain-Computer Interface for the Next Five Years

E Ink Launches AI Touchpad: E-Paper Technology May Change the Way Laptops Are Interacted With

Open Source Revolution! Kyutai TTS Launches: Ultra-Low Latency Speech Synthesis, the New Era of AI Voice is Here!

DeepMind introduces Crome: Enhancing the Alignment of Large Language Models with Human Feedback

MiniMax Launches the World's First Open-Source Large-Scale AI Model, Technological Breakthrough Attracts Industry Attention

CoreWeave Launches NVIDIA's Latest AI Chip, Driving Innovation in Cloud Computing

Google Veo 3 Video Generation Model Now Available to Pro/Ultra Subscribers, Will Add Photo-to-Video Function

Former OpenAI Researcher Reveals: Signing with Meta Did Not Bring $100 Million Bonus

Chip Design Company Ambiq Micro Applies for U.S. IPO, Benefiting from Market Demand Driven by Generative AI

Meta Tests AI Chatbot Active Feature Aimed at Enhancing User Engagement