Stable Diffusion 3 Model Release: Architecture Details Revealed, Is It Helpful for Reproducing Sora?

机器之心

Published inAI News · 2 min read · Mar 6, 2024

The Stable Diffusion 3 model has been released, utilizing the same DiT architecture as Sora, with significant improvements in quality. The authors claim that Stable Diffusion 3 outperforms other text-to-image generation systems, with parameter sizes ranging from 800M to 8B. The SD3 architecture is based on a collaboration between core Sora developers and an assistant professor from New York University, and it employs the MMDiT architecture, which surpasses UViT and DiT. Stable Diffusion 3 incorporates the Rectified Flow (RF) formula, and the authors' proposed reweighted RF variant continues to enhance performance. The model has undergone extensive research, utilizing a flexible text encoder for improvements, and has been compared against other models in terms of performance.

Stable Diffusion 3 DiT Architecture Text-to-Image Generation

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team

AI News Recommendations

Google Launches Eighth-Generation TPU and Gemini Enterprise Proxy Platform, Redefining Enterprise Infrastructure

Google unveils 'Agent Enterprise' infrastructure at Cloud Next 26, reshaping AI architecture to advance competition into an era of autonomous agents. Key updates include splitting the 8th-gen TPU into dedicated training (TPU8t) and inference-optimized (TPU8i) versions, revolutionizing compute scalability.....

Apr 23, 2026

180

Google Gemini Deeply Integrated with Google Photos, Supporting Personalized AI Image Generation from Album

Google AI assistant Gemini launches new features, allowing users to generate personalized AI images by accessing the Google Photos photo library. Users don't need to provide detailed descriptions of appearance; the AI will automatically complete details based on real images, making the generated characters highly consistent with reality.

Apr 23, 2026

160

Qwen AI PPT Major Upgrade: Agent Architecture Empowers Full-Process Automated Creation

Qwen AI PPT completed a major upgrade of the "PPT Agent" on April 22nd, adopting a new agent architecture to achieve full-process automated creation, from content concepting, material retrieval to visual formatting. After users input their requirements, a standard downloadable PPT file can be generated within 1-3 minutes, and supports batch uploading of up to 10 files (including documents), significantly improving work efficiency and quality.

Apr 22, 2026

210

Generation Z's Shift in Attitude Toward AI: Rising Awareness of Risks and Falling Enthusiasm

Generation Z's attitude toward AI has become more complex: they acknowledge its core role in the future, but are concerned about technology getting out of control. Surveys show that since 2025, enthusiasm for AI among young people has dropped by 14%, while anger and anxiety have increased, reflecting dual pressures in the workplace and on campus.

Apr 22, 2026

160

GPT-Image 2 Officially Released: The Image Model with Thinking Capabilities Makes Its Debut

OpenAI launches GPT-Image-2 (ChatGPT Images 2.0), introducing 'thinking' capability for image generation models. It enhances image quality and compliance through logical reasoning and deep understanding, surpassing traditional tools reliant on data patterns.....

Apr 22, 2026

260

Debut: Quantum Computing Power Embraces AI - Domestic Superconducting Quantum Computer Wukong Achieves Significant Breakthrough

China's third-gen superconducting quantum computer 'Origin Wukong' now supports AI operations, integrating domestic quantum computing into AI ecosystems and marking a milestone in 'quantum+AI' synergy.....

Apr 21, 2026

280

OpenAI Makes Another Big Move! New Image Model to Be Launched, Complex Chart Generation Capabilities May See a Breakthrough

OpenAI is launching a new image model focused on enhancing understanding and generation of complex structures and professional charts, optimizing performance in challenging visual tasks.....

Apr 21, 2026

330

Another Breakthrough in Domestic Large Models: Qwen3.6-35B-A3B is Officially Open Sourced, Focused on High Efficiency and Multimodal Thinking

The domestic AI model Qwen3.6-35B-A3B is officially open-sourced, using a hybrid expert architecture. It has a total of 35 billion parameters but activates only 3 billion during inference, achieving 'winning with small strength' high efficiency performance, significantly reducing computing costs.

Apr 20, 2026

300

Moonshot AI Collaborates with Tsinghua University to Launch PrfaaS Architecture, Breaking the Bottleneck of Large Model Computing Power

The efficiency of large language model inference has made a breakthrough. Tsinghua University and Moonshot AI jointly proposed a new architecture called "Prefill-as-a-Service," which splits the inference process into two stages: prefilling and decoding, and optimizes the allocation of computing resources, effectively solving hardware limitations and significantly improving model service performance.

Apr 20, 2026

210

Lingguang Launches the Next Generation of Flash Applications: Let Everyone Have a Coding Agent

Ant Lingguang App upgrades and launches the "Lingguang Circle," creating a consumer-level Coding Agent. Building on the "30-Second App Generation" feature, it enhances multi-agent collaboration, full-modal generation, and mobile integration, becoming the first platform to allow users to create, distribute, use, and iterate AI applications on their smartphones using natural language, achieving 0-code, 0-deployment, and 0-barrier creation. Currently, users have created over 30 million flash applications.

Apr 20, 2026

280

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

GEO Brand Visibility

AI Visibility Audit

AI Search Visibility Checker

GEO Promotion Link Detection

GEO Ranking Optimization System

GEO Ranking Optimization

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

LLM API Hub

AI Models Finder

Model Providers

LLM Leaderboard

Compare LLMs

LLM Cost Calculator

LLM Arena

AI Model Compatibility Checker

AI Deployment Calculator

Stable Diffusion 3 Model Release: Architecture Details Revealed, Is It Helpful for Reproducing Sora?

机器之心

This article is from AIbase Daily

AI News Recommendations

Google Launches Eighth-Generation TPU and Gemini Enterprise Proxy Platform, Redefining Enterprise Infrastructure

Google Gemini Deeply Integrated with Google Photos, Supporting Personalized AI Image Generation from Album

Qwen AI PPT Major Upgrade: Agent Architecture Empowers Full-Process Automated Creation

Generation Z's Shift in Attitude Toward AI: Rising Awareness of Risks and Falling Enthusiasm

GPT-Image 2 Officially Released: The Image Model with Thinking Capabilities Makes Its Debut

Debut: Quantum Computing Power Embraces AI - Domestic Superconducting Quantum Computer Wukong Achieves Significant Breakthrough

OpenAI Makes Another Big Move! New Image Model to Be Launched, Complex Chart Generation Capabilities May See a Breakthrough

Another Breakthrough in Domestic Large Models: Qwen3.6-35B-A3B is Officially Open Sourced, Focused on High Efficiency and Multimodal Thinking

Moonshot AI Collaborates with Tsinghua University to Launch PrfaaS Architecture, Breaking the Bottleneck of Large Model Computing Power

Lingguang Launches the Next Generation of Flash Applications: Let Everyone Have a Coding Agent

AI News Recommendations

Google Launches Eighth-Generation TPU and Gemini Enterprise Proxy Platform, Redefining Enterprise Infrastructure

Google Gemini Deeply Integrated with Google Photos, Supporting Personalized AI Image Generation from Album

Qwen AI PPT Major Upgrade: Agent Architecture Empowers Full-Process Automated Creation

Generation Z's Shift in Attitude Toward AI: Rising Awareness of Risks and Falling Enthusiasm

GPT-Image 2 Officially Released: The Image Model with Thinking Capabilities Makes Its Debut

Debut: Quantum Computing Power Embraces AI - Domestic Superconducting Quantum Computer Wukong Achieves Significant Breakthrough

OpenAI Makes Another Big Move! New Image Model to Be Launched, Complex Chart Generation Capabilities May See a Breakthrough

Another Breakthrough in Domestic Large Models: Qwen3.6-35B-A3B is Officially Open Sourced, Focused on High Efficiency and Multimodal Thinking

Moonshot AI Collaborates with Tsinghua University to Launch PrfaaS Architecture, Breaking the Bottleneck of Large Model Computing Power

Lingguang Launches the Next Generation of Flash Applications: Let Everyone Have a Coding Agent