Free for Personal Commercial Use! Stability AI Releases Stable Diffusion 3.5 Series Text-to-Image Models

AIbase基地

Published in AI News · 5 min read · Oct 23, 2024


Last night, Stability AI released its most powerful model to date—Stable Diffusion 3.5. This isn't just a single model; it's a comprehensive suite featuring three versions designed to cater to a diverse range of users, from researchers and hobbyists to startups and enterprises.

The three versions are Stable Diffusion 3.5 Large, Stable Diffusion 3.5 Large Turbo, and Stable Diffusion 3.5 Medium, which is scheduled for release on October 29.


Stable Diffusion 3.5 Large is the base model, with 8 billion parameters, and offers the highest image quality and prompt adherence in the series. It is aimed at professional use and generates images at resolutions of up to 1 megapixel (roughly 1 million pixels).

Stable Diffusion 3.5 Large Turbo is a distilled version of the former, capable of producing high-quality images in just four steps, significantly faster than Stable Diffusion 3.5 Large.

Stable Diffusion 3.5 Medium, with 2.5 billion parameters, uses an improved MMDiT-X architecture and training method, and is designed to run out of the box on consumer-grade hardware. It balances image quality with customizability and generates images at resolutions from 0.25 to 2 megapixels.
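To make the pixel budgets quoted above more concrete, the short sketch below converts a target pixel count into a square image size. The helper names and the choice to snap sizes to a multiple of 64 are illustrative assumptions, not something stated by Stability AI.

```python
import math

def megapixels(width, height):
    """Pixel count of an image, in millions of pixels."""
    return width * height / 1_000_000

def square_side(target_mp, multiple=64):
    """Side length of a square image close to target_mp megapixels.

    The side is snapped to a multiple of `multiple`; 64 is an assumed,
    illustrative value and not taken from the announcement.
    """
    side = math.sqrt(target_mp * 1_000_000)
    return max(multiple, round(side / multiple) * multiple)

# Print a square size for each pixel budget mentioned in the article.
for target in (0.25, 1.0, 2.0):
    side = square_side(target)
    print(f"{target} MP target: {side}x{side} = {megapixels(side, side):.2f} MP")
```

Non-square aspect ratios can be checked with the same `megapixels` helper by passing the width and height directly.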


These models were developed with customizability as a priority. Query-Key Normalization is integrated into the transformer blocks to stabilize training and to simplify further fine-tuning and development. To keep downstream tasks flexible, Stability AI has retained a broad knowledge base and diverse styles within the models, although this can make outputs less predictable.
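As a rough illustration of why normalizing queries and keys helps stability, here is a minimal single-head attention sketch in plain Python. It assumes RMS normalization without learned gains, which is one common form of Query-Key Normalization; it is not Stability AI's implementation, and all function names are hypothetical.

```python
import math

def rms_norm(vec, eps=1e-6):
    """Scale a vector so its root-mean-square is about 1 (no learned gain)."""
    rms = math.sqrt(sum(x * x for x in vec) / len(vec) + eps)
    return [x / rms for x in vec]

def attention_logits(queries, keys, qk_norm=True):
    """Scaled dot-product logits, optionally normalizing queries and keys first."""
    if qk_norm:
        queries = [rms_norm(q) for q in queries]
        keys = [rms_norm(k) for k in keys]
    scale = 1.0 / math.sqrt(len(keys[0]))
    return [[scale * sum(a * b for a, b in zip(q, k)) for k in keys]
            for q in queries]

def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attention(queries, keys, values, qk_norm=True):
    """Single-head attention over lists of vectors."""
    out = []
    for row in attention_logits(queries, keys, qk_norm):
        weights = softmax(row)
        out.append([sum(w * v[j] for w, v in zip(weights, values))
                    for j in range(len(values[0]))])
    return out
```

With normalization enabled, each logit is bounded by the square root of the head dimension regardless of how large the raw query and key activations grow, so the softmax cannot saturate purely because activations drift during training or fine-tuning.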

Stable Diffusion 3.5 models excel in several areas, including customizability, efficient performance, and diverse outputs. They can be easily fine-tuned to meet specific creative needs or built into applications tailored to custom workflows. Optimized for operation on standard consumer-grade hardware, they don't require high-end specifications. Additionally, these models can create images representative of the world without extensive prompts, and generate a wide range of styles and aesthetics, such as 3D, photography, painting, line art, and almost any imaginable visual style.

Stability AI also emphasizes its commitment to safety, implementing reasonable measures to prevent the misuse of Stable Diffusion 3.5 and focusing on integrity from the early stages of development. Moreover, the Stability AI community license is very permissive, allowing individuals and organizations to use the model for free for non-commercial purposes, including scientific research. Startups, SMEs, and creators with annual revenues under $1 million can also use the model for free for commercial purposes, retaining ownership of generated media without restrictive licensing constraints.

The Stable Diffusion 3.5 models are available on Hugging Face for self-hosting, and the inference code is open source. The models can also be accessed through platforms such as the Stability AI API, Replicate, ComfyUI, and DeepInfra.
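For readers who want to try self-hosting, the following is a minimal sketch of four-step generation with the Large Turbo model through the Hugging Face `diffusers` library. Several details are assumptions rather than facts from this article: the repository ID, the use of `bfloat16` on a CUDA GPU, and `guidance_scale=0.0`. Downloading the weights may also require accepting the license on Hugging Face and logging in first.

```python
# Assumed Hugging Face repository ID for the Large Turbo model.
MODEL_ID = "stabilityai/stable-diffusion-3.5-large-turbo"
# The article states that Large Turbo produces images in four steps.
TURBO_STEPS = 4

def generate_turbo(prompt, output_path="sd35_turbo.png"):
    """Generate one image with the Turbo model and save it to output_path."""
    # Heavy imports stay inside the function so this file can be imported
    # on machines without torch or diffusers installed.
    import torch
    from diffusers import StableDiffusion3Pipeline

    pipe = StableDiffusion3Pipeline.from_pretrained(
        MODEL_ID, torch_dtype=torch.bfloat16
    )
    pipe = pipe.to("cuda")  # assumes a CUDA-capable GPU with enough memory

    image = pipe(
        prompt,
        num_inference_steps=TURBO_STEPS,
        guidance_scale=0.0,  # assumed setting for the distilled model
    ).images[0]
    image.save(output_path)
    return output_path
```

Calling `generate_turbo("a watercolor painting of a lighthouse at dawn")` would download the weights on first use and write a PNG to the working directory. The non-distilled Large model would need more inference steps and a non-zero guidance scale.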

Try it online: https://huggingface.co/spaces/stabilityai/stable-diffusion-3.5-large



This article is from AIbase Daily
