The New Ruler of AI Image Generation! Open-Source Model FLUX.1 Emerges. Are Midjourney and DALL·E 3 Worried?

AIbase基地

Published in AI News · 5 min read · Aug 2, 2024

In artificial intelligence, major shifts can happen from one day to the next. Just one day after Midjourney's major update, the open-source image generation scene gained a remarkable dark horse: FLUX.1. The newcomer claims not only to significantly outperform closed-source models such as DALL·E 3 and Midjourney V6 but also to beat the entire open-source SD3 series, and it instantly set the AI community abuzz.

Let's first get to know the team behind FLUX.1. The model comes from Black Forest Labs, whose co-founder Robin Rombach is a leading expert in diffusion models. His notable works include VQGAN (introduced in the Taming Transformers paper) and Latent Diffusion. He previously served as Chief Scientist at Stability AI, where he led the globally renowned Stable Diffusion series. Robin Rombach can fairly be called a veteran among veterans in AI image generation.

In March of this year, amid internal turmoil at Stability AI, Robin chose to leave. After four months of incubation, he returned with the new open-source model family FLUX.1. Even more striking, the company debuted with $31 million in seed funding led by the prestigious venture capital firm Andreessen Horowitz, a strong vote of confidence in FLUX.1's future development.

So, what makes FLUX.1 stand out? First, it is built on a transformer-based diffusion architecture, is trained with flow matching, and uses rotary positional embeddings and parallel attention layers to improve model performance and hardware efficiency. The 12-billion-parameter model is released in three versions:

Pro Edition (FLUX.1 [pro]): Accessible via API, offering the strongest performance.

Dev Edition (FLUX.1 [dev]): A guidance-distilled model licensed for non-commercial use, inheriting most of the Pro Edition's capabilities.

Schnell Edition (FLUX.1 [schnell]): An open-source model that permits commercial use and still delivers impressive performance.
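
To put the 12-billion-parameter figure in perspective, here is a rough back-of-the-envelope sketch in Python. It assumes that weight memory is simply parameter count times bytes per parameter, and it ignores activations, the text encoders, and the VAE, so real usage will be higher. The function name is hypothetical and only for illustration.

```python
# Rough estimate of the memory needed just to hold model weights.
# Assumption: memory ~ parameter count x bytes per parameter; activations,
# the text encoders, and the VAE are not included.

BYTES_PER_PARAM = {"fp32": 4, "fp16": 2, "bf16": 2, "fp8": 1}


def weight_memory_gb(num_params: float, dtype: str) -> float:
    """Return the approximate weight footprint in gigabytes (10**9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


if __name__ == "__main__":
    flux_params = 12e9  # 12 billion parameters, as stated in the article
    for dtype in ("fp32", "fp16", "fp8"):
        print(f"{dtype}: ~{weight_memory_gb(flux_params, dtype):.0f} GB")
```

Under these assumptions, the weights alone come to roughly 24 GB at 16-bit precision, which helps explain why lower-precision variants matter for consumer GPUs.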

According to the FLUX.1 team's test data, even the open-source Schnell version surpasses mainstream models such as Midjourney v6.0, DALL·E 3 (HD), and SD3-Ultra in prompt adherence, image quality, motion consistency, coherence, and diversity. FLUX.1 shows a particularly clear advantage in rendering text inside images.

FLUX.1's ambitions clearly do not stop there. The team indicates that text-to-image generation is just the beginning, and they plan to introduce text-to-video models in the future, challenging top products like Sora, Gen-3, and Luma.

For developers and AI enthusiasts, the arrival of FLUX.1 is a significant boon. The Schnell version is fully open-source and is already supported by ComfyUI. If you have more than 36GB of GPU memory, you can even run the T5 fp16 version. Note, however, that the text encoders (t5xxl_fp16.safetensors and clip_l.safetensors) and the VAE need to be downloaded separately.
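
Since these files are downloaded separately, a quick pre-flight check can save a failed run. The following is a minimal sketch, not an official ComfyUI tool. The folder layout (`models/clip`, `models/vae`) and the VAE filename `ae.safetensors` are assumptions for illustration, and `missing_files` is a hypothetical helper; adjust the names to match your own installation.

```python
from pathlib import Path

# Hypothetical layout: models live under <root>/models/<subfolder>.
# The subfolder names and the VAE filename below are assumptions for
# illustration; adjust them to match your installation.
REQUIRED_FILES = {
    "clip": ["t5xxl_fp16.safetensors", "clip_l.safetensors"],
    "vae": ["ae.safetensors"],  # assumed VAE filename
}


def missing_files(comfy_root: str) -> list[str]:
    """Return relative paths of required files not found under comfy_root."""
    models = Path(comfy_root) / "models"
    missing = []
    for subfolder, names in REQUIRED_FILES.items():
        for name in names:
            if not (models / subfolder / name).is_file():
                missing.append(f"{subfolder}/{name}")
    return missing
```

Calling `missing_files("/path/to/ComfyUI")` returns an empty list when every expected file is in place, and otherwise lists what still needs to be downloaded.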

FLUX.1's sudden arrival brings new hope to open-source AI image generation and injects fresh vitality into the AI industry as a whole. Its strong performance and open-source nature are likely to accelerate both the adoption of and innovation in AI image generation. For ordinary users, this means we may soon be able to run, on home computers, image generation models that rival or even surpass Midjourney.

Project Link: https://github.com/black-forest-labs/flux

Try It Out: https://replicate.com/black-forest-labs/flux-pro

This article is from AIbase Daily
