Artificial Intelligence (AI) driven text-to-image (T2I) generation models, such as DALL·E 3 and Adobe Firefly 3, demonstrate exceptional generative capabilities with broad potential in real-world applications. However, these models typically have billions of parameters and require substantial memory, which poses a serious challenge for deployment on resource-constrained platforms such as mobile devices.
To address these challenges, researchers from ByteDance and POSTECH explored extremely low-bit quantization of T2I models. Among the many advanced models available, FLUX.1-dev was chosen as the research target because of its public availability and strong performance. The researchers applied 1.58-bit quantization to compress the vision transformer weights in the FLUX model, restricting each weight to one of just three values: {-1, 0, +1} (hence 1.58 bits, since log2(3) ≈ 1.58). The quantization does not require access to any image data and relies solely on self-supervision from the FLUX.1-dev model itself. Unlike BitNet b1.58, this approach does not train a large language model from scratch; instead, it is a post-training quantization solution for T2I models.
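The article does not spell out the exact quantization recipe, but a common way to obtain ternary weights post-training is absmean scaling in the spirit of BitNet b1.58: divide each weight tensor by its mean absolute value, then round to {-1, 0, +1}. The PyTorch sketch below illustrates this idea; the function names and per-tensor scaling granularity are assumptions made for illustration, not the authors' implementation.

```python
import torch

def ternary_quantize(w: torch.Tensor, eps: float = 1e-8):
    """Post-training ternary (1.58-bit) quantization sketch.

    Scales a weight tensor by its mean absolute value (absmean, as in
    BitNet b1.58), then rounds to the three values {-1, 0, +1}.
    Returns the ternary codes and the scale needed to dequantize.
    NOTE: illustrative only; not the authors' exact recipe.
    """
    scale = w.abs().mean().clamp(min=eps)        # per-tensor absmean scale
    q = (w / scale).round().clamp(-1, 1)         # ternary codes in {-1, 0, +1}
    return q.to(torch.int8), scale

def ternary_dequantize(q: torch.Tensor, scale: torch.Tensor):
    """Reconstruct an approximate full-precision tensor: w ≈ q * scale."""
    return q.float() * scale

# Example: quantize a random "weight matrix" and measure the error.
w = torch.randn(256, 256)
q, s = ternary_quantize(w)
w_hat = ternary_dequantize(q, s)
print("unique codes:", q.unique().tolist())       # [-1, 0, 1]
print("relative error:", (w - w_hat).norm() / w.norm())
```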
With this method, the model's storage footprint was reduced by 7.7 times, since the 1.58-bit weights are stored as 2-bit signed integers instead of 16-bit values. To further improve inference efficiency, the researchers developed a custom kernel optimized for low-bit computation, which reduced inference memory usage by more than 5.1 times and also improved inference latency. Evaluations on the GenEval and T2I-CompBench benchmarks showed that 1.58-bit FLUX significantly improves computational efficiency while maintaining generation quality comparable to the full-precision FLUX model.
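Storing each ternary code as a 2-bit signed integer means four weights fit in one byte; the reported 7.7 times figure (rather than a naive 8 times from 16-bit to 2-bit) is consistent with a small fraction of the parameters being kept at higher precision. Below is a minimal packing sketch, assuming a simple four-codes-per-byte layout rather than the kernel's actual memory format:

```python
import torch

def pack_ternary(q: torch.Tensor) -> torch.Tensor:
    """Pack ternary codes {-1, 0, +1} into 2-bit fields, four per byte.

    Codes are shifted to {0, 1, 2} so they fit in 2-bit slots.
    Assumes q.numel() is a multiple of 4 for brevity.
    """
    u = (q.flatten().to(torch.int16) + 1).view(-1, 4)   # {-1,0,1} -> {0,1,2}
    packed = u[:, 0] | (u[:, 1] << 2) | (u[:, 2] << 4) | (u[:, 3] << 6)
    return packed.to(torch.uint8)

def unpack_ternary(packed: torch.Tensor) -> torch.Tensor:
    """Recover ternary codes from packed bytes."""
    p = packed.to(torch.int16)
    cols = [(p >> shift) & 0b11 for shift in (0, 2, 4, 6)]
    return (torch.stack(cols, dim=1).flatten() - 1).to(torch.int8)

# Round-trip check on random ternary codes.
q = torch.randint(-1, 2, (1024,)).to(torch.int8)
assert torch.equal(unpack_ternary(pack_ternary(q)), q)

# Back-of-the-envelope storage ratio: 99.5% of weights at 2 bits,
# the remainder kept at 16 bits -> roughly the reported ~7.7x reduction.
ratio = 1.0 / (0.995 * (2 / 16) + 0.005 * 1.0)
print(f"approximate compression: {ratio:.1f}x")   # ~7.7x
```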
Specifically, the researchers quantized 99.5% of the vision transformer parameters (11.9 billion in total) in the FLUX model to 1.58 bits, substantially lowering storage requirements. Experimental results showed that 1.58-bit FLUX performs comparably to the original FLUX model on the T2I-CompBench and GenEval benchmarks. In terms of inference speed, the improvements from 1.58-bit FLUX were more pronounced on lower-performance GPUs such as the NVIDIA L20 and A10.
In summary, 1.58-bit FLUX marks a significant step toward deploying high-quality T2I models on devices with tight memory and latency budgets. Although 1.58-bit FLUX still has limitations in the extent of its speed gains and in rendering fine detail in high-resolution images, its potential for improving model efficiency and reducing resource consumption is expected to offer new insights for future research.
Key improvements summary:
Model Compression: Model storage space reduced by 7.7 times.
Memory Optimization: Inference memory usage reduced by over 5.1 times.
Performance Retention: 1.58-bit FLUX maintained performance comparable to the full-precision FLUX model on the GenEval and T2I-CompBench benchmarks.
No Image Data Required: The quantization process does not require access to any image data, relying solely on the model's self-supervision.
Custom Kernel: A custom kernel optimized for low-bit computation was adopted, enhancing inference efficiency (see the sketch after this list).
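As a rough illustration of what a low-bit linear layer must do at inference time, the self-contained PyTorch sketch below stores ternary codes plus a per-tensor scale and dequantizes on the fly during the forward pass. The actual custom kernel would instead fuse unpacking, scaling, and the matrix multiply; the class name and quantization details here are hypothetical.

```python
import torch
import torch.nn as nn

class TernaryLinear(nn.Module):
    """Illustrative 1.58-bit linear layer (not the authors' kernel).

    Stores ternary codes in {-1, 0, +1} plus one scale per weight matrix
    and dequantizes on the fly in forward(). A production kernel would
    fuse unpacking, scaling, and the matmul rather than materializing a
    full-precision weight copy.
    """

    def __init__(self, weight: torch.Tensor):
        super().__init__()
        scale = weight.abs().mean().clamp(min=1e-8)
        codes = (weight / scale).round().clamp(-1, 1).to(torch.int8)
        self.register_buffer("codes", codes)      # ternary codes
        self.register_buffer("scale", scale)      # per-tensor dequantization scale

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        w = self.codes.to(x.dtype) * self.scale   # dequantize: w ≈ codes * scale
        return x @ w.t()

# Usage: replace a full-precision layer and compare outputs.
full = nn.Linear(512, 512, bias=False)
quant = TernaryLinear(full.weight.data)
x = torch.randn(4, 512)
err = (full(x) - quant(x)).norm() / full(x).norm()
print(f"relative output error: {err.item():.3f}")
```

Avoiding the full-precision weight copy is where most of the inference-time memory savings would come from, which is why a fused low-bit kernel matters beyond storage compression.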
Project Page: https://chenglin-yang.github.io/1.58bit.flux.github.io/
Paper Link: https://arxiv.org/pdf/2412.18653
Model Link: https://huggingface.co/papers/2412.18653