The emergence of models like Stable Diffusion marks significant progress in the field of image generation. However, their fundamental differences from autoregressive language models hinder the development of unified language-vision models. To address this issue, researchers have introduced Meissonic, which elevates non-autoregressive masked image modeling (MIM) text-to-image techniques to a level comparable with state-of-the-art diffusion models like SDXL.

At the core of Meissonic is a series of architectural innovations, advanced positional encoding strategies, and optimized sampling conditions that significantly enhance the performance and efficiency of MIM. Additionally, Meissonic leverages high-quality training data, integrates micro-conditions informed by human preference scores, and employs feature compression layers, further improving image fidelity and resolution.
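To make the non-autoregressive decoding concrete, below is a minimal sketch of MaskGIT-style iterative sampling, the family of techniques that MIM text-to-image models build on: each step predicts all masked tokens in parallel and keeps only the most confident ones. The function signature, cosine schedule, and tensor shapes are illustrative assumptions, not Meissonic's actual implementation.

```python
import math
import torch

@torch.no_grad()
def mim_sample(transformer, tokens=None, num_tokens=1024, vocab_size=8192,
               mask_id=8192, steps=16, device="cuda"):
    """MaskGIT-style iterative decoding: start from (partially) masked
    tokens and, at each step, keep the most confident predictions while
    re-masking the rest. `transformer` is a hypothetical model mapping
    token ids to logits over the image vocabulary; text conditioning is
    assumed to be handled inside it."""
    if tokens is None:  # pure generation: begin with every token masked
        tokens = torch.full((1, num_tokens), mask_id, device=device)
    total_masked = int((tokens == mask_id).sum())
    for step in range(steps):
        logits = transformer(tokens)              # (1, N, vocab_size)
        conf, pred = logits.softmax(-1).max(-1)   # per-token confidence
        is_masked = tokens == mask_id
        tokens = torch.where(is_masked, pred, tokens)
        # Positions fixed in earlier steps get infinite confidence so
        # they are never re-masked; only fresh predictions compete.
        conf = torch.where(is_masked, conf,
                           torch.full_like(conf, float("inf")))
        # Cosine schedule: the masked fraction shrinks to zero by the end.
        num_remask = int(total_masked * math.cos(math.pi / 2 * (step + 1) / steps))
        if num_remask > 0:
            remask = conf.topk(num_remask, largest=False, dim=-1).indices
            tokens.scatter_(1, remask, mask_id)
    return tokens  # discrete VQ indices; a VQ decoder maps them to pixels
```

Because all tokens are refined in parallel, the whole image emerges in a fixed, small number of steps, which is the source of MIM's efficiency advantage over token-by-token autoregressive decoding.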


Unlike large diffusion models such as SDXL and DeepFloyd-XL, Meissonic, with only 1 billion parameters, can generate high-quality images at a resolution of 1024×1024 and run on consumer-grade GPUs with just 8GB of VRAM, without the need for any additional model optimizations. Moreover, Meissonic can easily generate images with solid color backgrounds, which typically require model fine-tuning or noise offset adjustments in diffusion models.
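For readers who want to try it, here is a hypothetical usage sketch. The `MeissonicPipeline` class, import path, and checkpoint id are assumptions styled after diffusers-like pipelines, so consult the project repository for the actual entry point; half precision is what makes the 8GB VRAM budget plausible for a 1B-parameter model.

```python
import torch
from meissonic import MeissonicPipeline  # hypothetical import path

# Load in fp16: roughly 2 GB of weights for 1B parameters, leaving
# headroom for activations on an 8 GB consumer GPU.
pipe = MeissonicPipeline.from_pretrained(
    "MeissonFlow/Meissonic",   # assumed checkpoint id
    torch_dtype=torch.float16,
).to("cuda")

image = pipe(
    prompt="a red fox sitting on a solid white background",
    height=1024,
    width=1024,
).images[0]
image.save("fox.png")
```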

To achieve efficient training, Meissonic's training process is divided into four meticulously designed stages:

Stage One: Understanding basic concepts from vast data. Meissonic utilizes the curated LAION-2B dataset, training at a resolution of 256×256 to learn foundational concepts.

Stage Two: Aligning text and images with long prompts. The training resolution is increased to 512×512, using high-quality synthetic image-text pairs and internal datasets to enhance the model's ability to understand long descriptive prompts.

Stage Three: Mastering feature compression for higher-resolution generation. By introducing feature compression layers, Meissonic transitions seamlessly from 512×512 to 1024×1024 generation, trained on carefully selected high-quality, high-resolution image-text pairs (a minimal sketch of one such compression layer follows this list).

Stage Four: Optimizing high-resolution aesthetic image generation. In this final stage, the model is fine-tuned with a smaller learning rate, and human preference scores are incorporated as micro-conditions to further improve the quality of generated images.
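Stage three's feature compression admits a simple realization: fold each 2×2 patch of the latent feature grid into a single token before the transformer, so the sequence length at 1024×1024 matches what the model saw at 512×512. The sketch below shows one such layer; the dimensions and the PixelUnshuffle-based design are assumptions, and Meissonic's actual layer may differ.

```python
import torch
from torch import nn

class FeatureCompression(nn.Module):
    """Minimal sketch: fold each 2x2 patch of latent features into one
    token before the transformer, and unfold afterwards, so the token
    count at 1024x1024 matches what the model saw at 512x512.
    Dimensions are illustrative; Meissonic's actual layer may differ."""
    def __init__(self, dim: int):
        super().__init__()
        self.down = nn.Sequential(nn.PixelUnshuffle(2),  # (B, 4*dim, H/2, W/2)
                                  nn.Conv2d(4 * dim, dim, 1))
        self.up = nn.Sequential(nn.Conv2d(dim, 4 * dim, 1),
                                nn.PixelShuffle(2))       # (B, dim, H, W)

    def compress(self, x: torch.Tensor) -> torch.Tensor:
        return self.down(x)

    def decompress(self, x: torch.Tensor) -> torch.Tensor:
        return self.up(x)

feats = torch.randn(1, 256, 64, 64)      # 64x64 latent grid at 1024px (assumed)
layer = FeatureCompression(dim=256)
compressed = layer.compress(feats)       # -> (1, 256, 32, 32): 4x fewer tokens
restored = layer.decompress(compressed)  # -> (1, 256, 64, 64)
```

The practical payoff is that self-attention cost, which grows quadratically with sequence length, stays at the 512×512 level even when generating at 1024×1024.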

Through a series of quantitative and qualitative evaluations, including the HPS, MPS, and GenEval benchmarks as well as GPT-4o-based assessments, Meissonic demonstrates strong performance and efficiency. Compared with DALL-E 2 and SDXL, it achieves competitive results in human preference and text alignment while being substantially smaller and more efficient.

Furthermore, Meissonic excels at zero-shot image-to-image editing. On the EMU-Edit dataset, it leads across seven operations: background changes, content alterations, style shifts, object removal, object addition, localized modifications, and color/texture changes, all without training or fine-tuning on any image-editing data or instruction sets.
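This editing ability falls out of the same masked decoding loop used for generation: tokenize the source image, replace only the tokens in the region to be edited with the mask id, and let iterative decoding refill them under the edit prompt. Below is a minimal sketch reusing the hypothetical `mim_sample` loop from above; `vq_encode`, `edit_region`, and the mask id are likewise assumptions.

```python
import torch

def mim_edit(transformer, vq_encode, source_image, edit_region,
             mask_id=8192):
    """Zero-shot region edit: tokenize the source image, mask only the
    edit region, and let `mim_sample` (sketched above) refill it. All
    callables here are hypothetical stand-ins for Meissonic's components."""
    tokens = vq_encode(source_image)                   # (1, N) token ids
    tokens = tokens.masked_fill(edit_region, mask_id)  # mask the edit region
    # Tokens outside the region are frozen by the decoding loop, so the
    # unedited content is reproduced exactly; the edit prompt conditions
    # the transformer.
    return mim_sample(transformer, tokens=tokens, mask_id=mask_id)
```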

Project Link: https://github.com/viiika/Meissonic

Paper Link: https://arxiv.org/pdf/2410.08261