A Stunning Debut! Nvidia Launches Open Source Image Generation Model Sana, 1 Second Image Generation, Supports Chinese, English, and Emoji

AIbase基地

Published in AI News · 4 min read · Jan 14, 2025


Recently, NVIDIA open-sourced an image generation model called Sana. With only 0.6 billion (600 million) parameters, it greatly lowers the barrier to entry.

Sana is reported to generate images at resolutions up to 4096×4096 and to run on a 16GB graphics card. It produces high-quality 1024×1024 images in under one second, which is fast compared with similar models.

Sana is built on DC-AE (Deep Compression Autoencoder), which compresses images by a factor of 32 along each side, so generation runs on a much smaller latent representation. It has reportedly been run on a setup of 8 GPUs, including the RTX 3090, allowing it to process complex images faster and more effectively. The 0.6B version of Sana is claimed to be competitive with Flux-12B while having only 1/20 of the parameters and running 100 times faster.
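To make these figures concrete, here is a minimal arithmetic sketch. It assumes the 32x figure refers to downsampling each image side by 32, and it uses 8x as the comparison point, which is an assumption based on the autoencoders commonly used in latent diffusion models rather than something stated in this article. The function names are hypothetical and are not part of Sana's code.

```python
from typing import Tuple


def latent_grid(height: int, width: int, downsample: int) -> Tuple[int, int]:
    """Return the latent grid size for an image and a per-side downsampling factor."""
    if height % downsample or width % downsample:
        raise ValueError("image sides must be divisible by the downsampling factor")
    return height // downsample, width // downsample


def latent_positions(height: int, width: int, downsample: int) -> int:
    """Return the number of spatial positions in the latent grid."""
    h, w = latent_grid(height, width, downsample)
    return h * w


# 1024x1024 image: 32x downsampling (the DC-AE figure) vs. an assumed 8x baseline.
positions_32x = latent_positions(1024, 1024, 32)
positions_8x = latent_positions(1024, 1024, 8)
reduction = positions_8x // positions_32x

# Parameter counts in millions, as quoted in the article (0.6B vs. 12B).
SANA_PARAMS_M = 600
FLUX_PARAMS_M = 12_000
param_ratio = FLUX_PARAMS_M // SANA_PARAMS_M

print(positions_32x, positions_8x, reduction, param_ratio)
```

Under these assumptions, a 1024×1024 image maps to a 32×32 latent grid instead of 128×128, which is 16 times fewer positions for the model to process, and the parameter counts quoted above work out to the stated 1/20 ratio.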

Interestingly, Sana supports prompts in English, Chinese, and emoji. Users can generate images in various styles from simple text prompts, from cyberpunk-style cats to athletic Shiba Inus in white T-shirts and even pirate ships in cosmic whirlpools, and Sana handles them well. Users can even input Chinese poetry to generate related artistic images. Sana also includes a basic safety mechanism: when inappropriate words are entered, the system automatically replaces them with a red heart symbol ❤️ to prevent the generation of unsuitable content.
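The word replacement described above can be illustrated with a simple keyword filter. This is only a sketch of the observable behavior: the article does not describe how Sana's safety check is implemented, and the blocklist and function name below are hypothetical.

```python
import re
from typing import Iterable

# Hypothetical blocklist for illustration only; not Sana's actual word list.
BLOCKED_WORDS = frozenset({"gore", "nude"})
HEART = "\u2764\ufe0f"  # red heart emoji


def sanitize_prompt(prompt: str, blocked: Iterable[str] = BLOCKED_WORDS) -> str:
    """Replace each blocked word (whole word, case-insensitive) with a red heart."""
    words = sorted(blocked)
    if not words:
        return prompt
    pattern = re.compile(
        r"\b(?:" + "|".join(re.escape(w) for w in words) + r")\b",
        re.IGNORECASE,
    )
    return pattern.sub(HEART, prompt)


print(sanitize_prompt("A cat playing on the grass, stars 🌟"))
print(sanitize_prompt("a Nude statue in a garden"))
```

A prompt without blocked words passes through unchanged, emoji included, while a blocked word is swapped for the heart before the prompt would reach the model.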

For example, using the prompt "A cat playing on the grass, stars 🌟," the generation speed is very fast, and the results are quite impressive.

Another example is the prompt "A cute 🐼 eating 🎋 in ink wash painting style," where the model can accurately recognize emojis.

It is worth mentioning that Sana has official ComfyUI support and comes with LoRA training tools, which makes it considerably more convenient to use. Interested readers can try it out for themselves.

Project link: https://nv-sana.mit.edu/

This article is from AIbase Daily
