Sana_600M_1024px
High-resolution, efficient text-to-image generation framework
Common Product, Image, Text-to-image, High resolution
Sana is a text-to-image generation framework developed by NVIDIA that can efficiently produce images at resolutions up to 4096×4096. Its key advantages are high-resolution output, fast synthesis, and strong text-image alignment, and it is light enough to be deployed on laptop GPUs. Sana_600M_1024px is a linear diffusion transformer (a text-to-image generative model) with roughly 600M parameters, as the model name indicates, designed to generate multi-scale images at a base resolution of 1024px. The source code is open and available on GitHub, and the model is released under the CC BY-NC-SA 4.0 license.
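To make "multi-scale images at a base resolution of 1024px" more concrete, here is a minimal sketch of how a caller might choose a width and height for a given aspect ratio while keeping the pixel count close to 1024×1024. The helper name pick_size and the choice to snap each side to a multiple of 32 are assumptions made for illustration; they are not taken from the Sana code base, and the project's own list of supported sizes may differ.

```python
import math
from typing import Tuple


def pick_size(aspect_ratio: float, base: int = 1024, multiple: int = 32) -> Tuple[int, int]:
    """Return (width, height) whose ratio is close to aspect_ratio and whose
    area is close to base * base.

    Hypothetical helper: the snapping multiple of 32 is an assumption, not a
    documented requirement of the model.
    """
    if aspect_ratio <= 0:
        raise ValueError("aspect_ratio must be positive")

    # Keep width * height near base * base while matching the requested ratio.
    scale = math.sqrt(aspect_ratio)
    width = base * scale
    height = base / scale

    def snap(value: float) -> int:
        # Round to the nearest multiple, but never go below one multiple.
        return max(multiple, int(round(value / multiple)) * multiple)

    return snap(width), snap(height)


if __name__ == "__main__":
    for name, ratio in [("square", 1.0), ("widescreen", 16 / 9), ("portrait", 9 / 16)]:
        print(name, pick_size(ratio))
```

The resulting width and height would then be passed to whichever inference pipeline is used to run the model; that step is left out here because it depends on the installed tooling.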
Sana_600M_1024px Visits Over Time
Monthly Visits: 20,899,836
Bounce Rate: 46.04%
Pages per Visit: 5.2
Visit Duration: 00:04:57