Snap Inc.'s research team recently launched an AI image generator named SnapGen, capable of generating high-resolution images directly on high-end smartphones. This technology allows users to enjoy an efficient and convenient image creation experience on their phones, breaking the limitations of traditional AI image generation that requires powerful computing capabilities.

The core advantage of SnapGen lies in its compact and efficient model. Compared to popular image generators like SDXL, SnapGen has only 379 million parameters, about one-seventh of the latter. This compact design not only reduces storage space but also enhances operational speed. According to test results, SnapGen excels in matching images with text descriptions, scoring 0.66, surpassing SDXL's 0.55, demonstrating a significant quality advantage.

image.png

In terms of speed, SnapGen stands out remarkably. On the iPhone 16 Pro Max, the system can generate a high-quality image with a resolution of 1024×1024 pixels in about 1.4 seconds. This speed improvement allows users to experience almost no delay during the creative process, instantly enjoying the fun of generating images.

To achieve this series of performance enhancements, the research team systematically redesigned the network architecture, streamlining model parameters and latency while ensuring high-quality image generation. They specifically optimized the decoder section, making it 36 times smaller than similar systems. Additionally, to enable the performance of the small model to reach the level of larger models, the team drew on the learning methods of large AI systems like SD3 and SD3.5, developing a special training process that dynamically adjusts learning strategies based on task difficulty.

With the advent of SnapGen, AI image generation technology on mobile devices has ushered in a new breakthrough. In the future, users will experience faster and higher-quality image creation on their smartphones, further advancing the progress of social media content creation.