Open-MAGVIT2

Open-source autoregressive visual generation model project

CommonProductImageImage GenerationAutoregressive Model
Open-MAGVIT2 is an open-source series of autoregressive image generation models released by Tencent ARC Lab, featuring models ranging from 300M to 1.5B parameters. This project reproduces Google's MAGVIT-v2 tokenizer and achieves state-of-the-art reconstruction performance with a rFID of 1.17 on the ImageNet 256×256 dataset. By introducing asymmetric tokenization techniques, it decomposes large vocabularies into sub-vocabularies of varying sizes and enhances inter-token interaction through 'next sub-token prediction' to improve generation quality. All models and code are open-source, aimed at advancing innovation and creativity in the field of autoregressive visual generation.
Visit

Open-MAGVIT2 Visit Over Time

Monthly Visits

503747431

Bounce Rate

37.31%

Page per Visit

5.7

Visit Duration

00:06:44

Open-MAGVIT2 Visit Trend

Open-MAGVIT2 Visit Geography

Open-MAGVIT2 Traffic Sources

Open-MAGVIT2 Alternatives