Open-MAGVIT2
Open-source autoregressive visual generation model project
Common Product, Image, Image Generation, Autoregressive Model
Open-MAGVIT2 is an open-source series of autoregressive image generation models released by Tencent ARC Lab, with models ranging from 300M to 1.5B parameters. The project reproduces Google's MAGVIT-v2 tokenizer and reports state-of-the-art reconstruction performance, with an rFID of 1.17 on ImageNet 256×256. It introduces asymmetric token factorization, which decomposes a large vocabulary into sub-vocabularies of different sizes, and uses 'next sub-token prediction' to strengthen interaction between sub-tokens and improve generation quality. All models and code are open source, with the aim of advancing innovation and creativity in autoregressive visual generation.
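The idea of decomposing a large vocabulary into sub-vocabularies of different sizes can be illustrated with plain index arithmetic. The sketch below is not the project's code. The function names `factorize` and `merge` are hypothetical, and the 18-bit vocabulary split into a 6-bit and a 12-bit part is an assumed configuration chosen for illustration. It shows only how one token index maps to two smaller sub-token indices and back, not how the model predicts them.

```python
# Illustrative sketch only: names and bit widths are assumptions,
# not the Open-MAGVIT2 implementation.

def factorize(token_id, low_bits=12, total_bits=18):
    """Split one token index into (high, low) sub-token indices.

    With the assumed defaults, a vocabulary of 2**18 entries becomes
    two sub-vocabularies of 2**6 and 2**12 entries.
    """
    if not 0 <= token_id < (1 << total_bits):
        raise ValueError("token_id out of range for the assumed vocabulary")
    high = token_id >> low_bits              # index into the smaller sub-vocabulary
    low = token_id & ((1 << low_bits) - 1)   # index into the larger sub-vocabulary
    return high, low


def merge(high, low, low_bits=12):
    """Recombine two sub-token indices into the original token index."""
    return (high << low_bits) | low


if __name__ == "__main__":
    high, low = factorize(123456)
    print(high, low)          # the two sub-token indices
    print(merge(high, low))   # recovers the original index
```

Under this assumed split, a model would predict two small classification targets per token (64 and 4,096 classes) instead of one target with 262,144 classes, which is the motivation for predicting sub-tokens in sequence.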
Open-MAGVIT2 Visits Over Time
Monthly Visits: 494,758,773
Bounce Rate: 37.69%
Pages per Visit: 5.7
Visit Duration: 00:06:29