HunyuanDiT-v1.1

A multi-resolution diffusion transformer that supports Chinese and English understanding

Premium New Product, Image, AI Image Generation, Multi-Modal Dialogue
HunyuanDiT-v1.1 is a multi-resolution diffusion transformer model developed by the Tencent Hunyuan team, with strong understanding of both Chinese and English prompts. It combines a carefully designed transformer architecture, text encoder, and positional encoding with a data pipeline built from scratch to support iterative data optimization. HunyuanDiT-v1.1 can carry out multi-turn, multi-modal dialogues, generating and refining images based on the conversation context. In a comprehensive evaluation by more than 50 professional human evaluators, HunyuanDiT-v1.1 set a new state of the art in Chinese-to-image generation among open-source models.

HunyuanDiT-v1.1 Visit Over Time

Monthly Visits: 20,899,836
Bounce Rate: 46.04%
Pages per Visit: 5.2
Visit Duration: 00:04:57
