HunyuanCaptioner
AI model for generating high-quality image descriptions
Premium, New Product, Image, image description, text generation
HunyuanCaptioner is an image captioning (image-to-text) model based on LLaVA. It generates accurate text descriptions of images, covering object descriptions, object relationships, background information, and image style. It supports single-image and multi-image inference in both Chinese and English, and can be demoed locally through Gradio.
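As a rough illustration of what a local Gradio demo for a captioning model could look like, here is a minimal sketch. It is not HunyuanCaptioner's own demo code: `caption_image` is a hypothetical placeholder that returns a stub string where a real model call would go, and `build_demo` and the "en"/"zh" language switch are assumptions made for this example. Only the Gradio components used (`gr.Interface`, `gr.Image`, `gr.Radio`, `gr.Textbox`) are real library calls.

```python
from typing import Any


def caption_image(image: Any, language: str = "en") -> str:
    """Hypothetical stand-in for a captioning model call.

    A real implementation would pass the image to the model and return
    its description; this stub only reports the image size.
    """
    if image is None:
        # Gradio passes None when no image has been uploaded.
        return ""
    width, height = image.size  # PIL images expose (width, height) as .size
    return f"[placeholder {language} caption for a {width}x{height} image]"


def build_demo():
    """Wrap caption_image in a simple Gradio interface (assumed layout)."""
    import gradio as gr  # imported lazily so the stub works without Gradio

    return gr.Interface(
        fn=caption_image,
        inputs=[
            gr.Image(type="pil", label="Image"),
            gr.Radio(choices=["en", "zh"], value="en", label="Language"),
        ],
        outputs=gr.Textbox(label="Caption"),
    )

# To start the local web UI (requires `pip install gradio`):
# build_demo().launch()
```

Swapping the body of `caption_image` for an actual model call would leave the interface code unchanged, which is the main reason to keep the two functions separate.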
HunyuanCaptioner Visits Over Time
Monthly Visits: 20,899,836
Bounce Rate: 46.04%
Pages per Visit: 5.2
Visit Duration: 00:04:57