HunyuanCaptioner
AI model for generating high-quality image descriptions
PremiumNewProductImageimage descriptiontext generation
HunyuanCaptioner is a text-to-image technology model based on LLaVA, capable of generating highly accurate text descriptions for images, including object descriptions, object relationships, background information, and image style. It supports both Chinese and English single-image and multi-image reasoning, and can be locally demonstrated through Gradio.
HunyuanCaptioner Visit Over Time
Monthly Visits
19075321
Bounce Rate
45.07%
Page per Visit
5.5
Visit Duration
00:05:32