LLaVA-Mini

LLaVA-Mini is a large-scale multimodal model designed for efficient comprehension of images and videos.

CommonProductVideo\Image UnderstandingVideo Processing
A multimodal model developed by the ictnlp team that enhances performance with only one visual token. It is open source and free, suitable for scenarios requiring rapid and accurate understanding of visual content.
Visit

LLaVA-Mini Visit Over Time

Monthly Visits

490881889

Bounce Rate

37.92%

Page per Visit

5.6

Visit Duration

00:06:18

LLaVA-Mini Visit Trend

LLaVA-Mini Visit Geography

LLaVA-Mini Traffic Sources

LLaVA-Mini Alternatives