LLaVA-Mini
LLaVA-Mini is a large-scale multimodal model designed for efficient comprehension of images and videos.
CommonProductVideo\Image UnderstandingVideo Processing
A multimodal model developed by the ictnlp team that enhances performance with only one visual token. It is open source and free, suitable for scenarios requiring rapid and accurate understanding of visual content.
LLaVA-Mini Visit Over Time
Monthly Visits
490881889
Bounce Rate
37.92%
Page per Visit
5.6
Visit Duration
00:06:18