LLaVA-3b
LLaVA-3b is a vision-language model fine-tuned from Dolphin 2.6 Phi in the LLaVA style, using the SigLIP 400M vision tower. It represents each image with multiple image tokens and takes its image features from the last layer of the vision encoder.
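The two design points above can be illustrated with a toy sketch. This is not the model's actual code: the dimensions, the single linear projector, and the names `project_image`, `make_linear`, `NUM_IMAGE_TOKENS`, `VISION_DIM` and `LLM_DIM` are all assumptions chosen for illustration. It only shows the general idea of taking the last layer's features from a vision encoder and projecting them into several image tokens that are placed in front of the text embeddings.

```python
import random

# Toy sizes, assumed for illustration only (not the real model's dimensions).
VISION_DIM = 8         # width of the vision encoder features
LLM_DIM = 4            # width of the language model embeddings
NUM_IMAGE_TOKENS = 3   # how many tokens each image is expanded into
NUM_LAYERS = 2         # number of vision encoder layers in this toy setup


def make_linear(in_dim, out_dim, rng):
    """Create random weights and a zero bias for a linear layer."""
    weight = [[rng.uniform(-0.1, 0.1) for _ in range(in_dim)] for _ in range(out_dim)]
    bias = [0.0] * out_dim
    return weight, bias


def linear(x, weight, bias):
    """Compute y = W x + b for one vector, using plain lists."""
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weight, bias)]


def project_image(hidden_states, projector, num_image_tokens, llm_dim):
    """Turn per-layer vision features into several image tokens.

    hidden_states holds one feature vector per encoder layer; only the
    last layer is used, matching the description above.
    """
    features = hidden_states[-1]
    weight, bias = projector
    flat = linear(features, weight, bias)
    # Split the flat projection into num_image_tokens vectors of size llm_dim.
    return [flat[i * llm_dim:(i + 1) * llm_dim] for i in range(num_image_tokens)]


rng = random.Random(0)
# Stand-in for the vision encoder output: one random vector per layer.
hidden_states = [[rng.uniform(-1, 1) for _ in range(VISION_DIM)] for _ in range(NUM_LAYERS)]
projector = make_linear(VISION_DIM, NUM_IMAGE_TOKENS * LLM_DIM, rng)

image_tokens = project_image(hidden_states, projector, NUM_IMAGE_TOKENS, LLM_DIM)
# Stand-in for two embedded text tokens from the prompt.
text_embeddings = [[0.0] * LLM_DIM for _ in range(2)]
# The language model would consume image tokens followed by text tokens.
sequence = image_tokens + text_embeddings
```

In this sketch a single image contributes `NUM_IMAGE_TOKENS` entries to the input sequence rather than one, which gives the language model more positions to attend to for each image.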
LLaVA-3b Visit Over Time
Monthly Visits: 27,175,375
Bounce Rate: 44.30%
Pages per Visit: 5.8
Visit Duration: 00:04:57