llava-llama-3-8b-v1_1

A LLaVA model fine-tuned with XTuner that processes both images and text.

Premium, New Product, Programming, Artificial Intelligence, Multimodal Learning
llava-llama-3-8b-v1_1 is a LLaVA model fine-tuned by XTuner. It uses meta-llama/Meta-Llama-3-8B-Instruct as the language model and CLIP-ViT-Large-patch14-336 as the vision encoder, and was trained with the ShareGPT4V-PT and InternVL-SFT datasets. The model accepts images and text together as input, and it can be used with a range of downstream deployment and evaluation toolkits.
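As a rough illustration of what the vision encoder name implies, the sketch below computes the patch grid for a square input, reading the patch size (14) and input resolution (336) off the name CLIP-ViT-Large-patch14-336. The helper `patch_grid` is a hypothetical name introduced here, not part of XTuner or any library, and the number of image tokens the language model actually receives depends on how the model selects and projects the encoder features, which this listing does not describe.

```python
from typing import Tuple


def patch_grid(image_size: int, patch_size: int) -> Tuple[int, int]:
    """Return (patches_per_side, total_patches) for a square ViT input.

    Hypothetical helper for illustration only; it is plain arithmetic
    and does not call any model code.
    """
    if image_size <= 0 or patch_size <= 0:
        raise ValueError("image_size and patch_size must be positive")
    if image_size % patch_size != 0:
        raise ValueError("image_size must be divisible by patch_size")
    per_side = image_size // patch_size
    return per_side, per_side * per_side


# Values assumed from the encoder name: patch14 -> 14, 336 -> 336 pixels.
per_side, total = patch_grid(image_size=336, patch_size=14)
print(per_side, total)  # prints "24 576"
```

Under these assumptions, a 336x336 image is split into a 24x24 grid, so the encoder produces 576 patch embeddings per image before any projection into the language model's embedding space.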

llava-llama-3-8b-v1_1 Visits Over Time

Monthly Visits: 20,899,836

Bounce Rate: 46.04%

Pages per Visit: 5.2

Visit Duration: 00:04:57


llava-llama-3-8b-v1_1 Alternatives