InternVL2_5-8B

A multimodal large language model for joint understanding of images and text.

Common Product, Image, Multimodal, Large Language Model
InternVL2_5-8B is a multimodal large language model (MLLM) developed by OpenGVLab. It builds on InternVL 2.0 with improved training and testing strategies and higher-quality data. The model uses the 'ViT-MLP-LLM' architecture, which connects a newly pre-trained InternViT vision encoder to a pre-trained language model, such as InternLM 2.5 or Qwen 2.5, through a randomly initialized MLP projector. The InternVL 2.5 series performs strongly on multimodal tasks, including image and video understanding and multilingual comprehension.
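The 'ViT-MLP-LLM' data flow can be illustrated with a toy sketch in plain Python. This is not OpenGVLab's implementation: the dimensions, the ReLU activation, the helper names (`make_matrix`, `mlp_projector`, `build_llm_input`), and the choice to simply prepend the visual tokens to the text tokens are all assumptions made for illustration. Random matrices stand in for the InternViT output and for the language model's token embeddings.

```python
import random


def make_matrix(rows, cols, rng):
    """Return a randomly initialised (rows x cols) matrix as nested lists."""
    return [[rng.gauss(0.0, 0.02) for _ in range(cols)] for _ in range(rows)]


def matmul(x, w):
    """Multiply an (n x d_in) matrix by a (d_in x d_out) matrix."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*w)] for row in x]


def relu(x):
    """Element-wise ReLU (a simplification chosen for this sketch)."""
    return [[max(0.0, v) for v in row] for row in x]


def mlp_projector(patch_features, w1, w2):
    """Two-layer MLP that maps vision features to the LLM embedding width."""
    return matmul(relu(matmul(patch_features, w1)), w2)


def build_llm_input(patch_features, text_embeddings, w1, w2):
    """Project the image patches, then prepend them to the text embeddings."""
    visual_tokens = mlp_projector(patch_features, w1, w2)
    return visual_tokens + text_embeddings


rng = random.Random(0)

# Toy sizes only; the real model's dimensions are far larger.
VIT_DIM, HIDDEN_DIM, LLM_DIM = 8, 16, 12
NUM_PATCHES, NUM_TEXT_TOKENS = 4, 3

patch_features = make_matrix(NUM_PATCHES, VIT_DIM, rng)       # stand-in for ViT output
text_embeddings = make_matrix(NUM_TEXT_TOKENS, LLM_DIM, rng)  # stand-in for LLM token embeddings
w1 = make_matrix(VIT_DIM, HIDDEN_DIM, rng)                    # randomly initialised projector weights
w2 = make_matrix(HIDDEN_DIM, LLM_DIM, rng)

sequence = build_llm_input(patch_features, text_embeddings, w1, w2)
print(len(sequence), len(sequence[0]))  # 7 tokens, each of width 12
```

The point of the sketch is the shape bookkeeping: the projector is the only component that has to reconcile the vision encoder's feature width with the language model's embedding width, which is why it can start from random weights while the other two parts are pre-trained.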

InternVL2_5-8B Visit Over Time

Monthly Visits: 26,103,677

Bounce Rate: 43.69%

Pages per Visit: 5.5

Visit Duration: 00:04:43

InternVL2_5-8B Visit Trend

InternVL2_5-8B Visit Geography

InternVL2_5-8B Traffic Sources

InternVL2_5-8B Alternatives