InternVL2_5-2B
A multimodal large language model that supports deep interaction between images and text.
CommonProductImageMultimodalLarge Language Model
InternVL 2.5 is an advanced series of multimodal large language models. Building on InternVL 2.0, it enhances training and testing strategies and improves data quality while maintaining its core architecture. This model integrates the newly pre-trained InternViT with various large language models, such as InternLM 2.5 and Qwen 2.5, utilizing a randomly initialized MLP projector. InternVL 2.5 supports multiple images and video data, employing dynamic high-resolution training methods to provide better performance when processing multimodal data.
InternVL2_5-2B Visit Over Time
Monthly Visits
20899836
Bounce Rate
46.04%
Page per Visit
5.2
Visit Duration
00:04:57