InternVL2_5-1B

A large multimodal language model that supports image and text understanding.

CommonProductImageMultimodalLarge Language Model
InternVL 2.5 is a series of advanced multimodal large language models (MLLMs). Building on InternVL 2.0, it enhances training and testing strategies and improves data quality while maintaining its core model architecture. This model integrates the newly pre-trained InternViT with various pre-trained large language models (LLMs) such as InternLM 2.5 and Qwen 2.5, using a randomly initialized MLP projector. InternVL 2.5 supports multiple images and video data, employing a dynamic high-resolution training method to enhance its capability to handle multimodal data.
Visit

InternVL2_5-1B Visit Over Time

Monthly Visits

20899836

Bounce Rate

46.04%

Page per Visit

5.2

Visit Duration

00:04:57

InternVL2_5-1B Visit Trend

InternVL2_5-1B Visit Geography

InternVL2_5-1B Traffic Sources

InternVL2_5-1B Alternatives