InternVL2_5-26B

A large multimodal language model that integrates visual and linguistic understanding.

CommonProductImageMultimodalLarge Language Model
InternVL2_5-26B is an advanced multimodal large language model (MLLM) developed based on InternVL 2.0. It has been further enhanced through significant training and testing strategies, as well as improvements in data quality. The model retains the core architecture of its predecessor, the 'ViT-MLP-LLM', while integrating the newly pre-trained InternViT along with various pre-trained large language models (LLMs) such as InternLM 2.5 and Qwen 2.5, utilizing randomly initialized MLP projectors. The InternVL 2.5 series models demonstrate exceptional performance in multimodal tasks, particularly in visual perception and multimodal capabilities.
Visit

InternVL2_5-26B Visit Over Time

Monthly Visits

20899836

Bounce Rate

46.04%

Page per Visit

5.2

Visit Duration

00:04:57

InternVL2_5-26B Visit Trend

InternVL2_5-26B Visit Geography

InternVL2_5-26B Traffic Sources

InternVL2_5-26B Alternatives