InternVL2_5-4B-MPO-AWQ
A multimodal large language model designed to enhance image and text interaction capabilities.
CommonProductImageMultimodalLarge Language Model
InternVL2_5-4B-MPO-AWQ is a multimodal large language model (MLLM) focused on improving performance in image and text interaction tasks. Based on the InternVL2.5 series and further enhanced through Mixed Preference Optimization (MPO), it can handle a variety of inputs, including single images, multiple images, and video data, making it suitable for complex tasks requiring an understanding of both image and text interactions. With its exceptional multimodal capabilities, InternVL2_5-4B-MPO-AWQ offers a powerful solution for image-to-text and text-to-image tasks.
InternVL2_5-4B-MPO-AWQ Visit Over Time
Monthly Visits
20899836
Bounce Rate
46.04%
Page per Visit
5.2
Visit Duration
00:04:57