DeepSeek-VL2-Small

An advanced large-scale mixture-of-experts (MoE) visual language model.

Common Product · Image · Visual Question Answering · Optical Character Recognition
DeepSeek-VL2 is a series of advanced large-scale mixture-of-experts (MoE) visual language models that significantly improves on its predecessor, DeepSeek-VL. The series demonstrates exceptional capabilities across a range of tasks, including visual question answering, optical character recognition, document/table/chart understanding, and visual localization. It comprises three variants, DeepSeek-VL2-Tiny, DeepSeek-VL2-Small, and DeepSeek-VL2, with 1 billion, 2.8 billion, and 4.5 billion active parameters respectively. DeepSeek-VL2 achieves competitive or state-of-the-art performance against existing dense and MoE-based open-source models that have a similar or larger number of active parameters.

DeepSeek-VL2-Small Visit Over Time

Monthly Visits: 20,899,836
Bounce Rate: 46.04%
Pages per Visit: 5.2
Visit Duration: 00:04:57
