NVLM 1.0

A multimodal large language model that achieves state-of-the-art performance on vision-language tasks.

Common Product, Productivity, Multimodal Learning, Large Language Models
NVLM 1.0 is a family of multimodal large language models (LLMs) that achieves state-of-the-art results on vision-language tasks, on par with leading proprietary and open-access models. Notably, after multimodal training, NVLM 1.0 outperforms its own LLM backbone on text-only tasks. We have released the model weights and code as open source for the community.
NVLM 1.0 Visit Over Time

Monthly Visits: 34,332
Bounce Rate: 38.47%
Pages per Visit: 2.1
Visit Duration: 00:00:17

NVLM 1.0 Alternatives