Video-MME

The first comprehensive benchmark for evaluating the performance of Multi-Modal Large Language Models (MLLMs) in video analysis.

Video-MME is a benchmark for evaluating how well Multi-Modal Large Language Models (MLLMs) analyze video. It addresses a gap in existing evaluation methods, which do not adequately measure how MLLMs process continuous visual data, and gives researchers a high-quality, comprehensive evaluation platform. The benchmark covers videos of different lengths and evaluates core MLLM capabilities.

Video-MME Visit Over Time

Monthly Visits: 3869
Bounce Rate: 46.22%
Pages per Visit: 1.1
Visit Duration: 00:00:18