Video-MME
The first comprehensive benchmark for evaluating the performance of Multi-Modal Large Language Models (MLLMs) in video analysis.
Common Product, Video, Multi-modal, Video Analysis
Video-MME is a benchmark for evaluating how Multi-Modal Large Language Models (MLLMs) perform in video analysis. It addresses a gap in existing evaluation methods, which have not adequately covered the ability of MLLMs to process continuous visual data, and gives researchers a high-quality, comprehensive evaluation platform. The benchmark covers videos of different lengths and evaluates core MLLM capabilities.
Video-MME Visits Over Time
Monthly Visits: 4452
Bounce Rate: 50.87%
Pages per Visit: 1.3
Visit Duration: 00:00:04