The OpenCompass team at Shanghai Artificial Intelligence Laboratory, together with ModelScope, recently announced significant updates to their large model evaluation platform, Compass Arena, introducing a new multi-modal large model competition section called Compass Multi-Modal Arena. This section gives users a platform to try out and compare several mainstream multi-modal large models side by side, helping them find the model that best suits their needs.
The official website of Compass Multi-Modal Arena and the ModelScope page are now open to the public, offering a simple interface where users upload an image and enter a question. The system then has two anonymous multi-modal large models each generate an answer from the same input. Users judge the quality of the generated content and vote for the model they believe performed better; only after voting are the names of the two models revealed.
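The announcement does not describe how these pairwise votes are scored, but arena-style comparison platforms commonly aggregate them with an Elo-style rating update. The sketch below is a minimal illustration of that general technique; the model names, starting ratings, and K-factor are assumptions for demonstration, not details from the platform.

```python
def expected_score(r_a: float, r_b: float) -> float:
    """Probability that model A beats model B under the Elo model."""
    return 1.0 / (1.0 + 10 ** ((r_b - r_a) / 400))

def update(ratings: dict, winner: str, loser: str, k: float = 32.0) -> None:
    """Shift both ratings toward the observed vote outcome."""
    e_w = expected_score(ratings[winner], ratings[loser])
    ratings[winner] += k * (1.0 - e_w)
    ratings[loser] -= k * (1.0 - e_w)

# Illustrative: two anonymous models start at the same rating;
# one user vote for "model_A" nudges the ratings apart.
ratings = {"model_A": 1000.0, "model_B": 1000.0}
update(ratings, winner="model_A", loser="model_B")
print(ratings)  # model_A rises, model_B falls by the same amount
```

Because each vote only adjusts two ratings, this scheme scales naturally to many models compared in random anonymous pairs, which is why it is a common choice for crowd-voted leaderboards.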
The platform also includes a built-in question bank for users who have no image to upload. The bank focuses on subjective visual question-answering tasks, such as meme understanding, art appreciation, and photography appreciation, and is designed to assess how well multi-modal large models perform on subjective tasks and how users experience them.
Compass Multi-Modal Arena Official Website:
https://opencompass.org.cn/arena?type=multimodal
ModelScope Page:
https://modelscope.cn/studios/opencompass/CompassArena
HuggingFace Page:
https://huggingface.co/spaces/opencompass/CompassArena
OpenCompass Multi-Modal Evaluation Tool Open Source Link:
https://github.com/open-compass/VLMEvalKit