BAAI (Zhiyuan Research Institute) Launches FlagEval Large Model Arena, a Model Battle Evaluation Service That Includes Text-to-Video

AIbase基地

Published in AI News · 3 min read · Sep 5, 2024

On September 4, 2024, the Beijing Academy of Artificial Intelligence (BAAI) announced the launch of the FlagEval Large Model Arena, which it bills as the world's first model battle evaluation service to include text-to-video capabilities.

The service is open to users and covers approximately 40 large models from China and abroad. It supports custom online or offline evaluations for four major tasks: language question answering, multimodal image-text understanding, text-to-image generation, and text-to-video generation. Beyond evaluations on preset questions covering simple understanding, knowledge application, coding ability, and reasoning ability, the arena introduces a subjective-preference ladder scoring system intended to reveal performance differences between models more precisely.

Battles are run anonymously to keep the process fair. Users can take part through the web portal or through a mobile entry point, described as the first of its kind in China. Scoring results are published in real time and feed an arena leaderboard that shows how each model performs in battles.
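The article does not say how FlagEval turns battle outcomes into leaderboard scores. As an illustration only, the sketch below uses Elo ratings, a common choice for arena-style pairwise comparisons; the function names, the starting rating of 1000, the K-factor of 32, and the model names are all assumptions made for this example, not details of BAAI's system.

```python
from collections import defaultdict


def update_elo(ratings, winner, loser, k=32.0):
    """Apply one Elo update for a single battle (assumed scoring rule)."""
    r_win, r_lose = ratings[winner], ratings[loser]
    # Probability the winner was expected to win, given current ratings.
    expected_win = 1.0 / (1.0 + 10 ** ((r_lose - r_win) / 400.0))
    # An upset (low expected_win) moves ratings more than an expected result.
    delta = k * (1.0 - expected_win)
    ratings[winner] = r_win + delta
    ratings[loser] = r_lose - delta


def build_leaderboard(battles, initial=1000.0):
    """Replay (winner, loser) pairs in order and return models sorted by rating."""
    ratings = defaultdict(lambda: initial)
    for winner, loser in battles:
        update_elo(ratings, winner, loser)
    return sorted(ratings.items(), key=lambda item: item[1], reverse=True)


# Hypothetical votes: each tuple is (preferred model, other model).
battles = [
    ("model_a", "model_b"),
    ("model_a", "model_c"),
    ("model_b", "model_c"),
]
board = build_leaderboard(battles)
for name, rating in board:
    print(f"{name}: {rating:.1f}")
```

Because each update adds to the winner exactly what it subtracts from the loser, the total rating across models stays constant, and only the relative ordering carries meaning.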

BAAI stated that it will open-source the full chain of data from model battle evaluations to support the development of the large model evaluation ecosystem. The launch of the FlagEval Large Model Arena extends BAAI's work on model evaluation techniques and tooling, giving AI researchers and practitioners new tools for testing and evaluating models.

Try it: https://flageval.baai.ac.cn/#/home

Tags: FlagEval, large model, artificial intelligence, text-to-video

This article is from AIbase Daily
