AI Ranking

2024-11-12 14:12:18.AIbase

Baidu Releases First Chinese Large Model AI Glasses: 45g Lightweight Design, 56 Hours of Battery Life

2024-09-30 14:08:02.AIbase

BAAI Launches the World’s First Chinese Large Model Debate Platform, FlagEval Debate

The Beijing Academy of Artificial Intelligence (BAAI) has recently launched FlagEval Debate, which it describes as the world’s first Chinese large model debate platform. The platform evaluates large language models by having them debate one another, offering a new way to measure their capabilities. It extends the FlagEval large model arena evaluation service and is designed to surface capability differences among large language models.

2023-12-12 16:20:29.AIbase

Zhipu AI Releases Chinese LLM Alignment Evaluation Benchmark AlignBench

Zhipu AI has released AlignBench, an evaluation benchmark for Chinese large models. AlignBench evaluates, across multiple dimensions, how well a model aligns with human intentions. The dataset is divided into 8 major categories, including knowledge Q&A, writing generation, role-playing, and various other types of questions. Developers can use AlignBench for evaluation and score results with a robust scoring model. Users can submit their evaluations by logging into the AlignBench website.

2023-08-29 10:09:08.AIbase

August Rankings! SuperCLUE Releases Latest Rankings for Chinese Large Model Evaluation Benchmark

SuperCLUE has released its August rankings for Chinese large models. The release comprises 5 separate rankings covering 16 selected general-purpose large language models, evaluated on 3,337 new test questions. The performance gap between domestic large models and GPT-3.5 on Chinese tasks is narrowing.