2024-09-30 14:08:02 | AIbase | 12.1k
BAAI Launches the World’s First Chinese Large Model Debate Platform, FlagEval Debate
The Beijing Academy of Artificial Intelligence (BAAI) has recently launched FlagEval Debate, the world’s first Chinese large model debate platform. The platform evaluates large language models by having them compete against each other in debates, offering a new way to measure their capabilities. It extends the FlagEval large model arena evaluation service and is designed to identify capability differences among large language models.
2023-12-12 16:20:29 | AIbase | 4.1k
Zhipu AI Releases Chinese LLM Alignment Evaluation Benchmark AlignBench
Zhipu AI has released AlignBench, an evaluation benchmark for Chinese large models. AlignBench evaluates how well models align with human intent across multiple dimensions. The dataset is divided into 8 major categories, including knowledge Q&A, writing generation, role-playing, and other question types. Developers can evaluate their models with AlignBench, with results scored by a robust scoring model. Users can submit their evaluations by logging into the AlignBench website.
2023-08-29 10:09:08 | AIbase | 887
August Rankings! SuperCLUE Releases Latest Rankings for Chinese Large Model Evaluation Benchmark
SuperCLUE has released its August rankings for Chinese large models. The release comprises 5 separate rankings covering 16 general-purpose large language models, evaluated on 3,337 new test questions. The performance gap between domestic large models and GPT-3.5 on Chinese tasks is narrowing.