C-Eval is a comprehensive benchmark designed to assess the advanced knowledge and reasoning abilities of Chinese foundation models. It comprises multiple-choice questions spanning four difficulty levels (middle school, high school, college, and professional) across 52 subject areas. The questions are drawn from mock exams available on the internet. The C-Eval leaderboard tracks the performance of open-source models on this evaluation, helping practitioners select large models for natural-language-processing work and thereby promoting the development of AI applications.
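As a rough illustration of how such a multiple-choice benchmark is scored, the sketch below formats a four-option question as a prompt and computes accuracy against gold answer letters. The record fields (`question`, `A`–`D`, `answer`) and the predictor are assumptions for illustration, not the official C-Eval data schema or evaluation harness.

```python
# Hedged sketch: scoring four-option multiple-choice questions by accuracy.
# The field names below are illustrative assumptions, not C-Eval's exact schema.

def format_prompt(record):
    """Render one multiple-choice record as a plain-text prompt."""
    return (
        f"{record['question']}\n"
        f"A. {record['A']}\nB. {record['B']}\n"
        f"C. {record['C']}\nD. {record['D']}\n"
        "Answer:"
    )

def accuracy(records, predict):
    """Fraction of records where predict(prompt) matches the gold letter."""
    correct = sum(1 for r in records if predict(format_prompt(r)) == r["answer"])
    return correct / len(records)

# Toy data and a trivial baseline predictor that always answers "A".
sample = [
    {"question": "1 + 1 = ?", "A": "2", "B": "3", "C": "4", "D": "5", "answer": "A"},
    {"question": "2 + 2 = ?", "A": "3", "B": "4", "C": "5", "D": "6", "answer": "B"},
]
always_a = lambda prompt: "A"
print(accuracy(sample, always_a))  # 0.5
```

A real evaluation would replace `always_a` with a call to the model under test and aggregate accuracy per subject and difficulty level.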