2024-08-13 08:11:01.AIbase.
The Compass Arena, a Large Model Evaluation Platform, Adds a Multi-Modal Large Model Competition Section
2023-11-29 09:08:23.AIbase.
"Baimao Battle" Family's First, When Will Cheating in Large Model 'Scoring' Stop?
2023-11-02 15:21:41.AIbase.
Ant Group Releases Benchmark for Large Model Evaluation in the DevOps Field
2023-09-25 09:54:21.AIbase.
Investigation into the Chaos of Large Model Evaluation: Parameter Scale Does Not Represent Everything
2023-08-29 10:09:08.AIbase.