AIbase
Product LibraryTool Navigation

4onebench

Public

A minimalist benchmarking tool designed to test the routine-generation capabilities of LLMs.

Creat2024-11-04T07:15:19
Update2024-12-15T21:24:57
21
Stars
0
Stars Increase

Related projects