AgentBench

Public

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Created: 2023-07-28T12:32:06
Updated: 2025-03-27T12:16:38
https://llmbench.ai
Stars: 2.5K
Stars Increase: 2

Related projects