AIbase
Product LibraryTool Navigation

stark-agent

Public

STaRK: Agentic AI benchmark, which is designed to evaluate how well LLMs and retrieval systems work with semi-structured knowledge bases.

Creat2025-02-23T16:23:00
Update2025-02-28T07:58:16
https://stark.stanford.edu/
0
Stars
0
Stars Increase

Related projects