AIbase
Product LibraryTool Navigation

parrot-python

Public

PARROT (Performance Assessment of Reasoning and Responses On Trivia) is a novel benchmarking framework designed to evaluate Large Language Models (LLMs) on real-world, complex, and ambiguous QA tasks.

Creat2024-09-27T04:30:11
Update2024-10-22T08:22:52
https://huggingface.co/datasets/RedBlock/parrot
1
Stars
0
Stars Increase