Berkeley Function-Calling Leaderboard
Leaderboard for evaluating the function-calling ability of large language models
The Berkeley Function-Calling Leaderboard (BFCL) is an online leaderboard that evaluates how accurately large language models (LLMs) call functions (also known as tools). It is built on real-world data and is updated regularly, providing a benchmark for measuring and comparing the function-calling performance of different models. It is a useful resource for developers, researchers, and anyone interested in the function-calling capabilities of AI models.
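To make "accuracy in calling functions" concrete, the sketch below scores a model's predicted calls against reference calls by exact match on function name and arguments. It is only an illustration under stated assumptions: the names FunctionCall, call_matches, and accuracy, as well as the sample functions get_weather and add, are hypothetical, and exact matching is a simplification rather than the leaderboard's actual scoring method.

```python
from dataclasses import dataclass, field
from typing import Any


@dataclass
class FunctionCall:
    """A function (tool) call: a function name plus keyword arguments.

    Hypothetical type for illustration; not part of any leaderboard API.
    """
    name: str
    arguments: dict[str, Any] = field(default_factory=dict)


def call_matches(predicted: FunctionCall, expected: FunctionCall) -> bool:
    """Return True when the name and every argument match exactly."""
    return (
        predicted.name == expected.name
        and predicted.arguments == expected.arguments
    )


def accuracy(predictions: list[FunctionCall], references: list[FunctionCall]) -> float:
    """Fraction of predicted calls that exactly match their reference call."""
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have the same length")
    if not references:
        return 0.0
    correct = sum(call_matches(p, r) for p, r in zip(predictions, references))
    return correct / len(references)


# Made-up sample data: the second prediction has a wrong argument value.
references = [
    FunctionCall("get_weather", {"city": "Berkeley", "unit": "celsius"}),
    FunctionCall("add", {"a": 2, "b": 3}),
]
predictions = [
    FunctionCall("get_weather", {"city": "Berkeley", "unit": "celsius"}),
    FunctionCall("add", {"a": 2, "b": 5}),
]

score = accuracy(predictions, references)
print(f"accuracy = {score:.2f}")
```

Exact matching keeps the sketch short, but it is strict: a call whose argument is equivalent but written differently (for example "Berkeley, CA" instead of "Berkeley") would count as wrong, so a real evaluation would likely need more tolerant comparison rules.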