
Berkeley Function Calling Leaderboard v3

Emerging
8 papers using it
First seen: 2024

The Berkeley Function Calling Leaderboard v3 is a benchmark dataset of 200 tasks for evaluating function-calling language agents on reasoning length and accuracy.

Papers using Berkeley Function Calling Leaderboard v3 (8)
