
BFCLv-3

Emerging
12 papers using it
First seen: 2025

BFCLv-3 is a benchmark dataset used to evaluate tool-use agents. It provides a structured representation of interaction dynamics through multi-trial rollouts of tasks.

Papers using BFCLv-3 (12)
