← all datasets

BFCLv-3

Emerging
7papers using it
2025first seen

The 'BFCLv-3' dataset/benchmark is used to evaluate the effectiveness of tool-use agents by providing a structured set of tasks that capture interaction dynamics and error recovery in their performance.

Papers using BFCLv-3 (7)

BFCLv-3 β€” datasets β€” reinforcement-learning