BFCLv-3

Emerging

7papers using it

2025first seen

The 'BFCLv-3' dataset/benchmark is used to evaluate the effectiveness of tool-use agents by providing a structured set of tasks that capture interaction dynamics and error recovery in their performance.

🔎 Find this dataset

Papers using BFCLv-3 (7)

TIER: Trajectory-Invariant Execution Rewards for Multi-Step Tool Composition2026

HINT-SD: Targeted Hindsight Self-Distillation for Long-Horizon Agents2026

Controllable and Verifiable Tool-Use Data Synthesis for Agentic Reinforcement Learning2026

TopoCurate:Modeling Interaction Topology for Tool-Use Agent Training2026

MagicAgent: Towards Generalized Agent Planning2026

ToolSample: Dual Dynamic Sampling Methods with Curriculum Learning for RL-based Tool Learning2025

ShiQ: Bringing back Bellman to LLMs2025