← all datasets

BFCL v-4

Emerging
2papers using it
2026first seen

The 'BFCL-V-4' dataset/benchmark contains a collection of multi-turn interaction scenarios designed to evaluate the performance of AI agents in reasoning and invoking external tools during complex tasks.

Papers using BFCL v-4 (2)

BFCL v-4 β€” datasets β€” reinforcement-learning