BFCL v-4
Emerging2papers using it
2026first seen
The 'BFCL-V-4' dataset/benchmark contains a collection of multi-turn interaction scenarios designed to evaluate the performance of AI agents in reasoning and invoking external tools during complex tasks.