← all datasets

BFCL v-3

Emerging
4papers using it
27HF downloads
0HF likes
2025first seen

The 'BFCL-v3' dataset/benchmark is used to evaluate the performance of Large Language Models (LLMs) in executing complex, multi-step tasks by providing a structured set of data that reflects the model's capabilities and weaknesses.

Papers using BFCL v-3 (4)

BFCL v-3 β€” datasets β€” llm-papers