← all datasets

BFCL v-3

Emerging
13papers using it
2025first seen

The 'BFCL v-3' dataset/benchmark contains a collection of tasks designed to evaluate the performance of long-horizon reinforcement learning agents, particularly in the context of providing feedback for improving decision-making in complex environments.

Papers using BFCL v-3 (13)

BFCL v-3 β€” datasets β€” ai-agents