
Plan-RewardBench

Emerging
2 papers using it
92 HF downloads
3 HF likes
First seen: 2026

πŸ† Plan-RewardBench A Comprehensive Benchmark for Trajectory-Level Reward Modeling in Tool-Augmented Agents ⚠️ Important: This is an evaluation-only benchmark. The HuggingFace train split is simply the default container for the full benchmark data β€” it does not represent a training set. The dataset viewer may be tempor

Papers using Plan-RewardBench (2)
