Plan-RewardBench
Emerging2papers using it
92HF downloads
3HF likes
2026first seen
π Plan-RewardBench A Comprehensive Benchmark for Trajectory-Level Reward Modeling in Tool-Augmented Agents β οΈ Important: This is an evaluation-only benchmark. The HuggingFace train split is simply the default container for the full benchmark data β it does not represent a training set. The dataset viewer may be tempor
π€ Hugging Faceβ cc-by-4.0