
VL-RewardBench

Emerging
4 papers using it
750 HF downloads
15 HF likes
First seen: 2025

Dataset Card for VLRewardBench

Project Page: https://vl-rewardbench.github.io

Dataset Summary: VLRewardBench is a comprehensive benchmark designed to evaluate vision-language generative reward models (VL-GenRMs) across visual perception, hallucination detection, and reasoning tasks. The benchmark contains 1,250 high-qua

Papers using VL-RewardBench (4)
