RM-Bench
Emerging5papers using it
1,213HF downloads
10HF likes
2025first seen
RM-Bench This repository contains the data of the paper "RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style" News [2025/07/12] π― The RM-Bench Leaderboard is now publicly available! Check it out and submit your result at RM-Bench Leaderboard! Dataset Details the samples are formatted as fol
π€ Hugging Faceβ odc-by
Papers using RM-Bench (5)
- PaTaRM: Bridging Pairwise and Pointwise Signals via Preference-Aware Task-Adaptive Reward ModelingBayesian Preference Learning for Test-Time Steerable Reward ModelsCodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward ModelsIRPM: Intergroup Relative Preference Modeling for Pointwise Generative Reward ModelsRLBFF: Binary Flexible Feedback to bridge between Human Feedback & Verifiable Rewards