RewardBench
Emerging, 13 papers using it
First seen: 2024
Papers using RewardBench (13)
- PaTaRM: Bridging Pairwise and Pointwise Signals via Preference-Aware Task-Adaptive Reward Modeling
- IRPM: Intergroup Relative Preference Modeling for Pointwise Generative Reward Models
- Tiny Reward Models
- Efficient Online RFT with Plug-and-Play LLM Judges: Unlocking State-of-the-Art Performance
- Intra-Trajectory Consistency for Reward Modeling
- Critique-out-Loud Reward Models
- Post-hoc Reward Calibration: A Case Study on Length Bias
- Quantile Regression for Distributional Reward Models in RLHF
- Evaluating Robustness of Reward Models for Mathematical Reasoning
- Dr. SoW: Density Ratio of Strong-over-weak LLMs for Reducing the Cost of Human Annotation in Preference Tuning
- Act-Adaptive Margin: Dynamically Calibrating Reward Models for Subjective Ambiguity
- Multi-Agent Collaborative Reward Design for Enhancing Reasoning in Reinforcement Learning
- Sentence-level Reward Model can Generalize Better for Aligning LLM from Human Preference