RewardBench
Emerging15papers using it
2024first seen
Papers using RewardBench (15)
- PaTaRM: Bridging Pairwise and Pointwise Signals via Preference-Aware Task-Adaptive Reward ModelingNine Judges, Two Effective Votes: Correlated Errors Undermine LLM Evaluation PanelsCDRRM: Contrast-Driven Rubric Generation for Reliable and Interpretable Reward ModelingSCOPE: Selective Conformal Optimized Pairwise LLM JudgingSR-GRPO: Stable Rank as an Intrinsic Geometric Reward for Large Language Model AlignmentExplicit Reasoning Makes Better Judges: A Systematic Study on Accuracy, Efficiency, and RobustnessIntra-Trajectory Consistency for Reward ModelingRobust Reward Modeling via Causal RubricsTime To Impeach LLM-as-a-Judge: Programs are the Future of EvaluationENCORE: Entropy-guided Reward Composition for Multi-head Safety Reward ModelsSentence-level Reward Model can Generalize Better for Aligning LLM from Human PreferenceIPO: Your Language Model is Secretly a Preference ClassifierData-adaptive Safety Rules for Training Reward ModelsMargin Matching Preference Optimization: Enhanced Model Alignment with Granular FeedbackUncertainty-aware Reward Model: Teaching Reward Models to Know What is
Unknown