
RewardBench

Emerging

15 papers using it

First seen: 2024


Papers using RewardBench (15)

PaTaRM: Bridging Pairwise and Pointwise Signals via Preference-Aware Task-Adaptive Reward Modeling · 2025 · 4 cites

Nine Judges, Two Effective Votes: Correlated Errors Undermine LLM Evaluation Panels · 2026

CDRRM: Contrast-Driven Rubric Generation for Reliable and Interpretable Reward Modeling · 2026

SCOPE: Selective Conformal Optimized Pairwise LLM Judging · 2026

SR-GRPO: Stable Rank as an Intrinsic Geometric Reward for Large Language Model Alignment · 2025

Explicit Reasoning Makes Better Judges: A Systematic Study on Accuracy, Efficiency, and Robustness · 2025

Intra-Trajectory Consistency for Reward Modeling · 2025

Robust Reward Modeling via Causal Rubrics · 2025

Time To Impeach LLM-as-a-Judge: Programs are the Future of Evaluation · 2025

ENCORE: Entropy-guided Reward Composition for Multi-head Safety Reward Models · 2025

Sentence-level Reward Model can Generalize Better for Aligning LLM from Human Preference · 2025

IPO: Your Language Model is Secretly a Preference Classifier · 2025

Data-adaptive Safety Rules for Training Reward Models · 2025

Margin Matching Preference Optimization: Enhanced Model Alignment with Granular Feedback · 2024

Uncertainty-aware Reward Model: Teaching Reward Models to Know What is Unknown · 2024
