#   Model                                        Score  Paper
1   infly/INF-ORM-Llama3.1-70B                   95.11  link
2   ShikaiChen/LDL-Reward-Gemma-2-27B-v0.1       94.99  link
3   nicolinho/QRM-Gemma-2-27B                    94.44  link
4   Skywork/Skywork-Reward-Gemma-2-27B-v0.2      94.26  link
5   nvidia/Llama-3.1-Nemotron-70B-Reward         94.11  link
6   Skywork/Skywork-Reward-Gemma-2-27B           93.80  link
7   SF-Foundation/TextEval-Llama3.1-70B          93.48  link
8   meta-metrics/MetaMetrics-RM-v1.0             93.42  link
9   Skywork/Skywork-Critic-Llama-3.1-70B         93.31  link
10  nicolinho/QRM-Llama3.1-8B-v2                 93.14  link
11  Skywork/Skywork-Reward-Llama-3.1-8B-v0.2     93.13  link
12  nicolinho/QRM-Llama3.1-8B                    93.06  link
13  LxzGordon/URM-LLaMa-3.1-8B                   92.94  link
14  Salesforce/SFR-LLaMa-3.1-70B-Judge-r         92.72  link
15  R-I-S-E/RISE-Judge-Qwen2.5-32B               92.66  link
16  Skywork/Skywork-Reward-Llama-3.1-8B          92.52  link
17  AtlaAI/Selene-1                              92.41  link
18  general-preference/GPM-Llama-3.1-8B          92.24  link
19  nvidia/Nemotron-4-340B-Reward                92.00  link
20  Ray2333/GRM-Llama3-8B-rewardmodel-ft         91.54  link
21  nicolinho/QRM-Llama3-8B                      91.10  link
22  SF-Foundation/TextEval-OffsetBias-12B        91.05  link
23  Ray2333/GRM-llama3.2-3B-rewardmodel-ft       90.92  link
24  Salesforce/SFR-nemo-12B-Judge-r              90.27  link
25  allenai/Llama-3.1-70B-Instruct-RM-RB2        90.21  link
26  internlm/internlm2-20b-reward                90.16  link
27  Skywork/Skywork-VL-Reward-7B                 90.07  link
28  facebook/Self-taught-evaluator-llama3.1-70B  90.01  link
29  LxzGordon/URM-LLaMa-3-8B                     89.91  link
30  NCSOFT/Llama-3-OffsetBias-RM-8B              89.42  link
RewardBench (v1) Leaderboard