← all datasets

PRM-800K

Emerging

6papers using it

456HF downloads

38HF likes

2024first seen

https://github.com/openai/prm800k/tree/main

🤗 Hugging Face⚖ mit

Papers using PRM-800K (6)

C2-Faith: Benchmarking LLM Judges for Causal and Coverage Faithfulness in Chain-of-Thought Reasoning2026

Process Reward Models That Think2025

SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning2025

Error Typing for Smarter Rewards: Improving Process Reward Models with Error-Aware Hierarchical Supervision2025

DeepCritic: Deliberate Critique with Large Language Models2025

Understanding Chain-of-Thought in LLMs through Information Theory2024 · 1 cites