PRM-800K
Emerging6papers using it
456HF downloads
38HF likes
2024first seen
https://github.com/openai/prm800k/tree/main
π€ Hugging Faceβ mit
Papers using PRM-800K (6)
- C2-Faith: Benchmarking LLM Judges for Causal and Coverage Faithfulness in Chain-of-Thought ReasoningProcess Reward Models That ThinkSPC: Evolving Self-Play Critic via Adversarial Games for LLM ReasoningError Typing for Smarter Rewards: Improving Process Reward Models with
Error-Aware Hierarchical SupervisionDeepCritic: Deliberate Critique with Large Language ModelsUnderstanding Chain-of-Thought in LLMs through Information Theory