ProcessBench

Name: ProcessBench
License: apache-2.0

Emerging

11papers using it

4,891HF downloads

59HF likes

2025first seen

ProcessBench This repository contains the dataset of the ProcessBench benchmark proposed by Qwen Team. You can refer to our GitHub repository for the evaluation code and the prompt templates we use in this work. If you find this work relevant or helpful to your work, please kindly cite us: @article{processbench, title=

🤗 Hugging Face⚖ apache-2.0

Papers using ProcessBench (11)

Unsupervised Process Reward Models2026

RefCritic: Training Long Chain-of-Thought Critic Models with Refinement Feedback2025

GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning2025

Process Reward Models That Think2025

SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning2025

Solve-Detect-Verify: Inference-Time Scaling with Flexible Generative Verifier2025

RL Tango: Reinforcing Generator and Verifier Together for Language Reasoning2025

Training Step-Level Reasoning Verifiers with Formal Verification Tools2025

RefCritic: Training Long Chain-of-Thought Critic Models with Refinement Feedback2025

RL Tango: Reinforcing Generator and Verifier Together for Language Reasoning2025

Efficient Process Reward Model Training via Active Learning2025