MATH-500

Emerging

6 papers using it

140,618 HF downloads

317 HF likes

First seen: 2024

Dataset Card for MATH-500

This dataset contains the 500-problem subset of the MATH benchmark that OpenAI created for their paper "Let's Verify Step by Step". See their GitHub repo for the source file: https://github.com/openai/prm800k/tree/main?tab=readme-ov-file#math-splits
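As a rough illustration of working with a split file like the one referenced above, here is a minimal sketch that parses JSON Lines records and counts them by a field. It assumes the source file is JSON Lines with one problem per line; the field names `problem`, `answer`, `subject`, and `level`, the helper names, and the two sample records are illustrative assumptions, not taken from the dataset card.

```python
import json
from collections import Counter
from typing import Iterable


def parse_jsonl(lines: Iterable[str]) -> list[dict]:
    """Parse JSON Lines text into a list of records, skipping blank lines."""
    return [json.loads(line) for line in lines if line.strip()]


def count_by_field(records: list[dict], field: str) -> Counter:
    """Count records by the value of one field (None if the field is absent)."""
    return Counter(record.get(field) for record in records)


# Hypothetical sample records; the field names are assumptions, not confirmed
# by the dataset card. For a real file, pass an open file handle instead, e.g.
#   with open(path, encoding="utf-8") as f: records = parse_jsonl(f)
sample = [
    '{"problem": "What is $1+1$?", "answer": "2", "subject": "Prealgebra", "level": 1}',
    "",
    '{"problem": "Solve $x^2=4$ for positive $x$.", "answer": "2", "subject": "Algebra", "level": 1}',
]

records = parse_jsonl(sample)
print(len(records))
print(count_by_field(records, "subject"))
```

If the full split matches the description above, the same `parse_jsonl` call on it should yield 500 records.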

🤗 Hugging Face

Papers using MATH-500 (6)

CWM: An Open-Weights LLM for Research on Code Generation with World Models (2025)

Large Language Model enabled Mathematical Modeling (2025)

Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model (2025)

CWM: An Open-Weights LLM for Research on Code Generation with World Models (2025)

To Code or not to Code? Adaptive Tool Integration for Math Language Models via Expectation-Maximization (2025)

Not All Votes Count! Programs as Verifiers Improve Self-Consistency of Language Models for Math Reasoning (2024)