MATH-500
Emerging
6 papers using it
140,618 HF downloads
317 HF likes
First seen: 2024
Dataset Card for MATH-500
This dataset contains the 500-problem subset of the MATH benchmark that OpenAI created for their paper "Let's Verify Step by Step". See their GitHub repo for the source file: https://github.com/openai/prm800k/tree/main?tab=readme-ov-file#math-splits
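For readers who download the source file, the following is a minimal sketch of reading it and tallying problems by category. It assumes the file is stored as JSON Lines (one JSON object per line). The field names `problem`, `answer`, and `subject` are assumptions made for illustration and are not confirmed by this card. The two sample records are made up and stand in for the real data.

```python
import json
from collections import Counter
from typing import Iterable


def parse_jsonl(lines: Iterable[str]) -> list[dict]:
    """Parse JSON Lines text: one JSON object per non-empty line."""
    return [json.loads(line) for line in lines if line.strip()]


def count_by_field(records: list[dict], field: str) -> Counter:
    """Count records per value of `field`; records lacking it fall under None."""
    return Counter(record.get(field) for record in records)


# Two made-up records standing in for the real file (illustrative only;
# the field names here are assumptions, not taken from the dataset card).
sample = [
    '{"problem": "What is 1 + 1?", "answer": "2", "subject": "Prealgebra"}',
    '{"problem": "Solve x + 3 = 5.", "answer": "2", "subject": "Algebra"}',
]

records = parse_jsonl(sample)
counts = count_by_field(records, "subject")
print(len(records), dict(counts))
```

With a local copy of the file, the same function accepts an open file handle in place of the sample list, since iterating over a text file yields one line at a time. If the assumptions hold, the full subset should yield 500 records.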
Papers using MATH-500 (6)
- CWM: An Open-Weights LLM for Research on Code Generation with World Models
- Large Language Model enabled Mathematical Modeling
- Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model
- CWM: An Open-Weights LLM for Research on Code Generation with World Models
- To Code or not to Code? Adaptive Tool Integration for Math Language Models via Expectation-Maximization
- Not All Votes Count! Programs as Verifiers Improve Self-Consistency of Language Models for Math Reasoning