AIME
Emerging5papers using it
74HF downloads
0HF likes
2025first seen
The AIME dataset/benchmark is used to evaluate mathematical reasoning tasks.
Papers using AIME (5)
- OpenThoughts: Data Recipes for Reasoning ModelsSBSC: Step-By-Step Coding for Improving Mathematical Olympiad
Performance$V_1$: Unifying Generation and Self-Verification for Parallel ReasonersTo Code or not to Code? Adaptive Tool Integration for Math Language Models via Expectation-MaximizationV_1: Unifying Generation and Self-Verification for Parallel Reasoners