AIME 2025
Emerging9papers using it
14,443HF downloads
55HF likes
2025first seen
AIME 2025 Dataset Dataset Description This dataset contains problems from the American Invitational Mathematics Examination (AIME) 2025-I & II.
π€ Hugging Faceβ mit
Papers using AIME 2025 (9)
- Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement LearningReflective Confidence: Correcting Reasoning Flaws via Online Self-CorrectionBAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive ClippingBAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via
Balanced Policy Optimization with Adaptive ClippingBeyond the Last Answer: Your Reasoning Trace Uncovers More than You
ThinkSolve-Detect-Verify: Inference-Time Scaling with Flexible Generative
VerifierNot All Correct Answers Are Equal: Why Your Distillation Source MattersUloRL:An Ultra-Long Output Reinforcement Learning Approach for Advancing
Large Language Models' Reasoning AbilitiesDeepPrune: Parallel Scaling without Inter-trace Redundancy