AIME-25
Emerging25papers using it
476HF downloads
2HF likes
2025first seen
The AIME25 part 1 exam from the website.
π€ Hugging Faceβ mit
Papers using AIME-25 (25)
- Transformation-Augmented GRPO for Enhancing Exploration in Reasoning of Large Language ModelsVTC-R1: Vision-Text Compression for Efficient Long-Context ReasoningTriAttention: Efficient Long Reasoning with Trigonometric KV CompressionBenchmarking EngGPT2-16B-A3B against Comparable Italian and International Open-source LLMsTest-time Recursive Thinking: Self-Improvement without External FeedbackPrompting Test-Time Scaling Is A Strong LLM Reasoning Data AugmentationRefCritic: Training Long Chain-of-Thought Critic Models with Refinement FeedbackMoL-RL: Distilling Multi-Step Environmental Feedback into LLMs for Feedback-Independent ReasoningLight-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and
BeyondPensez: Less Data, Better Reasoning -- Rethinking French LLMSkywork Open Reasoner 1 Technical ReportBeyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective
Reinforcement Learning for LLM ReasoningRefCritic: Training Long Chain-of-Thought Critic Models with Refinement
FeedbackSAND-Math: Using LLMs to Generate Novel, Difficult and Useful
Mathematics Questions and AnswersBeyond Pass@1: Self-Play with Variational Problem Synthesis Sustains
RLVRDCPO: Dynamic Clipping Policy OptimizationPromptCoT 2.0: Scaling Prompt Synthesis for Large Language Model
ReasoningScaleDiff: Scaling Difficult Problems for Advanced Mathematical
ReasoningFrom Harm to Help: Turning Reasoning In-Context Demos into Assets for
Reasoning LMsMeta-Awareness Enhances Reasoning Models: Self-Alignment Reinforcement
LearningSkill-Targeted Adaptive TrainingA^2FM: An Adaptive Agent Foundation Model for Tool-Aware Hybrid
ReasoningShorter but not Worse: Frugal Reasoning via Easy Samples as Length
Regularizers in Math RLVRCan LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM ReasoningScaling Reasoning without Attention