MATH-500
Emerging8papers using it
2025first seen
Papers using MATH-500 (8)
- Walk Before You Run! Concise LLM Reasoning via Reinforcement LearningMaximizing Rollout Informativeness under a Fixed Budget: A Submodular View of Tree Search for Tool-Use Agentic Reinforcement LearningSAGE-32B: Agentic Reasoning Via Iterative DistillationCounterfactual Credit Policy Optimization for Multi-Agent CollaborationWhat If We Allocate Test-Time Compute Adaptively?PILOT: Planning via Internalized Latent Optimization Trajectories for Large Language ModelsReinforce LLM Reasoning through Multi-Agent ReflectionA*-Decoding: Token-Efficient Inference Scaling