MATH
Emerging34papers using it
2023first seen
Papers using MATH (34)
- rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep
ThinkingEvalTree: Profiling Language Model Weaknesses via Hierarchical
Capability TreesShape of Thought: When Distribution Matters More than Correctness in Reasoning TasksRewriting Pre-Training Data Boosts LLM Performance in Math and CodeLLM Performance for Code Generation on Noisy TasksReasoning-as-Logic-Units: Scaling Test-Time Reasoning in Large Language
Models Through Logic Unit AlignmentCode-Vision: Evaluating Multimodal LLMs Logic Understanding and Code
Generation CapabilitiesMathClean: A Benchmark for Synthetic Mathematical Data CleaningProgressive-Hint Prompting Improves Reasoning in Large Language ModelsMetaMath: Bootstrap Your Own Mathematical Questions for Large Language
ModelsSolving Challenging Math Word Problems Using GPT-4 Code Interpreter with
Code-based Self-VerificationData Interpreter: An LLM Agent For Data ScienceLearning From Mistakes Makes LLM Better ReasonerMathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical
ReasoningCREATOR: Tool Creation for Disentangling Abstract and Concrete Reasoning
of Large Language ModelsOpenMathInstruct-1: A 1.8 Million Math Instruction Tuning DatasetDotaMath: Decomposition of Thought with Code Assistance and
Self-correction for Mathematical ReasoningInternLM-Math: Open Math Large Language Models Toward Verifiable
ReasoningMathGenie: Generating Synthetic Data with Question Back-translation for
Enhancing Mathematical Reasoning of LLMsMuMath-Code: Combining Tool-Use Large Language Models with
Multi-perspective Data Augmentation for Mathematical ReasoningReGAL: Refactoring Programs to Discover Generalizable AbstractionsDivide-and-Conquer Meets Consensus: Unleashing the Power of Functions in
Code GenerationBuilding Math Agents with Multi-Turn Iterative Preference LearningMARIO: MAth Reasoning with code Interpreter Output -- A Reproducible
PipelineEmbedding Self-Correction as an Inherent Ability in Large Language
Models for Enhanced Mathematical ReasoningReasonAgain: Using Extractable Symbolic Programs to Evaluate
Mathematical ReasoningUTMath: Math Evaluation with Unit Test via Reasoning-to-Coding ThoughtsInfinityMATH: A Scalable Instruction Tuning Dataset in Programmatic
Mathematical ReasoningSolving Challenging Math Word Problems Using GPT-4 Code Interpreter with
Code-based Self-VerificationMathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical
ReasoningOpenMathInstruct-1: A 1.8 Million Math Instruction Tuning DatasetDotaMath: Decomposition of Thought with Code Assistance and
Self-correction for Mathematical ReasoningInfinityMATH: A Scalable Instruction Tuning Dataset in Programmatic
Mathematical ReasoningrStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep
Thinking