AMC 2023
Emerging
2 papers using it
First seen: 2025
AMC 2023 is a benchmark dataset used to evaluate reinforcement learning methods for large language model reasoning, including techniques such as Quantile Advantage Estimation that aim to stabilize that reasoning.