AMC-23
Emerging
1 paper using it
First seen: 2026
AMC-23 is a benchmark dataset for evaluating the mathematical reasoning capabilities of models. It has been used to assess performance improvements from critique-guided training methods.