AMC-23

Emerging

1 paper using it

First seen: 2026

AMC-23 is a benchmark dataset for evaluating the mathematical reasoning capabilities of models. The paper listed below uses it to assess performance improvements from critique-guided training methods.


Papers using AMC-23 (1)

Critique-Guided Distillation for Robust Reasoning via Refinement (2026)