← all datasets

AMC-23

Emerging
11papers using it
5,846HF downloads
1HF likes
2024first seen

The 'AMC23' dataset/benchmark is used to evaluate reinforcement learning models' ability to perform multi-step reasoning while managing the trade-off between efficiency and accuracy in their responses.

Papers using AMC-23 (11)

AMC-23 β€” datasets β€” reinforcement-learning