← all datasets

BeyondAIME

Emerging
6papers using it
827HF downloads
18HF likes
2025first seen

BeyondAIME: Advancing Math Reasoning Evaluation Beyond High School Olympiads Dataset Description BeyondAIME is a curated test set designed to benchmark advanced mathematical reasoning. Its creation was guided by the following core principles to ensure a fair and challenging evaluation: High Difficulty: Problems are sou

Papers using BeyondAIME (6)

BeyondAIME β€” datasets β€” reinforcement-learning