AIME

Emerging

1papers using it

2026first seen

The 'AIME' dataset/benchmark is used to evaluate the performance of reinforcement learning methods in the context of long-horizon logical reasoning tasks for large language models.

🔎 Find this dataset

Papers using AIME (1)

Shattering the Autoregressive Curse: Dynamic Epistemic Entropy Orchestrated Erasable Reinforcement Learning for LLMs2026