← all datasets

HMMT 25

Emerging
3papers using it
52HF downloads
0HF likes
2025first seen

The 'HMMT 25' dataset/benchmark contains a collection of challenging mathematical problems used to evaluate the reasoning capabilities of large language models (LLMs).

Papers using HMMT 25 (3)

HMMT 25 β€” datasets β€” ai-for-code