
Humanity's Last Exam

Emerging
3 papers using it
2025 first seen

Humanity's Last Exam is a benchmark dataset used to evaluate the capabilities of evolving agents in scientific inquiry and experimentation.

Papers using Humanity's Last Exam (3)
