Humanity's Last Exam
Emerging4papers using it
2025first seen
'Humanity's Last Exam' is a benchmark used to evaluate the performance of search agents, specifically assessing their capabilities in high-difficulty tasks.
'Humanity's Last Exam' is a benchmark used to evaluate the performance of search agents, specifically assessing their capabilities in high-difficulty tasks.