← all datasets

HLE

Emerging
2papers using it
34,984HF downloads
836HF likes
2025first seen

[!NOTE] IMPORTANT: Please help us protect the integrity of this benchmark by not publicly sharing, re-uploading, or distributing the dataset. Humanity's Last Exam 🌐 Website | πŸ“„ Paper | GitHub Center for AI Safety & Scale AI Humanity's Last Exam (HLE) is a multi-modal benchmark at the frontier of human knowledge, desi

Papers using HLE (2)

HLE β€” datasets β€” reinforcement-learning