HLE

Name: HLE
License: MIT

Emerging

6 papers using it

34,984 HF downloads

836 HF likes

2025 first seen

[!NOTE] IMPORTANT: Please help us protect the integrity of this benchmark by not publicly sharing, re-uploading, or distributing the dataset.

Humanity's Last Exam

🌐 Website | 📄 Paper | GitHub

Center for AI Safety & Scale AI

Humanity's Last Exam (HLE) is a multi-modal benchmark at the frontier of human knowledge, desi

🤗 Hugging Face | ⚖ MIT

Papers using HLE (6)

Self-Verified Distillation: Your Language Model Is Secretly Its Own Synthetic Data Pipeline (2026)

MiroFlow: Towards High-Performance and Robust Open-Source Agent Framework for General Deep Research Tasks (2026)

SCOPE: Prompt Evolution for Enhancing Agent Effectiveness (2025)

EAPO: Enhancing Policy Optimization with On-Demand Expert Assistance (2025)

B-score: Detecting Biases in Large Language Models Using Response History (2025)

A^2FM: An Adaptive Agent Foundation Model for Tool-Aware Hybrid Reasoning (2025)