SimpleQA Verified
Emerging (2 papers using it)
587 HF downloads
1 HF like
2025 first seen
SimpleQA Verified is a 1,000-prompt benchmark for reliably evaluating Large Language Models (LLMs) on short-form factuality and parametric knowledge. The authors from Google DeepMind and Google Research address various limitations of SimpleQA, originally designed by Wei et al. (2024) at OpenAI, including noisy and inco
Hugging Face · MIT license