← all datasets

SimpleQA

Emerging
3papers using it
4,638HF downloads
32HF likes
2025first seen

SimpleQA A factuality benchmark called SimpleQA that measures the ability for language models to answer short, fact-seeking questions. Sources openai/simple-evals Introducing SimpleQA Measuring short-form factuality in large language models

Papers using SimpleQA (3)

SimpleQA β€” datasets β€” ai-agents