
TruthfulQA

Emerging
Papers using it: 2
HF downloads: 1,860
HF likes: 49
First seen: 2025

Dataset Card for TruthfulQA

Dataset Summary

TruthfulQA: Measuring How Models Mimic Human Falsehoods

We propose a benchmark to measure whether a language model is truthful in generating answers to questions. The benchmark comprises 817 questions that span 38 categories, including health, law, finance and politics. We cr

Papers using TruthfulQA (2)