TruthfulQA
Emerging2papers using it
1,860HF downloads
49HF likes
2025first seen
Dataset Card for TruthfulQA Dataset Summary TruthfulQA: Measuring How Models Mimic Human Falsehoods We propose a benchmark to measure whether a language model is truthful in generating answers to questions. The benchmark comprises 817 questions that span 38 categories, including health, law, finance and politics. We cr
π€ Hugging Faceβ apache-2.0