← all datasets

TruthfulQA

Canonical
27 papers using it
1,860 HF downloads
49 HF likes
2024 first seen

Dataset Card for TruthfulQA

Dataset Summary

TruthfulQA: Measuring How Models Mimic Human Falsehoods

We propose a benchmark to measure whether a language model is truthful in generating answers to questions. The benchmark comprises 817 questions that span 38 categories, including health, law, finance and politics. We cr

Papers using TruthfulQA (27)
