← all datasets

MedHallu

Emerging
3papers using it
2,414HF downloads
22HF likes
2025first seen

Dataset Card for MedHallu MedHallu is a comprehensive benchmark dataset designed to evaluate the ability of large language models to detect hallucinations in medical question-answering tasks. Dataset Details Dataset Description MedHallu is intended to assess the reliability of large language models in a critical domain

Papers using MedHallu (3)

MedHallu — datasets — llm-papers