RAGTruth

Emerging

8papers using it

2024first seen

'RAGTruth' is a dataset/benchmark that contains checkable factual claims and is used to evaluate the faithfulness of generated responses in large language models by verifying these claims against provided evidence.

🔎 Find this dataset

Papers using RAGTruth (8)

BALTO: Balanced Token-Level Policy Optimization for Hallucination Mitigation2026

Evaluating the Relevance of Uncertainty Estimators for LLM Hallucination2026

Detecting Contextual Hallucinations in LLMs with Frequency-Aware Attention2026

Copy-Paste to Mitigate Large Language Model Hallucinations2025

Turk-LettuceDetect: A Hallucination Detection Models for Turkish RAG Applications2025

HalluGuard: Evidence-Grounded Small Reasoning Models to Mitigate Hallucinations in Retrieval-Augmented Generation2025

Learning to Reason for Hallucination Span Detection2025

100% Elimination of Hallucinations on RAGTruth for GPT-4 and GPT-3.5 Turbo2024