RAGTruth
Emerging8papers using it
2024first seen
'RAGTruth' is a dataset/benchmark that contains checkable factual claims and is used to evaluate the faithfulness of generated responses in large language models by verifying these claims against provided evidence.
Papers using RAGTruth (8)
- BALTO: Balanced Token-Level Policy Optimization for Hallucination MitigationEvaluating the Relevance of Uncertainty Estimators for LLM HallucinationDetecting Contextual Hallucinations in LLMs with Frequency-Aware AttentionCopy-Paste to Mitigate Large Language Model HallucinationsTurk-LettuceDetect: A Hallucination Detection Models for Turkish RAG
ApplicationsHalluGuard: Evidence-Grounded Small Reasoning Models to Mitigate
Hallucinations in Retrieval-Augmented GenerationLearning to Reason for Hallucination Span Detection100% Elimination of Hallucinations on RAGTruth for GPT-4 and GPT-3.5
Turbo