Graph-aware Late Chunking For Retrieval-augmented Generation In Biomedical Literature
2026 · Pouria Mortezaagha, Arya Rahgozar
Abstract
Retrieval-Augmented Generation (RAG) systems for biomedical literature are typically evaluated using ranking metrics like Mean Reciprocal Rank (MRR), which measure how well the system identifies the single most relevant chunk. We argue that for full-text scientific documents, this paradigm is incomplete: it rewards retrieval precision while ignoring retrieval breadth -- the ability to surface evidence from across a document's structural sections. We propose GraLC-RAG, a framework that unifies late chunking with graph-aware structural intelligence, introducing structure-aware chunk boundary detection, UMLS knowledge graph infusion, and graph-guided hybrid retrieval. We evaluate six strategies on 2,359 IMRaD-filtered PubMed Central articles using 2,033 cross-section questions and two metric families: standard ranking metrics (MRR, Recall@k) and structural coverage metrics (SecCov@k, CS Recall). Our results expose a sharp divergence: content-similarity methods achieve the highest MRR (0.5
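The contrast between the two metric families named in the abstract can be illustrated with a small sketch. MRR and Recall@k follow their standard definitions. The section-coverage function is only an assumed reading of SecCov@k (the fraction of target sections represented among the top-k chunks), since the abstract does not define it; the chunk IDs, section labels, and function names are hypothetical.

```python
from typing import Mapping, Sequence, Set


def reciprocal_rank(ranked: Sequence[str], relevant: Set[str]) -> float:
    """1 / rank of the first relevant chunk, or 0.0 if none is retrieved."""
    for rank, chunk_id in enumerate(ranked, start=1):
        if chunk_id in relevant:
            return 1.0 / rank
    return 0.0


def recall_at_k(ranked: Sequence[str], relevant: Set[str], k: int) -> float:
    """Fraction of relevant chunks that appear in the top-k results."""
    if not relevant:
        return 0.0
    return len(set(ranked[:k]) & relevant) / len(relevant)


def section_coverage_at_k(
    ranked: Sequence[str],
    chunk_section: Mapping[str, str],
    target_sections: Set[str],
    k: int,
) -> float:
    """ASSUMED definition: share of target sections hit by the top-k chunks."""
    if not target_sections:
        return 0.0
    seen = {chunk_section[c] for c in ranked[:k]}
    return len(seen & target_sections) / len(target_sections)


# Hypothetical article: three Methods chunks, one Results, one Discussion.
chunk_section = {
    "m1": "Methods", "m2": "Methods", "m3": "Methods",
    "r1": "Results", "d1": "Discussion",
}
relevant = {"m1", "r1"}                 # evidence spans two sections
target_sections = {"Methods", "Results"}

similarity_run = ["m1", "m2", "m3", "r1", "d1"]  # clusters in one section
structural_run = ["d1", "r1", "m1", "m2", "m3"]  # spreads across sections

for name, run in [("similarity", similarity_run), ("structural", structural_run)]:
    print(
        name,
        reciprocal_rank(run, relevant),
        recall_at_k(run, relevant, k=3),
        section_coverage_at_k(run, chunk_section, target_sections, k=3),
    )
```

In this toy case the similarity-style ranking scores a higher reciprocal rank while covering only one of the two target sections at k=3, and the structurally spread ranking does the reverse. This is the kind of divergence the abstract describes, not a reproduction of its results.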
Related papers
- Are We On The Right Way For Assessing Document Retrieval-augmented Generation? (2025)
- Graph-based Retriever Captures The Long Tail Of Biomedical Knowledge (2024)
- Chunk Twice, Embed Once: A Systematic Study Of Segmentation And Representation Trade-offs In Chemistry-aware Retrieval-augmented Generation (2025)
- Ragsmith: A Framework For Finding The Optimal Composition Of Retrieval-augmented Generation Methods Across Datasets (2025)
- MG\(^2\)-RAG: Multi-granularity Graph For Multimodal Retrieval-augmented Generation (2026)
- From BM25 To Corrective RAG: Benchmarking Retrieval Strategies For Text-and-table Documents (2026)
- SRAG: RAG With Structured Data Improves Vector Retrieval (2026)
- Rag-check: Evaluating Multimodal Retrieval Augmented Generation Performance (2025)