A Systematic Study Of Retrieval Pipeline Design For Retrieval-augmented Medical Question Answering
2026 Β· Nusrat Sultana, Abdullah Muhammad Moosa, Kazi Afzalur Rahman, et al.
Abstract
Large language models (LLMs) have demonstrated strong capabilities in medical question answering; however, purely parametric models often suffer from knowledge gaps and limited factual grounding. Retrieval-augmented generation (RAG) addresses this limitation by integrating external knowledge retrieval into the reasoning process. Despite increasing interest in RAG-based medical systems, the impact of individual retrieval components on performance remains insufficiently understood. This study presents a systematic evaluation of retrieval-augmented medical question answering using the MedQA USMLE benchmark and a structured textbook-based knowledge corpus. We analyze the interaction between language models, embedding models, retrieval strategies, query reformulation, and cross-encoder reranking within a unified experimental framework comprising forty configurations. Results show that retrieval augmentation significantly improves zero-shot medical question answering performance. The best-pe
Authors
(none)
Tags
Stats
Related papers
- Beyond Retrieval: Ensembling Cross-encoders And GPT Rerankers With Llms For Biomedical QA (2025)0.00
- An Interactive Multi-modal Query Answering System With Retrieval-augmented Large Language Models (2024)5.84
- Are We On The Right Way For Assessing Document Retrieval-augmented Generation? (2025)0.00
- Retrieval-augmented Generation Assistant For Anatomical Pathology Laboratories (2025)0.00
- Graph-based Retriever Captures The Long Tail Of Biomedical Knowledge (2024)0.00
- Murag: Multimodal Retrieval-augmented Generator For Open Question Answering Over Images And Text (2022)14.66
- Object Retrieval For Visual Question Answering With Outside Knowledge (2024)0.00
- Machine Assistant With Reliable Knowledge: Enhancing Student Learning Via Rag-based Retrieval (2025)0.00