PMC-Patients: A Large-scale Dataset Of Patient Summaries And Relations For Benchmarking Retrieval-based Clinical Decision Support Systems
2022 · Zhengyun Zhao, Qiao Jin, Fangyuan Chen, et al.
Abstract
Objective: Retrieval-based Clinical Decision Support (ReCDS) can aid clinical workflow by providing relevant literature and similar patients for a given patient. However, the development of ReCDS systems has been severely obstructed by the lack of diverse patient collections and publicly available large-scale patient-level annotation datasets. In this paper, we aim to define and benchmark two ReCDS tasks: Patient-to-Article Retrieval (ReCDS-PAR) and Patient-to-Patient Retrieval (ReCDS-PPR) using a novel dataset called PMC-Patients. Methods: We extract patient summaries from PubMed Central articles using simple heuristics and utilize the PubMed citation graph to define patient-article relevance and patient-patient similarity. We also implement and evaluate several ReCDS systems on the PMC-Patients benchmarks, including sparse retrievers, dense retrievers, and nearest neighbor retrievers. We conduct several case studies to show the clinical utility of PMC-Patients. Results: PMC-Patients
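The Methods sentence about using the citation graph can be illustrated with a small sketch. The abstract does not give the exact labeling rules, so everything here is an assumption made for illustration: the function name `build_relevance`, the input shapes, the rule that a patient's relevant articles are its source article plus any article linked to it by a citation in either direction, and the rule that similar patients are those drawn from the same or a citation-linked article.

```python
from collections import defaultdict


def build_relevance(patient_source, citations):
    """Derive illustrative relevance labels from a citation graph.

    patient_source: dict mapping patient_id -> source article_id (hypothetical shape)
    citations: iterable of (citing_article_id, cited_article_id) pairs
    Returns (relevant_articles, similar_patients), both dicts keyed by patient_id.
    """
    # Treat citations as undirected links (an assumption, not the paper's stated rule).
    neighbors = defaultdict(set)
    for citing, cited in citations:
        neighbors[citing].add(cited)
        neighbors[cited].add(citing)

    # Group patients by the article they were extracted from.
    patients_by_article = defaultdict(set)
    for patient_id, article_id in patient_source.items():
        patients_by_article[article_id].add(patient_id)

    relevant_articles = {}
    similar_patients = {}
    for patient_id, article_id in patient_source.items():
        linked = neighbors.get(article_id, set())
        # Assumed rule: the source article and its citation neighbors are relevant.
        relevant_articles[patient_id] = {article_id} | linked
        # Assumed rule: patients from the same or a linked article are similar.
        similar = set(patients_by_article[article_id])
        for other_article in linked:
            similar |= patients_by_article.get(other_article, set())
        similar.discard(patient_id)  # a patient is not its own match
        similar_patients[patient_id] = similar
    return relevant_articles, similar_patients
```

Labels of this kind could serve as ground truth for the two retrieval tasks, ReCDS-PAR (articles) and ReCDS-PPR (patients), without manual annotation; the actual criteria used for PMC-Patients may differ from this sketch.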
Related papers
- R2MED: A Benchmark For Reasoning-driven Medical Retrieval (2025) · 2.51
- RadIR: A Scalable Framework For Multi-grained Medical Image Retrieval Via Radiology Report Mining (2025) · 0.00
- MedCPT: Contrastive Pre-trained Transformers With Large-scale PubMed Search Logs For Zero-shot Biomedical Information Retrieval (2023) · 15.34
- Cohort Retrieval Using Dense Passage Retrieval (2025) · 0.00
- Health System Scale Semantic Search Across Unstructured Clinical Notes (2026) · 0.00
- MedGraph: An Experimental Semantic Information Retrieval Method Using Knowledge Graph Embedding For The Biomedical Citations Indexed In PubMed (2021) · 0.00
- BIMCV-R: A Landmark Dataset For 3D CT Text-image Retrieval (2024) · 8.09
- Content-based 3D Image Retrieval And A ColBERT-inspired Re-ranking For Tumor Flagging And Staging (2025) · 0.00