Cohort Retrieval Using Dense Passage Retrieval
2025 Β· Pranav Jadhav
Abstract
Patient cohort retrieval is a pivotal task in medical research and clinical practice, enabling the identification of specific patient groups from extensive electronic health records (EHRs). In this work, we address the challenge of cohort retrieval in the echocardiography domain by applying Dense Passage Retrieval (DPR), a prominent methodology in semantic search. We propose a systematic approach to transform an echocardiographic EHR dataset of unstructured nature into a Query-Passage dataset, framing the problem as a Cohort Retrieval task. Additionally, we design and implement evaluation metrics inspired by real-world clinical scenarios to rigorously test the models across diverse retrieval tasks. Furthermore, we present a custom-trained DPR embedding model that demonstrates superior performance compared to traditional and off-the-shelf SOTA methods.To our knowledge, this is the first work to apply DPR for patient cohort retrieval in the echocardiography domain, establishing a framewo
Authors
(none)
Tags
Stats
Related papers
- Improving Dense Passage Retrieval With Multiple Positive Passages (2025)0.00
- Dense Passage Retrieval: Is It Retrieving? (2024)6.34
- DAPR: A Benchmark On Document-aware Passage Retrieval (2023)5.18
- Multi-cpr: A Multi Domain Chinese Dataset For Passage Retrieval (2022)0.00
- PARM: A Paragraph Aggregation Retrieval Model For Dense Document-to-document Retrieval (2022)8.35
- MA-DPR: Manifold-aware Distance Metrics For Dense Passage Retrieval (2025)0.00
- Query-as-context Pre-training For Dense Passage Retrieval (2022)7.68
- Pmc-patients: A Large-scale Dataset Of Patient Summaries And Relations For Benchmarking Retrieval-based Clinical Decision Support Systems (2022)11.39