Enhancing Medical Cross-modal Hashing Retrieval Using Dropout-voting Mixture-of-experts Fusion
2025 Β· Jaewon Ahn, Woosung Jang, Beakcheol Jang
Abstract
In recent years, cross-modal retrieval using images and text has become an active area of research, especially in the medical domain. The abundance of data in various modalities in this field has led to a growing importance of cross-modal retrieval for efficient image interpretation, data-driven diagnostic support, and medical education. In the context of the increasing integration of distributed medical data across healthcare facilities with the objective of enhancing interoperability, it is imperative to optimize the performance of retrieval systems in terms of the speed, memory efficiency, and accuracy of the retrieved data. This necessity arises in response to the substantial surge in data volume that characterizes contemporary medical practices. In this study, we propose a novel framework that incorporates dropout voting and mixture-of-experts (MoE) based contrastive fusion modules into a CLIP-based cross-modal hashing retrieval structure. We also propose the application of hybrid
Authors
(none)
Tags
Stats
Related papers
- Fusion-supervised Deep Cross-modal Hashing (2019)8.60
- CLIP Multi-modal Hashing For Multimedia Retrieval (2024)3.58
- Prompthash: Affinity-prompted Collaborative Cross-modal Learning For Adaptive Hashing Retrieval (2025)7.70
- Transitive Hashing Network For Heterogeneous Multimedia Retrieval (2016)8.35
- Deep Supervised Information Bottleneck Hashing For Cross-modal Retrieval Based Computer-aided Diagnosis (2022)0.00
- Revisiting Medical Image Retrieval Via Knowledge Consolidation (2025)6.34
- Efficient Discrete Supervised Hashing For Large-scale Cross-modal Retrieval (2019)11.08
- Lightweight Contrastive Distilled Hashing For Online Cross-modal Retrieval (2025)4.52