Exploiting "quantum-like Interference" In Decision Fusion For Ranking Multimodal Documents
2018 Β· Dimitris Gkoumas, Dawei Sogn
Abstract
Fusing and ranking multimodal information remains always a challenging task. A robust decision-level fusion method should not only be dynamically adaptive for assigning weights to each representation but also incorporate inter-relationships among different modalities. In this paper, we propose a quantum-inspired model for fusing and ranking visual and textual information accounting for the dependency between the aforementioned modalities. At first, we calculate the text-based and image-based similarity individually. Two different approaches have been applied for computing each unimodal similarity. The first one makes use of the bag-of-words model. For the second one, a pre-trained VGG19 model on ImageNet has been used for calculating the image similarity, while a query expansion approach has been applied to the text-based query for improving the retrieval performance. Afterward, the local similarity scores fit the proposed quantum-inspired model. The inter-dependency between the two mo
Authors
(none)
Tags
Stats
Related papers
- Joint Fusion And Encoding: Advancing Multimodal Retrieval From The Ground Up (2025)0.00
- MUST: An Effective And Scalable Framework For Multimodal Search Of Target Modality (2023)7.81
- Matching Images And Text With Multi-modal Tensor Fusion And Re-ranking (2019)19.77
- Multimodal Prediction Based On Graph Representations (2019)0.00
- Mire: Enhancing Multimodal Queries Representation Via Fusion-free Modality Interaction For Multimodal Retrieval (2024)3.81
- MMMORRF: Multimodal Multilingual Modularized Reciprocal Rank Fusion (2025)2.26
- Unified Interactive Multimodal Moment Retrieval Via Cascaded Embedding-reranking And Temporal-aware Score Fusion (2025)0.00
- End-to-end Training Of Multimodal Model And Ranking Model (2024)0.00