CMIR-NET : A Deep Learning Based Model For Cross-modal Retrieval In Remote Sensing
2019 Β· Ushasi Chaudhuri, Biplab Banerjee, Avik Bhattacharya, et al.
Abstract
We address the problem of cross-modal information retrieval in the domain of remote sensing. In particular, we are interested in two application scenarios: i) cross-modal retrieval between panchromatic (PAN) and multi-spectral imagery, and ii) multi-label image retrieval between very high resolution (VHR) images and speech based label annotations. Notice that these multi-modal retrieval scenarios are more challenging than the traditional uni-modal retrieval approaches given the inherent differences in distributions between the modalities. However, with the growing availability of multi-source remote sensing data and the scarcity of enough semantic annotations, the task of multi-modal retrieval has recently become extremely important. In this regard, we propose a novel deep neural network based architecture which is considered to learn a discriminative shared feature space for all the input modalities, suitable for semantically coherent information retrieval. Extensive experiments are c
Authors
(none)
Tags
Stats
Related papers
- A Novel Self-supervised Cross-modal Image Retrieval Method In Remote Sensing (2022)8.35
- Cross-view Image Retrieval -- Ground To Aerial Image Retrieval Through Deep Learning (2020)5.24
- Exploring A Fine-grained Multiscale Method For Cross-modal Remote Sensing Image Retrieval (2022)16.73
- Remote Sensing Cross-modal Text-image Retrieval Based On Global And Local Information (2022)19.48
- Region Convolutional Features For Multi-label Remote Sensing Image Retrieval (2018)17.37
- Semi-supervised Cross-modal Retrieval With Label Prediction (2018)11.29
- Cross-modality Sub-image Retrieval Using Contrastive Multimodal Image Representations (2022)6.32
- Large Language Models For Captioning And Retrieving Remote Sensing Images (2024)0.00