Deep Reversible Consistency Learning For Cross-modal Retrieval
2025 · Ruitao Pu, Yang Qin, Dezhong Peng, et al.
Abstract
Cross-modal retrieval (CMR) typically involves learning common representations so that similarities between multimodal samples can be measured directly. Most existing CMR methods assume that multimodal samples come in pairs and use joint training to learn common representations, which limits the flexibility of CMR. Although some methods train each modality independently to improve flexibility, they use randomly initialized orthogonal matrices to guide representation learning. This is suboptimal because it assumes that inter-class samples are independent of each other, limiting the potential for semantic alignment between sample representations and ground-truth labels. To address these issues, we propose a novel method termed Deep Reversible Consistency Learning (DRCL) for cross-modal retrieval. DRCL includes two core modules, i.e., Selective Prior Learning (SPL) and Reversible Semantic Consistency learning (RSC). More specifically, SPL first learns a transformation weigh
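The limitation attributed above to randomly initialized orthogonal matrices can be illustrated with a minimal sketch. It assumes NumPy, and the helper name `random_orthogonal_anchors` and the sizes used are hypothetical choices for illustration; this is not the DRCL method, nor the code of any specific baseline. It only shows that when class anchors are drawn as orthonormal vectors, every pair of distinct classes has zero similarity, so no relationship between classes can be encoded in the prior.

```python
import numpy as np

def random_orthogonal_anchors(num_classes, dim, seed=0):
    """Return num_classes orthonormal anchor vectors of length dim.

    Hypothetical helper for illustration only.
    """
    if num_classes > dim:
        raise ValueError("need dim >= num_classes for orthonormal anchors")
    rng = np.random.default_rng(seed)
    # The reduced QR factorization of a random Gaussian matrix yields
    # a (dim, num_classes) matrix with orthonormal columns.
    q, _ = np.linalg.qr(rng.standard_normal((dim, num_classes)))
    return q.T  # one anchor per row, shape (num_classes, dim)

anchors = random_orthogonal_anchors(num_classes=5, dim=16)

# Pairwise inner products between class anchors.
gram = anchors @ anchors.T

# Zero out the diagonal to inspect only the cross-class similarities.
off_diagonal = gram - np.diag(np.diag(gram))
max_cross_similarity = float(np.abs(off_diagonal).max())
print("largest cross-class similarity:", max_cross_similarity)
```

Up to floating-point error the Gram matrix is the identity, which is the sense in which such a prior treats classes as mutually independent regardless of how semantically close their labels are.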
Related papers
- Discriminative Semantic Transitive Consistency For Cross-modal Learning (2021) 0.00
- Adversarial Cross-modal Retrieval Via Learning And Transferring Single-modal Similarities (2019) 8.60
- Cross-modal Deep Metric Learning With Multi-task Regularization (2017) 9.03
- A Unified Optimal Transport Framework For Cross-modal Retrieval With Noisy Labels (2024) 5.24
- Continual Learning In Cross-modal Retrieval (2021) 9.41
- Semcore: A Semantic-enhanced Generative Cross-modal Retrieval Framework With Mllms (2025) 0.00
- A Comprehensive Empirical Study Of Vision-language Pre-trained Model For Supervised Cross-modal Retrieval (2022) 0.00
- CL2CM: Improving Cross-lingual Cross-modal Retrieval Via Cross-lingual Knowledge Transfer (2023) 8.60