Robust Remote Sensing Image-text Retrieval With Noisy Correspondence
2026 · Qiya Song, Yiqiang Xie, Yuan Sun, et al.
Abstract
As a pivotal task that bridges remote visual and linguistic understanding, Remote Sensing Image-Text Retrieval (RSITR) has attracted considerable research interest in recent years. However, almost all RSITR methods implicitly assume that image-text pairs are perfectly matched. In practice, acquiring a large set of well-aligned pairs is often prohibitively expensive or even infeasible. We also observe that existing remote sensing datasets (e.g., RSITMD) do contain some inaccurate or mismatched image-text descriptions. Based on these observations, we reveal an important but unexplored problem in RSITR, i.e., Noisy Correspondence (NC). To address this problem, we propose a novel Robust Remote Sensing Image-Text Retrieval (RRSITR) paradigm that uses a self-paced learning strategy to mimic human cognitive learning patterns, thereby learning from easy to hard samples in multi-modal data with NC. Specifically, we first divide all training sample pairs into three categories
Related papers
- Transcending Fusion: A Multi-scale Alignment Method For Remote Sensing Image-text Retrieval (2024) · 11.92
- An Unsupervised Cross-modal Hashing Method Robust To Noisy Training Image-text Correspondences In Remote Sensing (2022) · 7.16
- iEBAKER: Improved Remote Sensing Image-text Retrieval Framework Via Eliminate Before Align And Keyword Explicit Reasoning (2025) · 2.86
- Remote Sensing Cross-modal Text-image Retrieval Based On Global And Local Information (2022) · 19.48
- Self-supervised Cross-modal Text-image Time Series Retrieval In Remote Sensing (2025) · 3.58
- PMPGuard: Catching Pseudo-matched Pairs In Remote Sensing Image-text Retrieval (2025) · 0.00
- TSVC: Tripartite Learning With Semantic Variation Consistency For Robust Image-text Retrieval (2025) · 3.58
- Fast-then-fine: A Two-stage Framework With Multi-granular Representation For Cross-modal Retrieval In Remote Sensing (2026) · 0.00