HABIT: Chrono-synergia Robust Progressive Learning Framework For Composed Image Retrieval
2026 Β· Zixu Li, Yupeng Hu, Zhiwei Chen, et al.
Abstract
Composed Image Retrieval (CIR) is a flexible image retrieval paradigm that enables users to accurately locate the target image through a multimodal query composed of a reference image and modification text. Although this task has demonstrated promising applications in personalized search and recommendation systems, it encounters a severe challenge in practical scenarios known as the Noise Triplet Correspondence (NTC) problem. This issue primarily arises from the high cost and subjectivity involved in annotating triplet data. To address this problem, we identify two central challenges: the precise estimation of composed semantic discrepancy and the insufficient progressive adaptation to modification discrepancy. To tackle these challenges, we propose a cHrono-synergiA roBust progressIve learning framework for composed image reTrieval (HABIT), which consists of two core modules. First, the Mutual Knowledge Estimation Module quantifies sample cleanliness by calculating the Transition Rate
Authors
(none)
Tags
Stats
Related papers
- HINT: Composed Image Retrieval With Dual-path Compositional Contextualized Network (2026)0.78
- INTENT: Invariance And Discrimination-aware Noise Mitigation For Robust Composed Image Retrieval (2026)0.00
- NCL-CIR: Noise-aware Contrastive Learning For Composed Image Retrieval (2025)2.26
- Conesep: Cone-based Robust Noise-unlearning Compositional Network For Composed Image Retrieval (2026)0.00
- Air-know: Arbiter-calibrated Knowledge-internalizing Robust Network For Composed Image Retrieval (2026)0.00
- Heterogeneous Uncertainty-guided Composed Image Retrieval With Fine-grained Probabilistic Learning (2026)0.00
- Infocir: Multimedia Analysis For Composed Image Retrieval (2026)1.24
- CSMCIR: Cot-enhanced Symmetric Alignment With Memory Bank For Composed Image Retrieval (2026)0.00