Composed Image Retrieval With Text Feedback Via Multi-grained Uncertainty Regularization
2022 Β· Yiyang Chen, Zhedong Zheng, Wei Ji, et al.
Abstract
We investigate composed image retrieval with text feedback. Users gradually look for the target of interest by moving from coarse to fine-grained feedback. However, existing methods merely focus on the latter, i.e., fine-grained search, by harnessing positive and negative pairs during training. This pair-based paradigm only considers the one-to-one distance between a pair of specific points, which is not aligned with the one-to-many coarse-grained retrieval process and compromises the recall rate. In an attempt to fill this gap, we introduce a unified learning approach to simultaneously modeling the coarse- and fine-grained retrieval by considering the multi-grained uncertainty. The key idea underpinning the proposed method is to integrate fine- and coarse-grained retrieval as matching data points with small and large fluctuations, respectively. Specifically, our method contains two modules: uncertainty modeling and uncertainty regularization. (1) The uncertainty modeling simulates the
Authors
(none)
Tags
Stats
Related papers
- Ranking-aware Uncertainty For Text-guided Image Retrieval (2023)0.00
- Heterogeneous Uncertainty-guided Composed Image Retrieval With Fine-grained Probabilistic Learning (2026)0.00
- Multi-modal Reference Learning For Fine-grained Text-to-image Retrieval (2025)6.77
- Bi-directional Training For Composed Image Retrieval Via Text Prompt Learning (2023)15.63
- Learning With Multi-modal Gradient Attention For Explainable Composed Image Retrieval (2023)0.00
- Image Search With Text Feedback By Additive Attention Compositional Learning (2022)0.00
- Composing Text And Image For Image Retrieval - An Empirical Odyssey (2018)18.71
- Candidate Set Re-ranking For Composed Image Retrieval With Dual Multi-modal Encoder (2023)2.64