OFFSET: Segmentation-based Focus Shift Revision For Composed Image Retrieval
2025 · Zhiwei Chen, Yupeng Hu, Zixu Li, et al.
Abstract
Composed Image Retrieval (CIR) represents a novel retrieval paradigm that is capable of expressing users' intricate retrieval requirements flexibly. It enables the user to give a multimodal query, comprising a reference image and a modification text, and subsequently retrieve the target image. Notwithstanding the considerable advances made by prevailing methodologies, CIR remains in its nascent stages due to two limitations: 1) inhomogeneity between dominant and noisy portions in visual data is ignored, leading to query feature degradation, and 2) the priority of textual data in the image modification process is overlooked, which leads to a visual focus bias. To address these two limitations, this work presents a focus mapping-based feature extractor, which consists of two modules: dominant portion segmentation and dual focus mapping. It is designed to identify significant dominant portions in images and guide the extraction of visual and textual data features, thereby reducing the imp
Related papers
- FBCIR: Balancing Cross-modal Focuses In Composed Image Retrieval (2026) 0.00
- HINT: Composed Image Retrieval With Dual-path Compositional Contextualized Network (2026) 0.78
- A Sanity Check On Composed Image Retrieval (2026) 0.00
- Infocir: Multimedia Analysis For Composed Image Retrieval (2026) 1.24
- CSMCIR: Cot-enhanced Symmetric Alignment With Memory Bank For Composed Image Retrieval (2026) 0.00
- TMCIR: Token Merge Benefits Composed Image Retrieval (2025) 0.00
- From Mapping To Composing: A Two-stage Framework For Zero-shot Composed Image Retrieval (2025) 0.00
- Finecir: Explicit Parsing Of Fine-grained Modification Semantics For Composed Image Retrieval (2025) 2.16