Detailfusion: A Dual-branch Framework With Detail Enhancement For Composed Image Retrieval
2025 Β· Yuxin Yang, Yinan Zhou, Yuxin Chen, et al.
Abstract
Composed Image Retrieval (CIR) aims to retrieve target images from a gallery based on a reference image and modification text as a combined query. Recent approaches focus on balancing global information from two modalities and encode the query into a unified feature for retrieval. However, due to insufficient attention to fine-grained details, these coarse fusion methods often struggle with handling subtle visual alterations or intricate textual instructions. In this work, we propose DetailFusion, a novel dual-branch framework that effectively coordinates information across global and detailed granularities, thereby enabling detail-enhanced CIR. Our approach leverages atomic detail variation priors derived from an image editing dataset, supplemented by a detail-oriented optimization strategy to develop a Detail-oriented Inference Branch. Furthermore, we design an Adaptive Feature Compositor that dynamically fuses global and detailed features based on fine-grained information of each un
Authors
(none)
Tags
Stats
Related papers
- DAFM: Dynamic Adaptive Fusion For Multi-model Collaboration In Composed Image Retrieval (2025)0.00
- HINT: Composed Image Retrieval With Dual-path Compositional Contextualized Network (2026)0.78
- Coarse2fine: Two-layer Fusion For Image Retrieval (2016)0.00
- Far-net: Multi-stage Fusion Network With Enhanced Semantic Alignment And Adaptive Reconciliation For Composed Image Retrieval (2025)0.00
- OFFSET: Segmentation-based Focus Shift Revision For Composed Image Retrieval (2025)5.84
- Finecir: Explicit Parsing Of Fine-grained Modification Semantics For Composed Image Retrieval (2025)2.16
- A Sanity Check On Composed Image Retrieval (2026)0.00
- Generative Editing In The Joint Vision-language Space For Zero-shot Composed Image Retrieval (2025)0.00