DAFM: Dynamic Adaptive Fusion For Multi-model Collaboration In Composed Image Retrieval
2025 Β· Yawei Cai, Jiapeng Mi, Nan Ji, et al.
Abstract
Composed Image Retrieval (CIR) is a cross-modal task that aims to retrieve target images from large-scale databases using a reference image and a modification text. Most existing methods rely on a single model to perform feature fusion and similarity matching. However, this paradigm faces two major challenges. First, one model alone can't see the whole picture and the tiny details at the same time; it has to handle different tasks with the same weights, so it often misses the small but important links between image and text. Second, the absence of dynamic weight allocation prevents adaptive leveraging of complementary model strengths, so the resulting embedding drifts away from the target and misleads the nearest-neighbor search in CIR. To address these limitations, we propose Dynamic Adaptive Fusion (DAFM) for multi-model collaboration in CIR. Rather than optimizing a single method in isolation, DAFM exploits the complementary strengths of heterogeneous models and adaptively rebalance
Authors
(none)
Tags
Stats
Related papers
- Detailfusion: A Dual-branch Framework With Detail Enhancement For Composed Image Retrieval (2025)0.00
- Far-net: Multi-stage Fusion Network With Enhanced Semantic Alignment And Adaptive Reconciliation For Composed Image Retrieval (2025)0.00
- HINT: Composed Image Retrieval With Dual-path Compositional Contextualized Network (2026)0.78
- TMCIR: Token Merge Benefits Composed Image Retrieval (2025)0.00
- CSMCIR: Cot-enhanced Symmetric Alignment With Memory Bank For Composed Image Retrieval (2026)0.00
- A Sanity Check On Composed Image Retrieval (2026)0.00
- Infocir: Multimedia Analysis For Composed Image Retrieval (2026)1.24
- FBCIR: Balancing Cross-modal Focuses In Composed Image Retrieval (2026)0.00