Heterogeneous Uncertainty-guided Composed Image Retrieval With Fine-grained Probabilistic Learning
2026 Β· Haomiao Tang, Jinpeng Wang, Minyi Zhao, et al.
Abstract
Composed Image Retrieval (CIR) enables image search by combining a reference image with modification text. Intrinsic noise in CIR triplets incurs intrinsic uncertainty and threatens the model's robustness. Probabilistic learning approaches have shown promise in addressing such issues; however, they fall short for CIR due to their instance-level holistic modeling and homogeneous treatment of queries and targets. This paper introduces a Heterogeneous Uncertainty-Guided (HUG) paradigm to overcome these limitations. HUG utilizes a fine-grained probabilistic learning framework, where queries and targets are represented by Gaussian embeddings that capture detailed concepts and uncertainties. We customize heterogeneous uncertainty estimations for multi-modal queries and uni-modal targets. Given a query, we capture uncertainties not only regarding uni-modal content quality but also multi-modal coordination, followed by a provable dynamic weighting mechanism to derive comprehensive query uncert
Authors
(none)
Tags
Stats
Related papers
- HABIT: Chrono-synergia Robust Progressive Learning Framework For Composed Image Retrieval (2026)2.35
- HINT: Composed Image Retrieval With Dual-path Compositional Contextualized Network (2026)0.78
- INTENT: Invariance And Discrimination-aware Noise Mitigation For Robust Composed Image Retrieval (2026)0.00
- NCL-CIR: Noise-aware Contrastive Learning For Composed Image Retrieval (2025)2.26
- Composed Image Retrieval With Text Feedback Via Multi-grained Uncertainty Regularization (2022)0.00
- DQE-CIR: Distinctive Query Embeddings Through Learnable Attribute Weights And Target Relative Negative Sampling In Composed Image Retrieval (2026)0.00
- Pseudo-triplet Guided Few-shot Composed Image Retrieval (2024)0.00
- A Sanity Check On Composed Image Retrieval (2026)0.00