Deep Boosting Learning: A Brand-new Cooperative Approach For Image-text Matching
2024 · Haiwen Diao, Ying Zhang, Shang Gao, et al.
Abstract
Image-text matching remains a challenging task due to heterogeneous semantic diversity across modalities and insufficient distance separability within triplets. Unlike previous approaches that focus on enhancing multi-modal representations or exploiting cross-modal correspondence for more accurate retrieval, this paper leverages knowledge transfer between peer branches in a boosting manner to seek a more powerful matching model. Specifically, we propose a brand-new Deep Boosting Learning (DBL) algorithm, where an anchor branch is first trained to provide insights into the data properties, while a target branch gains more advanced knowledge to develop optimal features and distance metrics. Concretely, the anchor branch initially learns the absolute or relative distance between positive and negative pairs, providing a foundational understanding of the particular network and data distribution. Building upon this knowledge, a target branch is concurrently tasked with
Related papers
- Boosting Weak Positives For Text Based Person Search (2025) · 0.00
- Enhancing Image-text Matching With Adaptive Feature Aggregation (2024) · 6.34
- ALADIN: Distilling Fine-grained Alignment Scores For Efficient Image-text Matching And Retrieval (2022) · 14.00
- Deep Multimodal Image-text Embeddings For Automatic Cross-media Retrieval (2020) · 0.00
- A New Fine-grained Alignment Method For Image-text Matching (2023) · 0.00
- Learning Image-text Matching With Optimal Partial Transport (2026) · 0.00
- Towards Fast And Accurate Image-text Retrieval With Self-supervised Fine-grained Alignment (2023) · 11.99
- DEMO: A Statistical Perspective For Efficient Image-text Matching (2024) · 4.52