HGAN: Hierarchical Graph Alignment Network For Image-text Retrieval
2022 Β· Jie Guo, Meiting Wang, Yan Zhou, et al.
Abstract
Image-text retrieval (ITR) is a challenging task in the field of multimodal information processing due to the semantic gap between different modalities. In recent years, researchers have made great progress in exploring the accurate alignment between image and text. However, existing works mainly focus on the fine-grained alignment between image regions and sentence fragments, which ignores the guiding significance of context background information. Actually, integrating the local fine-grained information and global context background information can provide more semantic clues for retrieval. In this paper, we propose a novel Hierarchical Graph Alignment Network (HGAN) for image-text retrieval. First, to capture the comprehensive multimodal features, we construct the feature graphs for the image and text modality respectively. Then, a multi-granularity shared space is established with a designed Multi-granularity Feature Aggregation and Rearrangement (MFAR) module, which enhances the s
Authors
(none)
Tags
Stats
Related papers
- Hanet: Hierarchical Alignment Networks For Video-text Retrieval (2021)0.00
- Hyperbolic Hierarchical Alignment Reasoning Network For Text-3d Retrieval (2025)1.81
- Transcending Fusion: A Multi-scale Alignment Method For Remote Sensing Image-text Retrieval (2024)11.92
- A New Fine-grained Alignment Method For Image-text Matching (2023)0.00
- Scene Graph Based Fusion Network For Image-text Retrieval (2023)4.52
- Fine-grained Video-text Retrieval With Hierarchical Graph Reasoning (2020)18.27
- ALADIN: Distilling Fine-grained Alignment Scores For Efficient Image-text Matching And Retrieval (2022)14.00
- HADA: A Graph-based Amalgamation Framework In Image-text Retrieval (2023)7.05