Hierarchical Similarity Learning For Language-based Product Image Retrieval
2021 Β· Zhe Ma, Fenghao Liu, Jianfeng Dong, et al.
Abstract
This paper aims for the language-based product image retrieval task. The majority of previous works have made significant progress by designing network structure, similarity measurement, and loss function. However, they typically perform vision-text matching at certain granularity regardless of the intrinsic multiple granularities of images. In this paper, we focus on the cross-modal similarity measurement, and propose a novel Hierarchical Similarity Learning (HSL) network. HSL first learns multi-level representations of input data by stacked encoders, and object-granularity similarity and image-granularity similarity are computed at each level. All the similarities are combined as the final hierarchical cross-modal similarity. Experiments on a large-scale product retrieval dataset demonstrate the effectiveness of our proposed method. Code and data are available at https://github.com/liufh1/hsl.
Authors
(none)
Tags
Stats
Code
- liufh1/hslβ
Related papers
- Learning Visual Hierarchies In Hyperbolic Space For Image Retrieval (2024)0.00
- Integrating Visual And Semantic Similarity Using Hierarchies For Image Retrieval (2023)4.52
- Unified Vision-language Representation Modeling For E-commerce Same-style Products Retrieval (2023)6.34
- V\(^2\)L: Leveraging Vision And Vision-language Models Into Large-scale Product Retrieval (2022)0.00
- Hierarchical Multi-positive Contrastive Learning For Patent Image Retrieval (2025)0.00
- Adaptive Semantic-visual Tree For Hierarchical Embeddings (2020)8.09
- PRISM: Product Retrieval In Shopping Carts Using Hybrid Matching (2025)0.00
- Hihpq: Hierarchical Hyperbolic Product Quantization For Unsupervised Image Retrieval (2024)6.77