Unified Vision-language Representation Modeling For E-commerce Same-style Products Retrieval
2023 Β· Ben Chen, Linbo Jin, Xinxin Wang, et al.
Abstract
Same-style products retrieval plays an important role in e-commerce platforms, aiming to identify the same products which may have different text descriptions or images. It can be used for similar products retrieval from different suppliers or duplicate products detection of one supplier. Common methods use the image as the detected object, but they only consider the visual features and overlook the attribute information contained in the textual descriptions, and perform weakly for products in image less important industries like machinery, hardware tools and electronic component, even if an additional text matching module is added. In this paper, we propose a unified vision-language modeling method for e-commerce same-style products retrieval, which is designed to represent one product with its textual descriptions and visual contents. It contains one sampling skill to collect positive pairs from user click log with category and relevance constrained, and a novel contrastive loss unit
Authors
(none)
Tags
Stats
Related papers
- V\(^2\)L: Leveraging Vision And Vision-language Models Into Large-scale Product Retrieval (2022)0.00
- Visually Similar Products Retrieval For Shopsy (2022)2.26
- Delving Into E-commerce Product Retrieval With Vision-language Pre-training (2023)6.77
- MAKE: Vision-language Pre-training Based Product Retrieval In Taobao Search (2023)7.81
- Multimodal Semantic Retrieval For Product Search (2025)3.58
- Hierarchical Similarity Learning For Language-based Product Image Retrieval (2021)6.93
- Large-scale Product Retrieval With Weakly Supervised Representation Learning (2022)0.00
- Asr-enhanced Multimodal Representation Learning For Cross-domain Product Retrieval (2024)0.00