Entity-graph Enhanced Cross-modal Pretraining For Instance-level Product Retrieval
2022 · Xiao Dong, Xunlin Zhan, Yunchao Wei, et al.
Abstract
This work studies a more realistic setting: weakly-supervised multi-modal instance-level product retrieval for fine-grained product categories. We first contribute the Product1M datasets and define two practical instance-level retrieval tasks that enable evaluation on price comparison and personalized recommendation. For both tasks, it is challenging to accurately pinpoint the product target mentioned in the visual-linguistic data and to effectively reduce the influence of irrelevant content. To address this, we train a more effective cross-modal pretraining model that adaptively incorporates key concept information from the multi-modal data by using an entity graph whose nodes denote entities and whose edges denote the similarity relations between them. Specifically, a novel Entity-Graph Enhanced Cross-Modal Pretraining (EGE-CMP) model is proposed for instance-level com
Related papers
- Product1M: Towards Weakly Supervised Instance-level Product Retrieval Via Cross-modal Pretraining (2021) · 12.61
- Multimodal Semantic Retrieval For Product Search (2025) · 3.58
- ASR-enhanced Multimodal Representation Learning For Cross-domain Product Retrieval (2024) · 0.00
- ACE-BERT: Adversarial Cross-modal Enhanced BERT For E-commerce Retrieval (2021) · 0.00
- Graph Contrastive Learning With Multi-objective For Personalized Product Retrieval In Taobao Search (2023) · 0.00
- MAKE: Vision-language Pre-training Based Product Retrieval In Taobao Search (2023) · 7.81
- Embedding-based Product Retrieval In Taobao Search (2021) · 13.70
- Specializing Joint Representations For The Task Of Product Recommendation (2017) · 8.35