ACE-BERT: Adversarial Cross-modal Enhanced BERT For E-commerce Retrieval
2021 · Boxuan Zhang, Chao Wei, Yan Jin, et al.
Abstract
On E-commerce platforms, products are presented to customers through multiple modalities, and these modalities matter to a retrieval system that aims to surface attractive products. How to take those modalities into account simultaneously to boost retrieval performance is therefore crucial. The problem is challenging for two reasons: (1) Extracting patch features with a pre-trained image model (e.g., a CNN-based model) carries considerable inductive bias, which makes it difficult to capture useful information from E-commerce product images. (2) The heterogeneity of multimodal data makes it challenging to construct representations of the query text and the product, including its title and image, in a common subspace. We propose a novel Adversarial Cross-modal Enhanced BERT (ACE-BERT) for efficient E-commerce retrieval. In detail, ACE-BERT leverages both patch features and pixel features as the image representation. Thus the Tra
Related papers
- ASR-enhanced Multimodal Representation Learning For Cross-domain Product Retrieval (2024) 0.00
- UniECS: Unified Multimodal E-commerce Search Framework With Gated Cross-modal Fusion (2025) 2.60
- Product1M: Towards Weakly Supervised Instance-level Product Retrieval Via Cross-modal Pretraining (2021) 12.61
- Extending CLIP For Category-to-image Retrieval In E-commerce (2021) 8.60
- Transformer-empowered Multi-modal Item Embedding For Enhanced Image Search In E-commerce (2023) 4.52
- Entity-graph Enhanced Cross-modal Pretraining For Instance-level Product Retrieval (2022) 5.24
- Optimizing Product Deduplication In E-commerce With Multimodal Embeddings (2025) 0.00
- FashionBERT: Text And Image Matching With Adaptive Loss For Cross-modal Retrieval (2020) 15.16