Semantic-enhanced Modality-asymmetric Retrieval For Online E-commerce Search
2025 Β· Zhigong Zhou, Ning Ding, Xiaochuan Fan, et al.
Abstract
Semantic retrieval, which retrieves semantically matched items given a textual query, has been an essential component to enhance system effectiveness in e-commerce search. In this paper, we study the multimodal retrieval problem, where the visual information (e.g, image) of item is leveraged as supplementary of textual information to enrich item representation and further improve retrieval performance. Though learning from cross-modality data has been studied extensively in tasks such as visual question answering or media summarization, multimodal retrieval remains a non-trivial and unsolved problem especially in the asymmetric scenario where the query is unimodal while the item is multimodal. In this paper, we propose a novel model named SMAR, which stands for Semantic-enhanced Modality-Asymmetric Retrieval, to tackle the problem of modality fusion and alignment in this kind of asymmetric scenario. Extensive experimental results on an industrial dataset show that the proposed model ou
Authors
(none)
Tags
Stats
Related papers
- Multimodal Semantic Retrieval For Product Search (2025)3.58
- MRSE: An Efficient Multi-modality Retrieval System For Large Scale E-commerce (2024)0.00
- Transformer-empowered Multi-modal Item Embedding For Enhanced Image Search In E-commerce (2023)4.52
- Combating Visual Neglect And Semantic Drift In Large Multimodal Models For Enhanced Cross-modal Retrieval (2026)0.00
- Uniecs: Unified Multimodal E-commerce Search Framework With Gated Cross-modal Fusion (2025)2.60
- Cross-modal Semantic Enhanced Interaction For Image-sentence Retrieval (2022)12.33
- Asr-enhanced Multimodal Representation Learning For Cross-domain Product Retrieval (2024)0.00
- MUST: An Effective And Scalable Framework For Multimodal Search Of Target Modality (2023)7.81