Query-LIFE: Query-aware Language Image Fusion Embedding for E-commerce Relevance
2023 · Hai Zhu, Yuankai Guo, Ronggang Dou, et al.
Abstract
The relevance module plays a fundamental role in e-commerce search: it is responsible for selecting relevant products from thousands of items based on user queries, thereby improving user experience and efficiency. The traditional approach models relevance based on product titles and queries, but the information in titles alone may be insufficient to describe products completely. A more general optimization is to also leverage product image information. In recent years, vision-language pre-training models, which use contrastive learning to map both textual and visual features into a joint embedding space, have achieved impressive results in many scenarios. In e-commerce, a common practice is to fine-tune the pre-trained model on e-commerce data. However, the resulting performance is sub-optimal because vision-language pre-training models lack alignment specifically designed for queries. In this paper, we propose a method called Query-LIFE (Query-aware Langua
Related papers
- Learning Image Information For Ecommerce Queries (2019) 0.00
- Enhancing Interactive Image Retrieval With Query Rewriting Using Large Language Models And Vision Language Models (2024) 8.82
- Mine And Refine: Optimizing Graded Relevance In E-commerce Search Retrieval (2026) 0.00
- V\(^2\)L: Leveraging Vision And Vision-language Models Into Large-scale Product Retrieval (2022) 0.00
- VL-CLIP: Enhancing Multimodal Recommendations Via Visual Grounding And LLM-augmented CLIP Embeddings (2025) 2.26
- Large Reasoning Embedding Models: Towards Next-generation Dense Retrieval Paradigm (2025) 0.00
- MAKE: Vision-language Pre-training Based Product Retrieval In Taobao Search (2023) 7.81
- Transformer-empowered Multi-modal Item Embedding For Enhanced Image Search In E-commerce (2023) 4.52