Taxonomy-based Negative Sampling In Personalized Semantic Search For E-commerce
2025 Β· Uthman Jinadu, Siawpeng Er, Le Yu, et al.
Abstract
Large retail outlets offer products that may be domain-specific, and this requires having a model that can understand subtle differences in similar items. Sampling techniques used to train these models are most of the time, computationally expensive or logistically challenging. These models also do not factor in users' previous purchase patterns or behavior, thereby retrieving irrelevant items for them. We present a semantic retrieval model for e-commerce search that embeds queries and products into a shared vector space and leverages a novel taxonomy-based hard-negative sampling(TB-HNS) strategy to mine contextually relevant yet challenging negatives. To further tailor retrievals, we incorporate user-level personalization by modeling each customer's past purchase history and behavior. In offline experiments, our approach outperforms BM25, ANCE and leading neural baselines on Recall@K, while live A/B testing shows substantial uplifts in conversion rate, add-to-cart rate, and average or
Authors
(none)
Tags
Stats
Related papers
- ESANS: Effective And Semantic-aware Negative Sampling For Large-scale Retrieval Systems (2025)2.26
- Unified Embedding Based Personalized Retrieval In Etsy Search (2023)2.26
- Retrieval-grpo: A Multi-objective Reinforcement Learning Framework For Dense Retrieval In Taobao Search (2025)0.00
- From Pixels To Purchase: Building And Evaluating A Taxonomy-decoupled Visual Search Engine For Home Goods E-commerce (2026)0.00
- Multi-objective Personalized Product Retrieval In Taobao Search (2022)0.00
- Multimodal Semantic Retrieval For Product Search (2025)3.58
- Pre-training Tasks For User Intent Detection And Embedding Retrieval In E-commerce Search (2022)9.41
- Embedding-based Product Retrieval In Taobao Search (2021)13.70