ESANS: Effective And Semantic-aware Negative Sampling For Large-scale Retrieval Systems
2025 Β· Haibo Xing, Kanefumi Matsuyama, Hao Deng, et al.
Abstract
Industrial recommendation systems typically involve a two-stage process: retrieval and ranking, which aims to match users with millions of items. In the retrieval stage, classic embedding-based retrieval (EBR) methods depend on effective negative sampling techniques to enhance both performance and efficiency. However, existing techniques often suffer from false negatives, high cost for ensuring sampling quality and semantic information deficiency. To address these limitations, we propose Effective and Semantic-Aware Negative Sampling (ESANS), which integrates two key components: Effective Dense Interpolation Strategy (EDIS) and Multimodal Semantic-Aware Clustering (MSAC). EDIS generates virtual samples within the low-dimensional embedding space to improve the diversity and density of the sampling distribution while minimizing computational costs. MSAC refines the negative sampling distribution by hierarchically clustering item representations based on multimodal information (visual, te
Authors
(none)
Tags
Stats
Related papers
- Taxonomy-based Negative Sampling In Personalized Semantic Search For E-commerce (2025)0.00
- Syneg: Llm-driven Synthetic Hard-negatives For Dense Retrieval (2024)0.00
- Domain-adaptive And Scalable Dense Retrieval For Content-based Recommendation (2026)0.00
- Trisampler: A Better Negative Sampling Principle For Dense Retrieval (2024)5.84
- MRSE: An Efficient Multi-modality Retrieval System For Large Scale E-commerce (2024)0.00
- Pebr: A Probabilistic Approach To Embedding Based Retrieval (2024)0.00
- Hierarchical Structured Neural Network: Efficient Retrieval Scaling For Large Scale Recommendation (2024)0.00
- Optimizing Dense Retrieval Model Training With Hard Negatives (2021)16.34