Embracing Structure In Data For Billion-scale Semantic Product Search
2021 Β· Vihan Lakshman, Choon Hui Teo, Xiaowen Chu, et al.
Abstract
We present principled approaches to train and deploy dyadic neural embedding models at the billion scale, focusing our investigation on the application of semantic product search. When training a dyadic model, one seeks to embed two different types of entities (e.g., queries and documents or users and movies) in a common vector space such that pairs with high relevance are positioned nearby. During inference, given an embedding of one type (e.g., a query or a user), one seeks to retrieve the entities of the other type (e.g., documents or movies, respectively) that are highly relevant. In this work, we show that exploiting the natural structure of real-world datasets helps address both challenges efficiently. Specifically, we model dyadic data as a bipartite graph with edges between pairs with positive associations. We then propose to partition this network into semantically coherent clusters and thus reduce our search space by focusing on a small subset of these partitions for a given
Authors
(none)
Tags
Stats
Related papers
- Unified Embedding Based Personalized Retrieval In Etsy Search (2023)2.26
- Neural IR Meets Graph Embedding: A Ranking Model For Product Search (2019)11.85
- Explainable Product Search With A Dynamic Relation Embedding Model (2019)12.33
- Enterpriseem: Fine-tuned Embeddings For Enterprise Semantic Search (2024)0.00
- Large Reasoning Embedding Models: Towards Next-generation Dense Retrieval Paradigm (2025)0.00
- Rethinking Similarity Search: Embracing Smarter Mechanisms Over Smarter Data (2023)0.00
- Learning Deep Semantic Model For Code Search Using Codesearchnet Corpus (2022)3.16
- Multimodal Semantic Retrieval For Product Search (2025)3.58