On The Value Of Behavioral Representations For Dense Retrieval
2022 Β· Nan Jiang, Dhivya Eswaran, Choon Hui Teo, et al.
Abstract
We consider text retrieval within dense representational space in real-world settings such as e-commerce search where (a) document popularity and (b) diversity of queries associated with a document have a skewed distribution. Most of the contemporary dense retrieval literature presents two shortcomings in these settings. (1) They learn an almost equal number of representations per document, agnostic to the fact that a few head documents are disproportionately more critical to achieving a good retrieval performance. (ii) They learn purely semantic document representations inferred from intrinsic document characteristics which may not contain adequate information to determine the queries for which the document is relevant--especially when the document is short. We propose to overcome these limitations by augmenting semantic document representations learned by bi-encoders with behavioral document representations learned by our proposed approach MVG. To do so, MVG (1) determines how to div
Authors
(none)
Tags
Stats
Related papers
- What Are You Token About? Dense Retrieval As Distributions Over The Vocabulary (2022)8.09
- Improving Document Representations By Generating Pseudo Query Embeddings For Dense Retrieval (2021)9.41
- Pseudo-relevance Feedback For Multiple Representation Dense Retrieval (2021)12.93
- Investigating Multi-layer Representations For Dense Passage Retrieval (2025)0.00
- Multi-view Document Representation Learning For Open-domain Dense Retrieval (2022)10.21
- Multimodal Semantic Retrieval For Product Search (2025)3.58
- Learning Diverse Document Representations With Deep Query Interactions For Dense Retrieval (2022)2.51
- Large Reasoning Embedding Models: Towards Next-generation Dense Retrieval Paradigm (2025)0.00