Generative Retrieval As Dense Retrieval
2023 Β· Thong Nguyen, Andrew Yates
Abstract
Generative retrieval is a promising new neural retrieval paradigm that aims to optimize the retrieval pipeline by performing both indexing and retrieval with a single transformer model. However, this new paradigm faces challenges with updating the index and scaling to large collections. In this paper, we analyze two prominent variants of generative retrieval and show that they can be conceptually viewed as bi-encoders for dense retrieval. Specifically, we analytically demonstrate that the generative retrieval process can be decomposed into dot products between query and document vectors, similar to dense retrieval. This analysis leads us to propose a new variant of generative retrieval, called Tied-Atomic, which addresses the updating and scaling issues by incorporating techniques from dense retrieval. In experiments on two datasets, NQ320k and the full MSMARCO, we confirm that this approach does not reduce retrieval effectiveness while enabling the model to scale to large collections.
Authors
(none)
Tags
Stats
Related papers
- Generative Retrieval As Multi-vector Dense Retrieval (2024)8.60
- Does Generative Retrieval Overcome The Limitations Of Dense Retrieval? (2025)0.00
- Generative Dense Retrieval: Memory Can Be A Burden (2024)4.52
- Hierarchical Corpus Encoder: Fusing Generative Retrieval And Dense Indices (2025)0.00
- Scalable And Effective Generative Information Retrieval (2023)10.48
- How Does Generative Retrieval Scale To Millions Of Passages? (2023)10.61
- Generative Retrieval Meets Multi-graded Relevance (2024)2.26
- Irgen: Generative Modeling For Image Retrieval (2023)7.16