Continual Learning For Generative Retrieval Over Dynamic Corpora
2023 Β· Jiangui Chen, Ruqing Zhang, Jiafeng Guo, et al.
Abstract
Generative retrieval (GR) directly predicts the identifiers of relevant documents (i.e., docids) based on a parametric model. It has achieved solid performance on many ad-hoc retrieval tasks. So far, these tasks have assumed a static document collection. In many practical scenarios, however, document collections are dynamic, where new documents are continuously added to the corpus. The ability to incrementally index new documents while preserving the ability to answer queries with both previously and newly indexed relevant documents is vital to applying GR models. In this paper, we address this practical continual learning problem for GR. We put forward a novel Continual-LEarner for generatiVE Retrieval (CLEVER) model and make two major contributions to continual learning for GR: (i) To encode new documents into docids with low computational cost, we present Incremental Product Quantization, which updates a partial quantization codebook according to two adaptive thresholds; and (ii) To
Authors
(none)
Tags
Stats
Related papers
- Generative Retrieval Meets Multi-graded Relevance (2024)2.26
- Generative Dense Retrieval: Memory Can Be A Burden (2024)4.52
- Does Generative Retrieval Overcome The Limitations Of Dense Retrieval? (2025)0.00
- Listwise Generative Retrieval Models Via A Sequential Learning Process (2024)8.60
- GLEN: Generative Retrieval Via Lexical Index Learning (2023)9.29
- Generative Retrieval As Dense Retrieval (2023)0.00
- Bootstrapped Pre-training With Dynamic Identifier Prediction For Generative Retrieval (2024)4.52
- Multi-step Semantic Reasoning In Generative Retrieval (2026)0.00