Deeperimpact: Optimizing Sparse Learned Index Structures
2024 Β· Soyuj Basnet, Jerry Gou, Antonio Mallia, et al.
Abstract
A lot of recent work has focused on sparse learned indexes that use deep neural architectures to significantly improve retrieval quality while keeping the efficiency benefits of the inverted index. While such sparse learned structures achieve effectiveness far beyond those of traditional inverted index-based rankers, there is still a gap in effectiveness to the best dense retrievers, or even to sparse methods that leverage more expensive optimizations such as query expansion and query term weighting. We focus on narrowing this gap by revisiting and optimizing DeepImpact, a sparse retrieval approach that uses DocT5Query for document expansion followed by a BERT language model to learn impact scores for document terms. We first reinvestigate the expansion process and find that the recently proposed Doc2Query -- query filtration does not enhance retrieval quality when used with DeepImpact. Instead, substituting T5 with a fine-tuned Llama 2 model for query prediction results in a conside
Authors
(none)
Tags
Stats
Related papers
- Efficient Inverted Indexes For Approximate Retrieval Over Learned Sparse Representations (2024)11.67
- Hybrid Inverted Index Is A Robust Accelerator For Dense Retrieval (2022)7.07
- Optimizing Guided Traversal For Fast Learned Sparse Retrieval (2023)9.03
- A Few Brief Notes On Deepimpact, COIL, And A Conceptual Framework For Information Retrieval Techniques (2021)0.00
- Towards Competitive Search Relevance For Inference-free Learned Sparse Retrievers (2024)0.00
- SLIM: Sparsified Late Interaction For Multi-vector Retrieval With Inverted Indexes (2023)7.50
- Expandr: Teaching Dense Retrievers Beyond Queries With LLM Guidance (2025)3.25
- Faster Learned Sparse Retrieval With Guided Traversal (2022)11.29