Efficient Neural Ranking Using Forward Indexes And Lightweight Encoders
2023 · Jurek Leonhardt, Henrik Müller, Koustav Rudra, et al.
Abstract
Dual-encoder-based dense retrieval models have become the standard in IR. They employ large Transformer-based language models, which are notoriously inefficient in terms of resources and latency. We propose Fast-Forward indexes -- vector forward indexes which exploit the semantic matching capabilities of dual-encoder models for efficient and effective re-ranking. Our framework enables re-ranking at very high retrieval depths and combines the merits of both lexical and semantic matching via score interpolation. Furthermore, in order to mitigate the limitations of dual-encoders, we tackle two main challenges: Firstly, we improve computational efficiency by either pre-computing representations, avoiding unnecessary computations altogether, or reducing the complexity of encoders. This allows us to considerably improve ranking efficiency and latency. Secondly, we optimize the memory footprint and maintenance cost of indexes; we propose two complementary techniques to reduce the index size a
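The score interpolation mentioned in the abstract can be illustrated with a minimal sketch. It assumes that the forward index is a plain mapping from document ID to a pre-computed vector, that semantic similarity is a dot product, and that the final score is a convex combination weighted by a parameter `alpha`. The names `rerank`, `forward_index`, `lexical_hits`, and `alpha` are hypothetical and do not come from the authors' code.

```python
from typing import Dict, List, Sequence, Tuple


def dot(u: Sequence[float], v: Sequence[float]) -> float:
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def rerank(
    query_vec: Sequence[float],
    lexical_hits: List[Tuple[str, float]],
    forward_index: Dict[str, Sequence[float]],
    alpha: float = 0.5,
) -> List[Tuple[str, float]]:
    """Re-rank first-stage (lexical) hits using pre-computed document vectors.

    Assumed scoring: alpha * lexical_score + (1 - alpha) * semantic_score.
    Only the query is encoded at query time; document vectors are looked up.
    """
    scored = []
    for doc_id, lexical_score in lexical_hits:
        doc_vec = forward_index[doc_id]  # forward-index lookup by document ID
        semantic_score = dot(query_vec, doc_vec)
        scored.append((doc_id, alpha * lexical_score + (1 - alpha) * semantic_score))
    # Highest interpolated score first.
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Toy data (made up for illustration only).
forward_index = {"d1": [1.0, 0.0], "d2": [0.0, 1.0], "d3": [1.0, 1.0]}
lexical_hits = [("d1", 3.0), ("d2", 2.0), ("d3", 1.0)]
query_vec = [0.0, 4.0]

ranked = rerank(query_vec, lexical_hits, forward_index, alpha=0.5)
print(ranked)
```

In this toy run, `d1` leads the lexical ranking but drops to last place after interpolation because its vector has no overlap with the query vector. Since the per-document cost is one lookup and one dot product, re-ranking deep candidate lists stays cheap in this setup.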
Related papers
- Efficient Neural Ranking Using Forward Indexes (2021) · 8.82
- Improving Neural Ranking Models With Traditional IR Methods (2023) · 0.00
- EHI: End-to-end Learning Of Hierarchical Index For Efficient Dense Retrieval (2023) · 0.00
- Retrieve Fast, Rerank Smart: Cooperative And Joint Approaches For Improved Cross-modal Retrieval (2021) · 10.97
- CODER: An Efficient Framework For Improving Retrieval Through Contextual Document Embedding Reranking (2021) · 7.16
- MICE: Minimal Interaction Cross-encoders For Efficient Re-ranking (2026) · 0.00
- Constructing Tree-based Index For Efficient And Effective Dense Retrieval (2023) · 9.23
- Hypencoder: Hypernetworks For Information Retrieval (2025) · 4.52