Leanvec: Searching Vectors Faster By Making Them Fit
2023 Β· Mariano Tepper, Ishwar Singh Bhati, Cecilia Aguerrebere, et al.
Abstract
Modern deep learning models have the ability to generate high-dimensional vectors whose similarity reflects semantic resemblance. Thus, similarity search, i.e., the operation of retrieving those vectors in a large collection that are similar to a given query, has become a critical component of a wide range of applications that demand highly accurate and timely answers. In this setting, the high vector dimensionality puts similarity search systems under compute and memory pressure, leading to subpar performance. Additionally, cross-modal retrieval tasks have become increasingly common, e.g., where a user inputs a text query to find the most relevant images for that query. However, these queries often have different distributions than the database embeddings, making it challenging to achieve high accuracy. In this work, we present LeanVec, a framework that combines linear dimensionality reduction with vector quantization to accelerate similarity search on high-dimensional vectors while m
Authors
(none)
Tags
Stats
Related papers
- Gleanvec: Accelerating Vector Search With Minimalist Nonlinear Dimensionality Reduction (2024)0.00
- Vectorsearch: Enhancing Document Retrieval With Semantic Embeddings And Optimized Search (2024)0.00
- Semantic Vector Encoding And Similarity Search Using Fulltext Search Engines (2017)6.77
- Zoom: Ssd-based Vector Search For Optimizing Accuracy, Latency And Memory (2018)0.00
- Optimization Of Latent-space Compression Using Game-theoretic Techniques For Transformer-based Vector Search (2025)0.00
- Lucene For Approximate Nearest-neighbors Search On Arbitrary Dense Vectors (2019)0.00
- High-dimensional Similarity Search With Quantum-assisted Variational Autoencoder (2020)8.82
- Breaking The Curse Of Dimensionality: On The Stability Of Modern Vector Retrieval (2025)0.00