Diskann++: Efficient Page-based Search Over Isomorphic Mapped Graph Index Using Query-sensitivity Entry Vertex
2023 Β· Jiongkang Ni, Xiaoliang Xu, Yuxiang Wang, et al.
Abstract
Given a vector dataset \(\mathcal\{X\}\) and a query vector \(\vec\{x\}_q\), graph-based Approximate Nearest Neighbor Search (ANNS) aims to build a graph index \(G\) and approximately return vectors with minimum distances to \(\vec\{x\}_q\) by searching over \(G\). The main drawback of graph-based ANNS is that a graph index would be too large to fit into the memory especially for a large-scale \(\mathcal\{X\}\). To solve this, a Product Quantization (PQ)-based hybrid method called DiskANN is proposed to store a low-dimensional PQ index in memory and retain a graph index in SSD, thus reducing memory overhead while ensuring a high search accuracy. However, it suffers from two I/O issues that significantly affect the overall efficiency: (1) long routing path from an entry vertex to the query's neighborhood that results in large number of I/O requests and (2) redundant I/O requests during the routing process. We propose an optimized DiskANN++ to overcome above issues. Specifically, for the
Authors
(none)
Tags
Stats
Related papers
- Aisaq: All-in-storage ANNS With Product Quantization For Dram-free Information Retrieval (2024)0.00
- Routing-guided Learned Product Quantization For Graph-based Approximate Nearest Neighbor Search (2023)4.52
- DGAI: Decoupled On-disk Graph-based ANN Index For Efficient Updates And Queries (2025)0.00
- Ood-diskann: Efficient And Scalable Graph ANNS For Out-of-distribution Queries (2022)0.00
- In-place Updates Of A Graph Index For Streaming Approximate Nearest Neighbor Search (2025)0.00
- SPANN: Highly-efficient Billion-scale Approximate Nearest Neighbor Search (2021)0.00
- Frequency-aware Graph Construction And Search For Dynamic Vector Databases (2025)0.00
- Freshdiskann: A Fast And Accurate Graph-based ANN Index For Streaming Similarity Search (2021)0.00