Abstract

Given a vector dataset \(\mathcal\{X\}\), a query vector \(\vec\{x\}_q\), graph-based Approximate Nearest Neighbor Search (ANNS) aims to build a proximity graph (PG) as an index of \(\mathcal\{X\}\) and approximately return vectors with minimum distances to \(\vec\{x\}_q\) by searching over the PG index. It suffers from the large-scale \(\mathcal\{X\}\) because a PG with full vectors is too large to fit into the memory, e.g., a billion-scale \(\mathcal\{X\}\) in 128 dimensions would consume nearly 600 GB memory. To solve this, Product Quantization (PQ) integrated graph-based ANNS is proposed to reduce the memory usage, using smaller compact codes of quantized vectors in memory instead of the large original vectors. Existing PQ methods do not consider the important routing features of PG, resulting in low-quality quantized vectors that affect the ANNS's effectiveness. In this paper, we present an end-to-end Routing-guided learned Product Quantization (RPQ) for graph-based ANNS. It consi

Authors

(none)

Tags

  • Product Quantization
  • ANN Search

Stats

  • citations3
  • S2 citationsβ€”
  • github stars0
  • HF likes0
  • heat score4.52
  • arxiv keyyue2023routing

Related papers