Spanns: Optimizing Approximate Nearest Neighbor Search For Sparse Vectors Using Near Memory Processing | Awesome Similarity Search Papers

Spanns: Optimizing Approximate Nearest Neighbor Search For Sparse Vectors Using Near Memory Processing

Tianqi Zhang, Flavio Ponzina, Tajana Rosing Β· 2026 31st Asia and South Pacific Design Automation Conference (ASP-DAC) Β· 2026

Approximate Nearest Neighbor Search (ANNS) is a fundamental operation in vector databases, enabling efficient similarity search in high-dimensional spaces. While dense ANNS has been optimized using specialized hardware accelerators, sparse ANNS remains limited by CPU-based implementations, hindering scalability. This limitation is increasingly critical as hybrid retrieval systems, combining sparse and dense embeddings, become standard in Information Retrieval (IR) pipelines. We propose SpANNS, a near-memory processing architecture for sparse ANNS. SpANNS combines a hybrid inverted index with efficient query management and runtime optimizations. The architecture is built on a CXL Type-2 near-memory platform, where a specialized controller manages query parsing and cluster filtering, while compute-enabled DIMMs perform index traversal and distance computations close to the data. It achieves 15.2x to 21.6x faster execution over the state-of-the-art CPU baselines, offering scalable and efficient solutions for sparse vector search.

Explore more on:
ANN Search
Similar Work
Loading…