Awesome Survey Paper
Survey Paper is one of the most active areas in Awesome Similarity Search β 40 papers in this collection, evaluated on datasets like CIFAR-10, CiteseerX, CLIP. A strong starting point is "Distance And Similarity Measures Effect On The Performance Of K-nearest Neighbor Classifier -- A Review".
Datasets & benchmarks
Key papers
- Distance And Similarity Measures Effect On The Performance Of K-nearest Neighbor Classifier -- A Review (2017)V. B. Surya Prasath, Haneen Arafat Abu Alfeilat, Ahmad B. A. Hassanat, et al.20.24
- A Comprehensive Survey And Experimental Comparison Of Graph-based Approximate Nearest Neighbor Search (2021)Mengzhao Wang, Xiaoliang Xu, Qiang Yue, et al.17.35
- Vector Database Management Systems: Fundamental Concepts, Use-cases, And Current Challenges (2023)Toni Taipalus14.23
- A Comprehensive Survey on Vector Database: Storage and Retrieval Technique, Challenge (2023)Le Ma et al.13.32
- A Multi-modal Neural Embeddings Approach For Detecting Mobile Counterfeit Apps: A Case Study On Google Play Store (2020)Naveen Karunanayake, Jathushan Rajasegaran, Ashanie Gunathillake, et al.7.81
- Graph-Based Vector Search: An Experimental Evaluation of the State-of-the-Art (2025)Ilias Azizi et al.7.77
- Exploring The Meaningfulness Of Nearest Neighbor Search In High-dimensional Space (2024)Zhonghan Chen, Ruiyuan Zhang, Xi Zhao, et al.2.26
- Dimensionality-Reduction Techniques for Approximate Nearest Neighbor
Search: A Survey and Evaluation (2024)Zeyu Wang et al.2.10
- Can You Trust the Vectors in Your Vector Database? Black-Hole Attack from Embedding Space Defects (2026)Hanxi Li et al.1.89
- GPU-Accelerated Algorithms for Graph Vector Search: Taxonomy, Empirical Study, and Research Directions (2026)Yaowen Liu et al.1.78
- Vector Search for the Future: From Memory-Resident, Static Heterogeneous Storage, to Cloud-Native Architectures (2026)Yitong Song et al.1.72
- Vextra: A Unified Middleware Abstraction for Heterogeneous Vector Database Systems (2026)Chandan Suri et al.1.72
- A Survey on Deep Text Hashing: Efficient Semantic Text Retrieval with Binary Representation (2025)Liyang He et al.1.56
- Survey of Filtered Approximate Nearest Neighbor Search over the Vector-Scalar Hybrid Data (2025)Yanjun Lin et al.1.28
- Analytics Modelling over Multiple Datasets using Vector Embeddings (2025)Andreas Loizou and Dimitrios Tsoumakos1.11
- Orthogonal Matrices For MBAT Vector Symbolic Architectures, And A "soft" VSA Representation For JSON (2022)Stephen I. Gallant0.00
- On Background Bias In Deep Metric Learning (2022)Konstantin Kobs, Andreas Hotho0.00
- When Similarity Digest Meets Vector Management System: A Survey On Similarity Hash Function (2021)Zhushou Tang, Lingyi Tang, Keying Tang, et al.0.00
- A Survey On Efficient Processing Of Similarity Queries Over Neural Embeddings (2022)Yifan Wang0.00
- Dimensionality-reduction Techniques For Approximate Nearest Neighbor Search: A Survey And Evaluation (2024)Zeyu Wang, Haoran Xiong, Qitong Wang, et al.0.00
- A Survey on Learning to Hash (2016)Jingdong Wang et al.β
- Approximate Nearest Neighbor Search on High Dimensional Data ---
Experiments, Analyses, and Improvement (v1.0) (2016)Wen Li et al.β
- Approximate Nearest Neighbor Search in High Dimensions (2018)Alexandr Andoni et al.β
- Implementation Notes for the Soft Cosine Measure (2018)V\'it Novotn\'y (1) ((1) Faculty of Informatics et al.β
- A Fast Text Similarity Measure for Large Document Collections using
Multi-reference Cosine and Genetic Algorithm (2018)Hamid Mohammadi et al.β
- A Similarity Measure for Weaving Patterns in Textiles (2018)Sven Helmer and Vuong M. Ngoβ
- A Review for Weighted MinHash Algorithms (2018)Wei Wu et al.β
- Multi-Frequency Vector Diffusion Maps (2019)Yifeng Fan and Zhizhen Zhaoβ
- Soccer Team Vectors (2019)Robert M\"uller et al.β
- Similarity-based Android Malware Detection Using Hamming Distance of
Static Binary Features (2019)Rahim Taheri et al.β
- Finding the most similar textual documents using Case-Based Reasoning (2019)Marko Mihajlovic et al.β
- A Survey on Deep Hashing Methods (2020)Xiao Luo et al.β
- Deep Learning for Image Search and Retrieval in Large Remote Sensing
Archives (2020)Gencer Sumbul et al.β
- Indexing Metric Spaces for Exact Similarity Search (2020)Lu Chen et al.β
- A survey on deep hashing for image retrieval (2020)Xiaopeng Zhangβ
- Experimental Analysis of Locality Sensitive Hashing Techniques for
High-Dimensional Approximate Nearest Neighbor Searches (2020)Omid Jafari et al.β
- Document Similarity from Vector Space Densities (2020)Ilia Rushkinβ
- A Comprehensive Survey and Experimental Comparison of Graph-Based
Approximate Nearest Neighbor Search (2021)Mengzhao Wang and Xiaoliang Xu and Qiang Yue and Yuxiang Wangβ
- A Survey on Locality Sensitive Hashing Algorithms and their Applications (2021)Omid Jafari et al.β
- State of the Art: Image Hashing (2021)Rubel Biswas and Pablo Blanco-Medinaβ
- When Similarity Digest Meets Vector Management System: A Survey on
Similarity Hash Function (2021)Zhushou Tang et al.β
- Orthogonal Matrices for MBAT Vector Symbolic Architectures, and a "Soft"
VSA Representation for JSON (2022)Stephen I. Gallantβ
- A Survey on Efficient Processing of Similarity Queries over Neural
Embeddings (2022)Yifan Wangβ
- Method for Determining the Similarity of Text Documents for the Kazakh
language, Taking Into Account Synonyms: Extension to TF-IDF (2022)Bakhyt Bakiyevβ
- Description-Based Text Similarity (2023)Shauli Ravfogel et al.β
- Semantic Equivalence of e-Commerce Queries (2023)Aritra Mandal et al.β
- Survey of Vector Database Management Systems (2023)James Jie Pan et al.β
- Foundations of Vector Retrieval (2024)Sebastian Bruchβ
- The Impacts of Data, Ordering, and Intrinsic Dimensionality on Recall in Hierarchical Navigable Small Worlds (2024)Owen Pendrigh Elliott et al.β
- Learning to Hash for Recommendation: A Survey (2024)Fangyuan Luo et al.β