Rethinking Similarity Search: Embracing Smarter Mechanisms Over Smarter Data
2023 Β· Renzhi Wu, Jingfan Meng, Jie Jeff Xu, et al.
Abstract
In this vision paper, we propose a shift in perspective for improving the effectiveness of similarity search. Rather than focusing solely on enhancing the data quality, particularly machine learning-generated embeddings, we advocate for a more comprehensive approach that also enhances the underpinning search mechanisms. We highlight three novel avenues that call for a redefinition of the similarity search problem: exploiting implicit data structures and distributions, engaging users in an iterative feedback loop, and moving beyond a single query vector. These novel pathways have gained relevance in emerging applications such as large-scale language models, video clip retrieval, and data labeling. We discuss the corresponding research challenges posed by these new problem areas and share insights from our preliminary discoveries.
Authors
(none)
Tags
Stats
Related papers
- Vectorsearch: Enhancing Document Retrieval With Semantic Embeddings And Optimized Search (2024)0.00
- Evaluating The Impact Of Word Embeddings On Similarity Scoring In Practical Information Retrieval (2026)0.00
- Description-based Text Similarity (2023)0.00
- Axiomatic Explanations For Visual Search, Retrieval, And Similarity Learning (2021)0.00
- Semantic Vector Encoding And Similarity Search Using Fulltext Search Engines (2017)6.77
- Leanvec: Searching Vectors Faster By Making Them Fit (2023)0.00
- Reveal Hidden Pitfalls And Navigate Next Generation Of Vector Similarity Search From Task-centric Views (2025)0.00
- Meta-path Guided Embedding For Similarity Search In Large-scale Heterogeneous Information Networks (2016)0.00