NUDGE: Lightweight Non-parametric Fine-tuning Of Embeddings For Retrieval

Abstract

\(k\)-Nearest Neighbor search on dense vector embeddings (\(k\)-NN retrieval) from pre-trained embedding models is the predominant retrieval method for text and images, as well as in Retrieval-Augmented Generation (RAG) pipelines. In practice, application developers often fine-tune the embeddings to improve their accuracy on the dataset and query workload at hand. Existing approaches either fine-tune the pre-trained model itself or, more efficiently but at the cost of accuracy, train adaptor models to transform the output of the pre-trained model. We present NUDGE, a family of novel non-parametric embedding fine-tuning approaches that are significantly more accurate and efficient than both sets of existing approaches. NUDGE directly modifies the embeddings of data records to maximize the accuracy of \(k\)-NN retrieval. We present a thorough theoretical and experimental study of NUDGE's non-parametric approach. We show that even though the underlying problem is NP-Hard, constrained variat
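
To make the idea of non-parametric fine-tuning concrete, the following is a minimal sketch in plain Python. It is not the algorithm from the paper: the update rule, the step size `gamma`, and the names `knn` and `nudge_toward_queries` are assumptions chosen only for illustration. The sketch retrieves by inner product over unit-length embeddings, then moves each data embedding a bounded step toward the training queries labelled with it, leaving the embedding model itself untouched.

```python
import math

def normalize(v):
    """Scale a vector to unit length."""
    n = math.sqrt(sum(x * x for x in v))
    return [x / n for x in v]

def knn(query, data, k=1):
    """Indices of the k records with the highest inner product with the query."""
    scores = [sum(q * d for q, d in zip(query, rec)) for rec in data]
    return sorted(range(len(data)), key=lambda i: -scores[i])[:k]

def nudge_toward_queries(data, train_queries, labels, gamma=1.0):
    """Hypothetical update rule (an assumption, not the paper's method):
    shift each record by gamma along the normalized sum of the training
    queries whose ground-truth answer is that record, then re-normalize."""
    dim = len(data[0])
    tuned = []
    for i, rec in enumerate(data):
        direction = [0.0] * dim
        for q, y in zip(train_queries, labels):
            if y == i:
                direction = [a + b for a, b in zip(direction, q)]
        norm = math.sqrt(sum(x * x for x in direction))
        if norm > 0:  # records without training queries are left unchanged
            rec = [r + gamma * d / norm for r, d in zip(rec, direction)]
        tuned.append(normalize(rec))
    return tuned

# Toy data: two record embeddings and one training query whose
# ground-truth answer is record 0, although record 1 scores higher.
data = [[1.0, 0.0], [0.0, 1.0]]
train_queries = [normalize([0.6, 0.8])]
labels = [0]

before = knn(train_queries[0], data)            # retrieves record 1 (wrong)
tuned = nudge_toward_queries(data, train_queries, labels)
after = knn(train_queries[0], tuned)            # retrieves record 0 (correct)
print(before, after)
```

In this toy setting only the stored record embeddings change, so no model weights are trained and queries are embedded exactly as before. Bounding the step with `gamma` is one simple way to limit how far an embedding drifts from its pre-trained position.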
