Kernel Density Estimation Through Density Constrained Near Neighbor Search
2020 Β· Moses Charikar, Michael Kapralov, Navid Nouri, et al.
Abstract
In this paper we revisit the kernel density estimation problem: given a kernel \(K(x, y)\) and a dataset of \(n\) points in high dimensional Euclidean space, prepare a data structure that can quickly output, given a query \(q\), a \((1+\epsilon)\)-approximation to \(\mu:=\frac1\{|P|\}\sum_\{p\in P\} K(p, q)\). First, we give a single data structure based on classical near neighbor search techniques that improves upon or essentially matches the query time and space complexity for all radial kernels considered in the literature so far. We then show how to improve both the query complexity and runtime by using recent advances in data-dependent near neighbor search. We achieve our results by giving a new implementation of the natural importance sampling scheme. Unlike previous approaches, our algorithm first samples the dataset uniformly (considering a geometric sequence of sampling rates), and then uses existing approximate near neighbor search techniques on the resulting smaller datase
Authors
(none)
Tags
Stats
Related papers
- DEANN: Speeding Up Kernel-density Estimation Using Approximate Nearest Neighbor Search (2021)0.00
- Fast Private Kernel Density Estimation Via Locality Sensitive Quantization (2023)0.00
- A Scalable Solution To The Nearest Neighbor Search Problem Through Local-search Methods On Neighbor Graphs (2017)3.58
- PECANN: Parallel Efficient Clustering With Graph-based Approximate Nearest Neighbor Search (2023)0.00
- Local Density Estimation In High Dimensions (2018)4.52
- Adaptive Estimation For Approximate K-nearest-neighbor Computations (2019)0.00
- Product Manifold Filter: Non-rigid Shape Correspondence Via Kernel Density Estimation In The Product Space (2017)14.47
- Nearest Neighbor And Kernel Survival Analysis: Nonasymptotic Error Bounds And Strong Consistency Rates (2019)0.00