Abstract

Locality Sensitive Hashing (LSH) is an effective method to index a set of points such that we can efficiently find the nearest neighbors of a query point. We extend this method to our novel Set-query LSH (SLSH), such that it can find the nearest neighbors of a set of points, given as a query. Let \( s(x,y) \) be the similarity between two points \( x \) and \( y \). We define a similarity between a set \( Q\) and a point \( x \) by aggregating the similarities \( s(p,x) \) for all \( p\in Q \). For example, we can take \( s(p,x) \) to be the angular similarity between \( p \) and \( x \) (i.e., \(1-\{\angle (x,p)\}/\{\pi\}\)), and aggregate by arithmetic or geometric averaging, or taking the lowest similarity. We develop locality sensitive hash families and data structures for a large set of such arithmetic and geometric averaging similarities, and analyze their collision probabilities. We also establish an analogous framework and hash families for distance functions. Specifically,

Authors

(none)

Tags

  • Locality Sensitive Hashing
  • Deep Hashing
  • Supervised Hashing

Stats

  • citations0
  • S2 citationsβ€”
  • github stars0
  • HF likes0
  • heat score0.00
  • arxiv keykaplan2020locality

Related papers