Sparse Ternary Codes For Similarity Search Have Higher Coding Gain Than Dense Binary Codes | Awesome Similarity Search Papers

Sparse Ternary Codes For Similarity Search Have Higher Coding Gain Than Dense Binary Codes

Sohrab Ferdowsi, Slava Voloshynovskiy, Dimche Kostadinov, Taras Holotyak Β· 2017 IEEE International Symposium on Information Theory (ISIT) Β· 2017

This paper addresses the problem of Approximate Nearest Neighbor (ANN) search in pattern recognition where feature vectors in a database are encoded as compact codes in order to speed-up the similarity search in large-scale databases. Considering the ANN problem from an information-theoretic perspective, we interpret it as an encoding, which maps the original feature vectors to a less entropic sparse representation while requiring them to be as informative as possible. We then define the coding gain for ANN search using information-theoretic measures. We next show that the classical approach to this problem, which consists of binarization of the projected vectors is sub-optimal. Instead, a properly designed ternary encoding achieves higher coding gains and lower complexity.

Explore more on:
ANN Search
Similar Work
Loading…