Practical and optimal LSH for angular distance

We show the existence of a Locality-Sensitive Hashing (LSH) family for the angular distance that yields an approximate Near Neighbor Search algorithm with the asymptotically optimal running time exponent. Unlike earlier algorithms with this property (e.g., Spherical LSH (Andoni-Indyk-Nguyen-Razensht...

Full description

Bibliographic Details
Main Authors: Andoni, Alexandr (Author), Laarhoven, Thijs (Author), Indyk, Piotr (Contributor), Razenshteyn, Ilya (Contributor), Schmidt, Ludwig (Contributor)
Other Authors: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory (Contributor), Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science (Contributor)
Format: Article
Language:English
Published: Neural Information Processing Systems Foundation, 2018-02-20T21:04:45Z.
Subjects:
Online Access:Get fulltext
LEADER 01970 am a22002773u 4500
001 113844
042 |a dc 
100 1 0 |a Andoni, Alexandr  |e author 
100 1 0 |a Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory  |e contributor 
100 1 0 |a Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science  |e contributor 
100 1 0 |a Indyk, Piotr  |e contributor 
100 1 0 |a Razenshteyn, Ilya  |e contributor 
100 1 0 |a Schmidt, Ludwig  |e contributor 
700 1 0 |a Laarhoven, Thijs  |e author 
700 1 0 |a Indyk, Piotr  |e author 
700 1 0 |a Razenshteyn, Ilya  |e author 
700 1 0 |a Schmidt, Ludwig  |e author 
245 0 0 |a Practical and optimal LSH for angular distance 
260 |b Neural Information Processing Systems Foundation,   |c 2018-02-20T21:04:45Z. 
856 |z Get fulltext  |u http://hdl.handle.net/1721.1/113844 
520 |a We show the existence of a Locality-Sensitive Hashing (LSH) family for the angular distance that yields an approximate Near Neighbor Search algorithm with the asymptotically optimal running time exponent. Unlike earlier algorithms with this property (e.g., Spherical LSH (Andoni-Indyk-Nguyen-Razenshteyn 2014) (Andoni-Razenshteyn 2015)), our algorithm is also practical, improving upon the well-studied hyperplane LSH (Charikar 2002) in practice. We also introduce a multiprobe version of this algorithm and conduct an experimental evaluation on real and synthetic data sets.We complement the above positive results with a fine-grained lower bound for the quality of any LSH family for angular distance. Our lower bound implies that the above LSH family exhibits a trade-off between evaluation time and quality that is close to optimal for a natural class of LSH functions. 
520 |a National Science Foundation (U.S.) 
520 |a Simons Foundation 
546 |a en_US 
655 7 |a Article 
773 |t Advances in Neural Information Processing Systems 28 (NIPS 2015)