Beyond locality-sensitive hashing

We present a new data structure for the c-approximate near neighbor problem (ANN) in the Euclidean space. For n points in R[superscript d], our algorithm achieves O[subscript c](n[superscript ρ] + d log n) query time and O[subscript c](n[superscript 1+ρ] + d log n) space, where ρ ≤ 7/(8c[superscript...

Full description

Bibliographic Details
Main Authors: Andoni, Alexandr (Author), Nguyen, Huy L. (Author), Indyk, Piotr (Contributor), Razenshteyn, Ilya (Contributor)
Other Authors: Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science (Contributor)
Format: Article
Language:English
Published: Association for Computing Machinery, 2018-03-14T18:14:06Z.
Subjects:
Online Access:Get fulltext
Description
Summary:We present a new data structure for the c-approximate near neighbor problem (ANN) in the Euclidean space. For n points in R[superscript d], our algorithm achieves O[subscript c](n[superscript ρ] + d log n) query time and O[subscript c](n[superscript 1+ρ] + d log n) space, where ρ ≤ 7/(8c[superscript 2]) + O(1/c[superscript 3]) + o[subscript c](1). This is the first improvement over the result by Andoni and Indyk (FOCS 2006) and the first data structure that bypasses a locality-sensitive hashing lower bound proved by O'Donnell, Wu and Zhou (ICS 2011). By a standard reduction we obtain a data structure for the Hamming space and ℓ[subscript 1] norm with ρ ≤ 7/(8c)+ O(1/c[superscript 3/2])+ o[superscript c](1), which is the first improvement over the result of Indyk and Motwani (STOC 1998).