Fast spectral analysis for approximate nearest neighbor search

被引:1
作者
Wang, Jing [1 ]
Shen, Jie [2 ]
机构
[1] Amazon, New York, NY 10001 USA
[2] Stevens Inst Technol, Hoboken, NJ 07030 USA
关键词
Approximate nearest neighbor search; Spectral analysis; Hashing; Noise; Subspace; QUANTIZATION; ALGORITHMS;
D O I
10.1007/s10994-021-06124-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In large-scale machine learning, of central interest is the problem of approximate nearest neighbor (ANN) search, where the goal is to query particular points that are close to a given object under certain metric. In this paper, we develop a novel data-driven ANN search algorithm where the data structure is learned by fast spectral technique based on s landmarks selected by approximate ridge leverage scores. We show that with overwhelming probability, our algorithm returns the (1 + epsilon/4)-ANN for any approximation parameter epsilon is an element of (0, 1). A remarkable feature of our algorithm is that it is computationally efficient. Specifically, learning k-length hash codes requires O((s(3) + ns(2)) log n) running time and O(d(2)) extra space, and returning the (1 + epsilon/4)-ANN of the query needs O(k log n) running time. The experimental results on computer vision and natural language understanding tasks demonstrate the significant advantage of our algorithm compared to state-of-the-art methods.
引用
收藏
页码:2297 / 2322
页数:26
相关论文
共 50 条
[1]   Spectral Approaches to Nearest Neighbor Search [J].
Abdullah, Amirali ;
Andoni, Alexandr ;
Kannan, Ravindran ;
Krauthgamer, Robert .
2014 55TH ANNUAL IEEE SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (FOCS 2014), 2014, :581-590
[2]  
Alaoui Ahmed, 2015, Advances in Neural Information Processing Systems, V28, P775
[3]   Near-optimal hashing algorithms for approximate nearest neighbor in high dimensions [J].
Andoni, Alexandr ;
Indyk, Piotr .
COMMUNICATIONS OF THE ACM, 2008, 51 (01) :117-122
[4]  
Andoni A, 2021, Disc Algorithms, P1171
[5]   Data-Dependent Hashing via Nonlinear Spectral Gaps [J].
Andoni, Alexandr ;
Naor, Assaf ;
Nikolov, Aleksandar ;
Razenshteyn, Ilya ;
Waingarten, Erik .
STOC'18: PROCEEDINGS OF THE 50TH ANNUAL ACM SIGACT SYMPOSIUM ON THEORY OF COMPUTING, 2018, :787-800
[6]  
Andoni Alexandr, 2014, The Twenty-Fifth Annual ACM-SIAM Symposium on Discrete Algorithms, P1018, DOI DOI 10.1137/1.9781611973402.76
[7]  
[Anonymous], 1990, The Design and Analysis of Spatial Data Structures
[8]  
[Anonymous], 2015, Advances in Neural Information Processing Systems
[9]  
[Anonymous], 1998, Sorting and Searching
[10]  
[Anonymous], 2012, P 25 INT C NEUR INF