On approximate nearest neighbors under I∞ norm

被引:24
作者
Indyk, P [1 ]
机构
[1] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
基金
美国国家科学基金会;
关键词
D O I
10.1006/jcss.2001.1781
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The nearest neighbor search (NNS) problem is the following: Given a set of n points P = {p(i),..., p(n)} in some metric space X, preprocess P so as to efficiently answer queries which require finding a point in P closest to a query point q is an element of X. The approximate nearest neighbor search (c-NNS) is a relaxation of NNS which allows to return any point within c times the distance to the nearest neighbor (called c-nearest neighbor). This problem is of major and growing importance to a variety of applications. In this paper, we give an algorithm for (4 [log(1+p) log 4d] + 1)-NNS algorithm in l(infinity)(d) with O(dn(1+p) log(O(1)) n) storage and 0(d log(O(1)) n) query time. Moreover, we obtain an algorithm for 3-NNS for l(infinity) with n(log d+1) storage. The preprocessing time is close to linear in the size of the data structure. The algorithm can be also used (after simple modifications) to output the exact nearest neighbor in time bounded by O(d log(O(1)) n) plus the number of (4 [log(1+p), log 4d] + I)-nearest neighbors of the query point. Building on this result, we also obtain an approximation algorithm for a general class of product metrics. Finally, we show that for any c < 3 the c-NNS problem in l(infinity) is provably as hard as the subset query problem (also called the partial match problem). This indicates that obtaining a sublinear query time and subexponential (in d) space for c < 3 might be hard. (C) 2001 Elsevier Science (USA).
引用
收藏
页码:627 / 638
页数:12
相关论文
共 32 条
[1]  
AGARWAL PK, 1998, RAN BASED SIMILARITY
[2]  
AGRAWAL R, 1995, P 21 INT C VERY LARG
[3]  
AGRAWAL R, 1998, ADV DISCRETE COMPUTA
[4]  
AMBAINIS A, 1998, UNPUB
[5]  
[Anonymous], 1984, CONTEMP MATH
[6]  
[Anonymous], 1973, ART COMPUTER PROGRAM
[7]  
ARYA S, 1993, PROCEEDINGS OF THE FOURTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, P271
[8]   An optimal algorithm for approximate nearest neighbor searching in fixed dimensions [J].
Arya, S ;
Mount, DM ;
Netanyahu, NS ;
Silverman, R ;
Wu, AY .
JOURNAL OF THE ACM, 1998, 45 (06) :891-923
[9]   APPROXIMATE CLOSEST-POINT QUERIES IN HIGH DIMENSIONS [J].
BERN, M .
INFORMATION PROCESSING LETTERS, 1993, 45 (02) :95-99
[10]  
BORODIN A, STOC 99