An optimal algorithm for approximate nearest neighbor searching in fixed dimensions

被引:1376
作者
Arya, S
Mount, DM
Netanyahu, NS
Silverman, R
Wu, AY
机构
[1] Univ Maryland, Dept Comp Sci, College Pk, MD 20742 USA
[2] Univ Maryland, Inst Adv Comp Studies, College Pk, MD 20742 USA
[3] Univ Maryland, Ctr Automat Res, College Pk, MD 20742 USA
[4] Univ Dist Columbia, Dept Comp Sci, Washington, DC USA
[5] American Univ, Dept Comp Sci & Informat Syst, Washington, DC 20016 USA
[6] NASA, Goddard Space Flight Ctr, Greenbelt, MD 20771 USA
[7] Hong Kong Univ Sci & Technol, Kowloon, Peoples R China
关键词
approximation algorithms; box-decomposition trees; closest-point queries; nearest neighbor searching; post-office problem; priority search;
D O I
10.1145/293347.293348
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Consider a set S of n data points in real d-dimensional space, R-d, where distances are measured using any Minkowski metric. In nearest neighbor searching, we preprocess S into a data structure, so that given any query point q epsilon R-d, is the closest point of S to q can be reported quickly. Given any positive real epsilon, a data point p is a (1 + epsilon)-approximate nearest neighbor of q if its distance from q is within a factor of (1 + epsilon) Of the distance to the true nearest neighbor. We show that it is possible to preprocess a set of n points in R-d in O(dn log n) time and O(dn) space, so that given a query point q epsilon R-d, and epsilon > 0, a (1 + epsilon)-approximate nearest neighbor of q can be computed in O(c(d,epsilon) log n) time, where c(d,epsilon) less than or equal to d [1 + 6d/epsilon](d) is a factor depending only on dimension and epsilon. In general, we show that given an integer k greater than or equal to 1 (1 + epsilon)-approximations to the k nearest neighbors of q cart be computed in additional O(kd log n) time.
引用
收藏
页码:891 / 923
页数:33
相关论文
共 59 条
[51]  
Preparata F., 1985, COMPUTATIONAL GEOMET
[52]  
RIVEST RL, 1974, INFORMATION PROCESSI, P678
[53]  
Roussopoulos N., 1995, SIGMOD Record, V24, P71, DOI 10.1145/568271.223794
[54]  
Samet H., 1990, DESIGN ANAL SPATIAL
[55]   AN OPTIMAL ALGORITHM FOR THE ONLINE CLOSEST-PAIR PROBLEM [J].
SCHWARZ, C ;
SMID, M ;
SNOEYINK, J .
ALGORITHMICA, 1994, 12 (01) :18-29
[56]   A DATA STRUCTURE FOR DYNAMIC TREES [J].
SLEATOR, DD ;
TARJAN, RE .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1983, 26 (03) :362-391
[57]   REFINEMENTS TO NEAREST-NEIGHBOR SEARCHING IN K-DIMENSIONAL TREES [J].
SPROULL, RF .
ALGORITHMICA, 1991, 6 (04) :579-589
[58]   AN O(N LOG N) ALGORITHM FOR THE ALL-NEAREST-NEIGHBORS PROBLEM [J].
VAIDYA, PM .
DISCRETE & COMPUTATIONAL GEOMETRY, 1989, 4 (02) :101-115
[59]   Similarity indexing with the SS-tree [J].
White, DA ;
Jain, R .
PROCEEDINGS OF THE TWELFTH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, 1996, :516-523