Nearest neighbor queries in metric spaces

被引:89
作者
Clarkson, KL [1 ]
机构
[1] Lucent Technol, Bell Labs, Murray Hill, NJ 07974 USA
关键词
D O I
10.1007/PL00009449
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Given a set S of n sites (points), and a distance measure d, the nearest neighbor searching problem is to build a data structure so that given a query point q, the site nearest to q can be found quickly. This paper gives data structures for this problem when the sites and queries are in a metric space. One data structure, D(S), uses a divide-and-conquer recursion. The other data structure, M(S, Q), is somewhat like a skiplist. Both are simple and implementable. The data structures are analyzed when the metric space obeys a certain sphere-packing bound, and when the sites and query points are random and have distributions with an exchangeability property. This property implies, for example, that query point q is a random element of S boolean OR {q}. Under these conditions, the preprocessing and space bounds for the algorithms are close to linear in n. They depend also on the sphere-packing bound, and on the logarithm of the distance ratio Y(S) of S, the ratio of the distance between the farthest pair of points in S to the distance between the closest pair. The data structure M(S, Q), requires as input data an additional set Q, taken to be representative of the query points. The resource bounds of M(S, Q) have a dependence on the distance ratio of S boolean OR Q. While M(S, Q), can return wrong answers, its failure probability can be bounded, and is decreasing in a parameter K. Here K less than or equal to \Q\/n is chosen when building M(S, Q). The expected query time for M(S, Q) is O(K log n) log Y(S boolean OR Q), and the resource bounds increase linearly in K. The data structure D(S) has expected O (log n)O-(1) query time, for fixed distance ratio. The preprocessing algorithm for M(S, Q) can be used to solve the all nearest neighbor problem for S in O (n(log n)(2)(log Y(S))(2)) expected time.
引用
收藏
页码:63 / 93
页数:31
相关论文
共 50 条
[11]   On optimizing nearest neighbor queries in high-dimensional data spaces [J].
Berchtold, S ;
Böhm, C ;
Keim, D ;
Krebs, F ;
Kriegel, HP .
DATABASE THEORY - ICDT 2001, PROCEEDINGS, 2001, 1973 :435-449
[12]   Nearest and reverse nearest neighbor queries for moving objects [J].
Benetis, Rimantas ;
Jensen, Christian S. ;
Karciauskas, Gytis ;
Saltenis, Simonas .
VLDB JOURNAL, 2006, 15 (03) :229-U1
[13]   Nearest and reverse nearest neighbor queries for moving objects [J].
Rimantas Benetis ;
Christian S. Jensen ;
Gytis Karĉiauskas ;
Simonas Ŝaltenis .
The VLDB Journal, 2006, 15 :229-249
[14]   Practical construction of k-nearest neighbor graphs in metric spaces [J].
Paredes, Rodrigo ;
Chavez, Edgar ;
Figueroa, Karina ;
Navarro, Gonzalo .
EXPERIMENTAL ALGORITHMS, PROCEEDINGS, 2006, 4007 :85-97
[15]   Approximate nearest neighbor queries revisited [J].
Chan, TM .
DISCRETE & COMPUTATIONAL GEOMETRY, 1998, 20 (03) :359-373
[16]   Reverse Approximate Nearest Neighbor Queries [J].
Hidayat, Arif ;
Yang, Shiyu ;
Cheema, Muhammad Aamir ;
Taniar, David .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2018, 30 (02) :339-352
[17]   Optimal-nearest-neighbor queries [J].
Gao, Yunjun ;
Zhang, Jing ;
Chen, Gencai ;
Li, Qing ;
Liu, Shen ;
Chen, Chun .
2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, :1454-+
[18]   Approximate Nearest Neighbor Queries Revisited [J].
T. M. Chan .
Discrete & Computational Geometry, 1998, 20 :359-373
[19]   In-route nearest neighbor queries [J].
Jin, SY ;
Shekhar, S .
GEOINFORMATICA, 2005, 9 (02) :117-137
[20]   Continuous aggregate nearest neighbor queries [J].
Elmongui, Hicham G. ;
Mokbel, Mohamed F. ;
Aref, Walid G. .
GEOINFORMATICA, 2013, 17 (01) :63-95