Quicker range- and k-NN joins in metric spaces

被引:4
|
作者
Fredriksson, Kimmo [1 ]
Braithwaite, Billy [1 ]
机构
[1] Univ Eastern Finland, Sch Comp, Kuopio 70211, Finland
关键词
Metric spaces; Similarity joins; k-nearest neighbors; Range searching; QUERIES; SEARCH; INDEX;
D O I
10.1016/j.is.2014.09.006
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider the join operation in metric spaces. Given two sets A and B of objects drawn from some universe U, we want to compute the set A x B = {(a, b) is an element of A (sic) B vertical bar d(a, b) <= r) efficiently, where d: U x U -> R+ is a metric distance function and r is an element of R+ is user supplied query radius. In particular we are interested in the case where we have no index available (nor we can afford to build it) for either A or B. In this paper we improve the Quickjoin algorithm [6], inspired by the well-known Quicksort algorithm, by (i) replacing the low level component that handles small subsets with essentially brute-force nested loop with a more efficient method; (ii) showing that, contrary to Quicksort, in Quickjoin unbalanced partitioning can improve the algorithm; (iii) making the algorithm probabilistic while still obtaining most of the relevant results; and (iv) improving the (worst case) work space needed, from O(n(2)) to O(logn), where n is the size of the dataset(s). We also discuss pivot selection, show how to adapt Quickjoin to compute k-nearest neighbor joins and sketch a parallel version of the algorithm. The experimental results show that the method works well in practice. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:189 / 204
页数:16
相关论文
共 50 条
  • [1] Quicker Similarity Joins in Metric Spaces
    Fredriksson, Kimmo
    Braithwaite, Billy
    SIMILARITY SEARCH AND APPLICATIONS (SISAP), 2013, 8199 : 127 - 140
  • [2] Universal consistency of the k-NN rule in metric spaces and Nagata dimension***
    Collins, Benoit
    Kumari, Sushma
    Pestov, Vladimir G.
    ESAIM-PROBABILITY AND STATISTICS, 2020, 24 : 914 - 934
  • [3] A MapReduce Based k-NN Joins Probabilistic Classifier
    Chatzigeorgakidis, Georgios
    Karagiorgou, Sophia
    Athanasiou, Spiros
    Skiadopoulos, Spiros
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2015, : 952 - 957
  • [4] Improving the space cost of k-NN search in metric spaces by using distance estimators
    Bustos, Benjamin
    Navarro, Gonzalo
    MULTIMEDIA TOOLS AND APPLICATIONS, 2009, 41 (02) : 215 - 233
  • [5] Improving the space cost of k-NN search in metric spaces by using distance estimators
    Benjamin Bustos
    Gonzalo Navarro
    Multimedia Tools and Applications, 2009, 41 : 215 - 233
  • [6] Universal consistency of the k-NN rule in metric spaces and Nagata dimension. II
    Kumari, Sushma
    Pestov, Vladimir G.
    ESAIM-PROBABILITY AND STATISTICS, 2024, 28 : 132 - 160
  • [7] Metric Learning by Directly Minimizing the k-NN Training Error
    Chernoff, Konstantin
    Loog, Marco
    Nielsen, Mads
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 1265 - 1268
  • [8] Classification in medical images using adaptive metric k-NN
    Chen, C.
    Chernoff, K.
    Karemore, G.
    Lo, P.
    Nielsen, M.
    Lauze, F.
    MEDICAL IMAGING 2010: IMAGE PROCESSING, 2010, 7623
  • [9] Elliptic metric K-NN method with asymptotic MDL measure
    Satonaka, Takami
    Uchimura, Keiichi
    2006 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP 2006, PROCEEDINGS, 2006, : 2065 - +
  • [10] A Gradient-Based Metric Learning Algorithm for k-NN Classifiers
    Zaidi, Nayyar Abbas
    Squire, David McG
    Suter, David
    AI 2010: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2010, 6464 : 194 - +