Quicker Similarity Joins in Metric Spaces

被引:0
作者
Fredriksson, Kimmo [1 ]
Braithwaite, Billy [1 ]
机构
[1] Univ Eastern Finland, Sch Comp, Kuopio, Finland
来源
SIMILARITY SEARCH AND APPLICATIONS (SISAP) | 2013年 / 8199卷
关键词
RANGE QUERIES; SEARCH; INDEX;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider the join operation in metric spaces. Given two sets A and B of objects drawn from some universe U, we want to compute the set A (sic) B = {(a, b) is an element of A x B vertical bar d(a, b) <= r} efficiently, where d : U x U -> R+ is a metric distance function and r is an element of R+ is user supplied query radius. In particular we are interested in the case where we have no index available (nor we can afford to build it) for either A or B. In this paper we improve the Quickjoin algorithm (Jacox and Samet, 2008), based on the well-know Quicksort algorithm, by (i) replacing the low level component that handles small subsets with essentially brute-force nested loop with a more efficient method; (ii) showing that, contrary to Quicksort, in Quickjoin unbalanced partitioning can improve the algorithm; and (iii) making the algorithm probabilistic while still obtaining most of the relevant results. We also show how to use Quickjoin to compute k-nearest neighbor joins. The experimental results show that the method works well in practice.
引用
收藏
页码:127 / 140
页数:14
相关论文
共 35 条
[31]   UCORM: Indexing Uncorrelated Metric Spaces for Concise Content-Based Retrieval of Medical Images [J].
Zabot, Guilherme F. ;
Cazzolato, Mirela T. ;
Scabora, Lucas C. ;
Faical, Bruno S. ;
Traina, Agma J. M. ;
Traina, Caetano, Jr. .
2019 IEEE 32ND INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS (CBMS), 2019, :306-311
[32]   Improving the space cost of k-NN search in metric spaces by using distance estimators [J].
Bustos, Benjamin ;
Navarro, Gonzalo .
MULTIMEDIA TOOLS AND APPLICATIONS, 2009, 41 (02) :215-233
[33]   Approximation of the Capacitated Vehicle Routing Problem with a Limited Number of Routes in Metric Spaces of Fixed Doubling Dimension [J].
Ogorodnikov, Yu Yu ;
Khachay, M. Yu .
COMPUTATIONAL MATHEMATICS AND MATHEMATICAL PHYSICS, 2021, 61 (07) :1194-1206
[34]   Extended Hybrid Image Similarity - Combined Full-Reference Image Quality Metric Linearly Correlated with Subjective Scores [J].
Okarma, K. .
ELEKTRONIKA IR ELEKTROTECHNIKA, 2013, 19 (10) :129-132
[35]   Haimovich-Rinnooy Kan polynomial-time approximation scheme for the CVRP in metric spaces of a fixed doubling dimension [J].
Khachai, M. Yu ;
Ogorodnikov, Yu Yu .
TRUDY INSTITUTA MATEMATIKI I MEKHANIKI URO RAN, 2019, 25 (04) :235-248