Quicker Similarity Joins in Metric Spaces

被引:0
|
作者
Fredriksson, Kimmo [1 ]
Braithwaite, Billy [1 ]
机构
[1] Univ Eastern Finland, Sch Comp, Kuopio, Finland
来源
SIMILARITY SEARCH AND APPLICATIONS (SISAP) | 2013年 / 8199卷
关键词
RANGE QUERIES; SEARCH; INDEX;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider the join operation in metric spaces. Given two sets A and B of objects drawn from some universe U, we want to compute the set A (sic) B = {(a, b) is an element of A x B vertical bar d(a, b) <= r} efficiently, where d : U x U -> R+ is a metric distance function and r is an element of R+ is user supplied query radius. In particular we are interested in the case where we have no index available (nor we can afford to build it) for either A or B. In this paper we improve the Quickjoin algorithm (Jacox and Samet, 2008), based on the well-know Quicksort algorithm, by (i) replacing the low level component that handles small subsets with essentially brute-force nested loop with a more efficient method; (ii) showing that, contrary to Quicksort, in Quickjoin unbalanced partitioning can improve the algorithm; and (iii) making the algorithm probabilistic while still obtaining most of the relevant results. We also show how to use Quickjoin to compute k-nearest neighbor joins. The experimental results show that the method works well in practice.
引用
收藏
页码:127 / 140
页数:14
相关论文
共 50 条
  • [21] Organizing Similarity Spaces Using Metric Hulls
    Janosova, Miriama
    Prochazka, David
    Dohnal, Vlastislav
    SIMILARITY SEARCH AND APPLICATIONS, SISAP 2021, 2021, 13058 : 3 - 16
  • [22] Similarity Between Points in Metric Measure Spaces
    Dantsin, Evgeny
    Wolpert, Alexander
    SIMILARITY SEARCH AND APPLICATIONS, SISAP 2020, 2020, 12440 : 177 - 184
  • [23] Distributed and scalable similarity searching in metric spaces
    Batko, M
    CURRENT TRENDS IN DATABASE TECHNOLOGY - EDBT 2004 WORKSHOPS, PROCEEDINGS, 2004, 3268 : 44 - 53
  • [24] Totally bounded metric spaces and similarity detecting algorithms
    Sagi, Gabor
    Al-Sabti, Karrar
    2019 IEEE 15TH INTERNATIONAL SCIENTIFIC CONFERENCE ON INFORMATICS (INFORMATICS 2019), 2019, : 457 - 461
  • [25] An efficient MapReduce algorithm for similarity join in metric spaces
    Liu, Wen
    Shen, Yanming
    Wang, Peng
    JOURNAL OF SUPERCOMPUTING, 2016, 72 (03): : 1179 - 1200
  • [26] Properties of embedding methods for similarity searching in metric spaces
    Hjaltason, GR
    Samet, H
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2003, 25 (05) : 530 - 549
  • [27] A Learned Index for Exact Similarity Search in Metric Spaces
    Tian, Yao
    Yan, Tingyun
    Zhao, Xi
    Huang, Kai
    Zhou, Xiaofang
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (08) : 7624 - 7638
  • [28] Index-driven similarity search in metric spaces
    Hjaltason, GR
    Samet, H
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 2003, 28 (04): : 517 - 580
  • [29] On Index-Free Similarity Search in Metric Spaces
    Skopal, Tomas
    Bustos, Benjamin
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2009, 5690 : 516 - +
  • [30] A novel indexing scheme for similarity search in metric spaces
    Tosun, Umut
    PATTERN RECOGNITION LETTERS, 2015, 54 : 69 - 74