Efficient k-closest pair queries in general metric spaces

被引:0
|
作者
Gao, Yunjun [1 ,2 ]
Chen, Lu [1 ]
Li, Xinhan [1 ]
Yao, Bin [3 ]
Chen, Gang [1 ]
机构
[1] Zhejiang Univ, Coll Comp Sci, Hangzhou 310003, Zhejiang, Peoples R China
[2] Zhejiang Univ, Innovat Joint Res Ctr Cyber Phys Soc Syst, Hangzhou 310003, Zhejiang, Peoples R China
[3] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai 200030, Peoples R China
来源
VLDB JOURNAL | 2015年 / 24卷 / 03期
关键词
k-Closest pair query; Metric space; Query processing; Cost model; Algorithm; NEAREST-NEIGHBOR SEARCH; COST MODEL; DISTANCE; ALGORITHMS; JOINS;
D O I
10.1007/s00778-015-0383-4
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Given two object sets and , a k-closest pair query finds closest object pairs from . This operation is common in many real-life applications such as GIS, data mining, and recommender systems. Although it has received much attention in the Euclidean space, there is little prior work on the metric space. In this paper, we study the problem of kCP query processing in general metric spaces, namely Metric kCP search, and propose several efficient algorithms using dynamic disk-based metric indexes (e.g., M-tree), which can be applied to arbitrary type of data as long as a certain metric distance is defined and satisfies the triangle inequality. Our approaches follow depth-first and/or best-first traversal paradigm(s), employ effective pruning rules based on metric space properties and the counting information preserved in the metric index, take advantage of aggressive pruning and compensation to further boost query efficiency, and derive a node-based cost model for retrieval. In addition, we extend our techniques to tackle two interesting variants of queries. Extensive experiments with both real and synthetic data sets demonstrate the performance of our proposed algorithms, the effectiveness of our developed pruning rules, and the accuracy of our presented cost model.
引用
收藏
页码:415 / 439
页数:25
相关论文
共 50 条
  • [41] Efficient Reverse Top-k Boolean Spatial Keyword Queries on Road Networks
    Gao, Yunjun
    Qin, Xu
    Zheng, Baihua
    Chen, Gang
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (05) : 1205 - 1218
  • [42] An eigenvalue-based pivot selection strategy for efficient indexing and searching in metric spaces
    Kim, Sung-Hwan
    Lee, Da-Young
    Cho, Hwan-Gue
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2017, 20 (04): : 3643 - 3655
  • [43] An eigenvalue-based pivot selection strategy for efficient indexing and searching in metric spaces
    Sung-Hwan Kim
    Da-Young Lee
    Hwan-Gue Cho
    Cluster Computing, 2017, 20 : 3643 - 3655
  • [44] Coupled common fixed point theorems for a pair of commuting mappings in partially ordered G-metric spaces
    Feng Gu
    Shuhang Zhou
    Fixed Point Theory and Applications, 2013
  • [45] General constructive fixed point theorems for Ciric-type almost contractions in metric spaces
    Berinde, Vasile
    CARPATHIAN JOURNAL OF MATHEMATICS, 2008, 24 (02) : 10 - 19
  • [46] A Partitioned-Based Method of Convex Skyline for Efficient Processing Top-k Queries
    Lee, Ki-Eun
    Ihm, Sun-Young
    Heo, Jun-Seok
    Lee, Jeong-Joon
    Park, Young-Ho
    SECOND INTERNATIONAL CONFERENCE ON CLOUD AND GREEN COMPUTING / SECOND INTERNATIONAL CONFERENCE ON SOCIAL COMPUTING AND ITS APPLICATIONS (CGC/SCA 2012), 2012, : 788 - 793
  • [47] Cantor’s intersection theorem for K-metric spaces with a solid cone and a contraction principle
    Jacek Jachymski
    Jakub Klima
    Journal of Fixed Point Theory and Applications, 2016, 18 : 445 - 463
  • [48] Efficient Top-k Indexing via General Reductions
    Rahul, Saladi
    Tao, Yufei
    PODS'16: PROCEEDINGS OF THE 35TH ACM SIGMOD-SIGACT-SIGAI SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS, 2016, : 277 - 288
  • [49] Pareto-Based Dominant Graph: An Efficient Indexing Structure to Answer Top-K Queries
    Zou, Lei
    Chen, Lei
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2011, 23 (05) : 727 - 741
  • [50] Efficient Group Processing for Multiple Reverse Top-k Geo-Social Keyword Queries
    Jin, Pengfei
    Gao, Yunjun
    Chen, Lu
    Zhao, Jingwen
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2020), PT I, 2020, 12112 : 279 - 287