Efficient k-closest pair queries in general metric spaces

被引:0
|
作者
Gao, Yunjun [1 ,2 ]
Chen, Lu [1 ]
Li, Xinhan [1 ]
Yao, Bin [3 ]
Chen, Gang [1 ]
机构
[1] Zhejiang Univ, Coll Comp Sci, Hangzhou 310003, Zhejiang, Peoples R China
[2] Zhejiang Univ, Innovat Joint Res Ctr Cyber Phys Soc Syst, Hangzhou 310003, Zhejiang, Peoples R China
[3] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai 200030, Peoples R China
来源
VLDB JOURNAL | 2015年 / 24卷 / 03期
关键词
k-Closest pair query; Metric space; Query processing; Cost model; Algorithm; NEAREST-NEIGHBOR SEARCH; COST MODEL; DISTANCE; ALGORITHMS; JOINS;
D O I
10.1007/s00778-015-0383-4
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Given two object sets and , a k-closest pair query finds closest object pairs from . This operation is common in many real-life applications such as GIS, data mining, and recommender systems. Although it has received much attention in the Euclidean space, there is little prior work on the metric space. In this paper, we study the problem of kCP query processing in general metric spaces, namely Metric kCP search, and propose several efficient algorithms using dynamic disk-based metric indexes (e.g., M-tree), which can be applied to arbitrary type of data as long as a certain metric distance is defined and satisfies the triangle inequality. Our approaches follow depth-first and/or best-first traversal paradigm(s), employ effective pruning rules based on metric space properties and the counting information preserved in the metric index, take advantage of aggressive pruning and compensation to further boost query efficiency, and derive a node-based cost model for retrieval. In addition, we extend our techniques to tackle two interesting variants of queries. Extensive experiments with both real and synthetic data sets demonstrate the performance of our proposed algorithms, the effectiveness of our developed pruning rules, and the accuracy of our presented cost model.
引用
收藏
页码:415 / 439
页数:25
相关论文
共 50 条
  • [21] A GENERAL FIXED POINT THEOREM FOR A PAIR OF SELF MAPPINGS WITH COMMON LIMIT RANGE PROPERTY IN G - METRIC SPACES
    Popa, Valeriu
    Patriciu, Alina-Mihaela
    FACTA UNIVERSITATIS-SERIES MATHEMATICS AND INFORMATICS, 2014, 29 (04): : 351 - 370
  • [22] EFFICIENT RANDOMIZED PARALLEL ALGORITHM FOR THE CLOSEST PAIR PROBLEM IN D-DIMENSION
    MOHAN, PJ
    KAMAKOTI, V
    RANGAN, CP
    INFORMATION PROCESSING '94, VOL I: TECHNOLOGY AND FOUNDATIONS, 1994, 51 : 547 - 552
  • [23] FIXED POINTS IN k COMPLETE METRIC SPACES
    Kikina, Luljeta
    Kikina, Kristaq
    DEMONSTRATIO MATHEMATICA, 2011, 44 (02) : 349 - 357
  • [24] A queries-based structure for similarity searching in static and dynamic metric spaces
    Hanyf, Youssef
    Silkan, Hassan
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2020, 32 (02) : 188 - 196
  • [25] Efficient computation of spatial queries over points stored in k2-tree compact data structures
    Santolaya, Fernando
    Caniupan, Monica
    Gajardo, Luis
    Romero, Miguel
    Torres-Aviles, Rodrigo
    THEORETICAL COMPUTER SCIENCE, 2021, 892 : 108 - 131
  • [26] Exact Trajectory Similarity Search With N-tree: An Efficient Metric Index for kNN and Range Queries
    Gueting, Ralf hartmut
    Das, Suvam kumar
    Valdes, Fabio
    Ray, Suprio
    ACM TRANSACTIONS ON SPATIAL ALGORITHMS AND SYSTEMS, 2025, 11 (01)
  • [27] Efficient Processing of Top-k Queries in Uncertain Databases with x-Relations
    Yi, Ke
    Li, Feifei
    Kollios, George
    Srivastava, Divesh
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2008, 20 (12) : 1669 - 1682
  • [28] Efficient Processing of Reverse Top-k Dominating Queries
    Jiang, Tao
    Zhang, Bin
    Yang, Jun
    PROCEEDINGS OF 2018 THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ARTIFICIAL INTELLIGENCE (CSAI 2018) / 2018 THE 10TH INTERNATIONAL CONFERENCE ON INFORMATION AND MULTIMEDIA TECHNOLOGY (ICIMT 2018), 2018, : 115 - 119
  • [29] Efficient Approaches to k Representative G-Skyline Queries
    Zhou, Xu
    Li, Kenli
    Yang, Zhibang
    Gao, Yunjun
    Li, Keqin
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2020, 14 (05)
  • [30] Efficient approximation of the metric CVRP in spaces of fixed doubling dimension
    Khachay, Michael
    Ogorodnikov, Yuri
    Khachay, Daniel
    JOURNAL OF GLOBAL OPTIMIZATION, 2021, 80 (03) : 679 - 710