Efficient Flexible M-Tree Bulk Loading Using FastMap and Space-Filling Curves

被引:1
作者
Loh, Woong-Kee [1 ]
机构
[1] Gachon Univ, Dept Software, Seongnam 13120, South Korea
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2021年 / 66卷 / 02期
基金
新加坡国家研究基金会;
关键词
M-tree; metric space; bulk loading; FastMap; space-filling curve;
D O I
10.32604/cmc.2020.012763
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many database applications currently deal with objects in a metric space. Examples of such objects include unstructured multimedia objects and points of interest (POIs) in a road network. The M-tree is a dynamic index structure that facilitates an efficient search for objects in a metric space. Studies have been conducted on the bulk loading of large datasets in an M-tree. However, because previous algorithms involve excessive distance computations and disk accesses, they perform poorly in terms of their index construction and search capability. This study proposes two efficient M-tree bulk loading algorithms. Our algorithms minimize the number of distance computations and disk accesses using FastMap and a space-filling curve, thereby significantly improving the index construction and search performance. Our second algorithm is an extension of the first, and it incorporates a partitioning clustering technique and flexible node architecture to further improve the search performance. Through the use of various synthetic and real-world datasets, the experimental results demonstrated that our algorithms improved the index construction performance by up to three orders of magnitude and the search performance by up to 20.3 times over the previous algorithm.
引用
收藏
页码:1251 / 1267
页数:17
相关论文
共 27 条
  • [1] k-Nearest Neighbors on Road Networks: A Journey in Experimentation and In-Memory Implementation
    Abeywickrama, Tenindra
    Cheema, Muhammad Aamir
    Taniar, David
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2016, 9 (06): : 492 - 503
  • [2] Abraham I, 2011, LECT NOTES COMPUT SC, V6630, P230
  • [3] Akiba Takuya, 2014, 2014 P 16 WORKSH ALG, P147
  • [4] [Anonymous], 1994, P 20 INT C VER LARG
  • [5] Berchtold S, 1996, PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON VERY LARGE DATA BASES, P28
  • [6] Borg I., 2005, Modern Multidimensional Scaling: Theory and Applications, P261
  • [7] Scalable Single Source Shortest Path Algorithms for Massively Parallel Systems
    Chakaravarthy, Venkatesan T.
    Checconi, Fabio
    Petrini, Fabrizio
    Sabharwal, Yogish
    [J]. 2014 IEEE 28TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM, 2014,
  • [8] Chu Qiu, 2010, Proceedings of the Third International Joint Conference on Computational Sciences and Optimization (CSO 2010), P300, DOI 10.1109/CSO.2010.96
  • [9] Ciaccia P, 1997, PROCEEDINGS OF THE TWENTY-THIRD INTERNATIONAL CONFERENCE ON VERY LARGE DATABASES, P426
  • [10] Ciaccia P., 1998, 9 AUSTR DAT C PERTH, P5