Workload-Based Ordering of Multi-Dimensional Data

Cited by: 5
Authors
Yang, Shengxun [1]
He, Zhen [1]
Chen, Yi-Ping Phoebe [1]
Affiliation
[1] La Trobe Univ, Dept Comp Sci & Comp Engn, Bundoora, Vic 3086, Australia
Keywords
Space-filling curve; multi-dimensional data ordering; spatial databases
DOI
10.1109/TKDE.2015.2496252
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Transforming multi-dimensional data into a one-dimensional sequence using space-filling curves such as the Hilbert curve, the Gray curve, and the Z-curve has been studied extensively. These techniques are not sensitive to data or workload skewness; in practice, however, user-access patterns and data distributions are often highly skewed in high-dimensional space. It is therefore desirable to produce a one-dimensional sequence that keeps the multi-dimensional grid cells that are queried together close to each other, which yields sequences with higher spatial locality. In this paper, we propose a workload-based approach to producing a one-dimensional ordering from multi-dimensional data. An extensive experimental evaluation suggests that our approach produces a high-quality ordering that outperforms the existing state-of-the-art Hilbert curve by a factor of 4.84, the Gray curve by a factor of 6.66, and the Z-curve by a factor of 7.26 in the number of subsequences used to answer a query; for IO time, it outperforms the Hilbert curve by a factor of 2.20, the Gray curve by a factor of 2.25, and the Z-curve by a factor of 2.38.
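The "number of subsequences used to answer a query" metric from the abstract can be illustrated with a small sketch. The code below is not the authors' workload-based method; it only shows one of the baselines (the Z-curve, via bit interleaving) and counts how many contiguous runs of one-dimensional positions a rectangular query touches. All function names (`z_index`, `row_major`, `count_subsequences`, `query_runs`) are hypothetical, and a 2-D grid with side length `2**bits` is assumed.

```python
def z_index(x: int, y: int, bits: int) -> int:
    """Z-curve position of cell (x, y): interleave the bits of x and y.

    Bits of x go to even positions, bits of y to odd positions.
    """
    z = 0
    for i in range(bits):
        z |= ((x >> i) & 1) << (2 * i)
        z |= ((y >> i) & 1) << (2 * i + 1)
    return z


def row_major(x: int, y: int, bits: int) -> int:
    """Plain row-major position, included only as a point of comparison."""
    return y * (1 << bits) + x


def count_subsequences(positions) -> int:
    """Count maximal runs of consecutive integers among the given positions."""
    ordered = sorted(set(positions))
    if not ordered:
        return 0
    runs = 1
    for prev, cur in zip(ordered, ordered[1:]):
        if cur != prev + 1:  # a gap starts a new subsequence
            runs += 1
    return runs


def query_runs(x0, x1, y0, y1, bits, order=z_index) -> int:
    """Subsequences needed for the inclusive range query [x0, x1] x [y0, y1]."""
    cells = [
        order(x, y, bits)
        for x in range(x0, x1 + 1)
        for y in range(y0, y1 + 1)
    ]
    return count_subsequences(cells)


if __name__ == "__main__":
    # 4x4 grid (bits=2): an aligned 2x2 query versus a misaligned one.
    print(query_runs(0, 1, 0, 1, bits=2))                   # 1
    print(query_runs(1, 2, 1, 2, bits=2))                   # 4
    print(query_runs(1, 2, 1, 2, bits=2, order=row_major))  # 2
```

On this toy grid, the same 2x2 query costs one subsequence when it aligns with a Z-curve quadrant and four when it straddles quadrant boundaries, which is the kind of sensitivity to query placement that motivates ordering cells by the observed workload.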
Pages: 831-844
Page count: 14