Workload-Based Ordering of Multi-Dimensional Data

被引:5
作者
Yang, Shengxun [1 ]
He, Zhen [1 ]
Chen, Yi-Ping Phoebe [1 ]
机构
[1] La Trobe Univ, Dept Comp Sci & Comp Engn, Bundoora, Vic 3086, Australia
关键词
Space-filling curve; multi-dimensional data ordering; spatial databases; SPACE-FILLING CURVES;
D O I
10.1109/TKDE.2015.2496252
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Transforming multi-dimensional data into a one-dimensional sequence using space-filling curves such as the Hilbert curve, the Gray curve, and the Z-curve has been studied extensively. These techniques are not sensitive to data or workload skewness, however, in practice, user-access patterns and data distributions are often very skewed in high dimensional space. It is desirable to produce a one-dimensional sequence which keeps the multi-dimensional grid cells that are queried together close to each other. This generates sequences with higher spatial locality. We propose a workload-based approach to produce one-dimensional ordering from multi-dimensional data in this paper. An extensive experimental evaluation suggests that our approach produces a high quality ordering sequence which outperforms the existing state-of-the-art Hilbert curve by a factor of 4.84, the Gray curve by a factor of 6.66, and the Z-curve by a factor of 7.26 for the number of subsequences used to answer a query; and for IO time, it outperforms the Hilbert curve by a factor of 2.20, the Gray curve by a factor of 2.25, and the Z-curve by 2.38.
引用
收藏
页码:831 / 844
页数:14
相关论文
共 12 条
[11]   STHist-C: a highly accurate cluster-based histogram for two and three dimensional geographic data points [J].
Hai Thanh Mai ;
Kim, Jaeho ;
Roh, Yohan J. ;
Kim, Myoung Ho .
GEOINFORMATICA, 2013, 17 (02) :325-352
[12]   A Spatial Heterogeneity-Based Segmentation Model for Analyzing Road Deterioration Network Data in Multi-Scale Infrastructure Systems [J].
Song, Yongze ;
Wu, Peng ;
Gilmore, Daniel ;
Li, Qindong .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (11) :7073-7083