Using a Time Granularity Table for Gradual Granular Data Aggregation

被引:2
作者
Iftikhar, Nadeem [1 ]
Pedersen, Torben Bach [1 ]
机构
[1] Aalborg Univ, Dept Comp Sci, DK-9220 Aalborg O, Denmark
关键词
Data aggregation; multi-dimensional data aggregation; gradual granular data aggregation; multi-granular data; time granularity; MULTIDIMENSIONAL DATA STREAMS;
D O I
10.3233/FI-2014-1039
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The majority of today's systems increasingly require sophisticated data management as they need to store and to query large amounts of data for analysis and reporting purposes. In order to keep more "detailed" data available for longer periods, "old" data has to be reduced gradually to save space and improve query performance, especially on resource-constrained systems with limited storage and query processing capabilities. A number of data reduction solutions have been developed, however an effective solution particularly based on gradual data reduction is missing. This paper presents an effective solution for data reduction based on gradual granular data aggregation. With the gradual granular data aggregation mechanism, older data can be made coarse-grained while keeping the newest data fine-grained. For instance, when data is 3 months old aggregate to 1 minute level from 1 second level, when data is 6 months old aggregate to 2 minutes level from 1 minute level and so on. The proposed solution introduces a time granularity based data structure, namely a relational time granularity table that enables long term storage of old data by maintaining it at different levels of granularity and effective query processing due to a reduction in data volume. In addition, the paper describes the implementation strategy derived from a farming case study using standard database technologies.
引用
收藏
页码:153 / 176
页数:24
相关论文
共 21 条
[1]  
Boly Aliou, 2007, 2007 IEEE International Conference on Research, Innovation and Vision for the Future, P220, DOI 10.1109/RIVF.2007.369160
[2]  
Cuzzocrea A, 2004, LECT NOTES COMPUT SC, V3292, P144
[3]  
Cuzzocrea Alfredo, 2011, Scientific and Statistical Database Management. Proceedings 23rd International Conference, SSDBM 2011, P575, DOI 10.1007/978-3-642-22351-8_43
[4]  
Cuzzocrea A, 2009, LECT NOTES COMPUT SC, V5691, P48, DOI 10.1007/978-3-642-03730-6_5
[5]   Stream cube: An architecture for multi-dimensional analysis of data streams [J].
Han, JW ;
Chen, YX ;
Dong, GZ ;
Pei, H ;
Wah, BW ;
Wang, JY ;
Cai, YD .
DISTRIBUTED AND PARALLEL DATABASES, 2005, 18 (02) :173-197
[6]  
Iftikhar N., 2010, P 16 EUR C INF SYST, P51
[7]  
Iftikhar N., 2011, INT WORKSHOP DATA WA, P1, DOI DOI 10.1145/2064676.2064678
[8]  
Iftikhar N, 2010, LECT NOTES COMPUT SC, V6295, P219, DOI 10.1007/978-3-642-15576-5_18
[9]  
Iftikhar N, 2010, LECT NOTES COMPUT SC, V6262, P111, DOI 10.1007/978-3-642-15251-1_8
[10]  
Iftikhar N, 2010, LECT NOTES ARTIF INT, V6278, P349, DOI 10.1007/978-3-642-15393-8_40