An Evaluation of Model-Based Approaches to Sensor Data Compression

被引:55
作者
Nguyen Quoc Viet Hung [1 ]
Jeung, Hoyoung [2 ]
Aberer, Karl [1 ]
机构
[1] Ecole Polytech Fed Lausanne, Stn 14, CH-1015 Lausanne, Switzerland
[2] SAP Res, South Brisbane, Qld 4101, Australia
关键词
Lossy compression; sensor data; benchmark; TIME-SERIES;
D O I
10.1109/TKDE.2012.237
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As the volumes of sensor data being accumulated are likely to soar, data compression has become essential in a wide range of sensor-data applications. This has led to a plethora of data compression techniques for sensor data, in particular model-based approaches have been spotlighted due to their significant compression performance. These methods, however, have never been compared and analyzed under the same setting, rendering a "right" choice of compression technique for a particular application very difficult. Addressing this problem, this paper presents a benchmark that offers a comprehensive empirical study on the performance comparison of the model-based compression techniques. Specifically, we reimplemented several state-of-the-art methods in a comparable manner, and measured various performance factors with our benchmark, including compression ratio, computation time, model maintenance cost, approximation quality, and robustness to noisy data. We then provide in-depth analysis of the benchmark results, obtained by using 11 different real data sets consisting of 346 heterogeneous sensor data signals. We believe that the findings from the benchmark will be able to serve as a practical guideline for applications that need to compress sensor data.
引用
收藏
页码:2434 / 2447
页数:14
相关论文
共 45 条
[1]  
[Anonymous], 2009, Proceedings of the VLDB Endowment (VLDB'09)
[2]  
[Anonymous], 2007, Proceedings of the 33rd International Conference on Very Large Data Bases. VLDB'07
[3]  
Arion A., 2011, P IEEE GLOBECOM, P1
[4]   Energy aware lossless data compression [J].
Barr, K ;
Asanovic, K .
PROCEEDINGS OF MOBISYS 2003, 2003, :231-244
[5]  
Buragohain C, 2007, PROC INT CONF DATA, P1001
[6]  
Burrows M., 1994, Algorithm, Data Compression, DOI 10.1.1.37.6774
[7]  
Cai Y., 2004, Proceedings of the 2004 ACM SIGMOD International Conference on Management of Data, SIGMOD '04, P599, DOI [DOI 10.1145/1007568.1007636, 10.1145/1007568.1007636]
[8]  
Carney D., 2002, Proceedings of the Twenty-eighth International Conference on Very Large Data Bases, P215
[9]  
Cheng A. F., 2007, Patent No. [US 7,249,153 B2, 7249153]
[10]  
Chu D., 2006, P 22 INT C DATA ENG, P48, DOI DOI 10.1109/ICDE.2006.21