Progressive Bounded Error Piecewise Linear Approximation with Resolution Reduction for Time Series Data Compression

被引:0
作者
Lin, Jeng-Wei [1 ]
Liao, Shih-wei [2 ]
Tsai, Yu-Hung [1 ]
Huang, Ching-Che [1 ]
机构
[1] Tunghai Univ, Dept Informat Management, Taichung 407224, Taiwan
[2] Natl Taiwan Univ, Dept Comp Sci & Informat Engn, Taipei 10617, Taiwan
关键词
sensor data; time series; progressive data compression; hierarchical residual encoding; bounded error piecewise linear approximation; Swing-RR; PBEPLA-RR;
D O I
10.3390/s25010145
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Today, huge amounts of time series data are sensed continuously by AIoT devices, transmitted to edge nodes, and to data centers. It costs a lot of energy to transmit these data, store them, and process them. Data compression technologies are commonly used to reduce the data size and thus save energy. When a certain level of data accuracy is sacrificed, lossy compression technologies can achieve better compression ratios. However, different applications may have different requirements for data accuracy. Instead of keeping multiple compressed versions of a time series w.r.t. different error bounds, HIRE hierarchically maintains a tree, where the root records a constant function to approximate the whole time series, and each other node records a constant function to approximate a part of the residual function of its parent for a particular time period. To retrieve data w.r.t. a specific error bound, it traverses the tree from the root down to certain levels according to the requested error bound and aggregates the constant functions on the visited nodes to generate a new bounded error compressed version dynamically. However, the number of nodes to be visited is unknown before the tree traversal completes, and thus the data size of the new version. In this paper, a time series is progressively decomposed into multiple piecewise linear functions. The first function is an approximation of the original time series w.r.t. the largest error bound. The second function is an approximation of the residual function between the original time series and the first function w.r.t. the second largest error bound, and so forth. The sum of the first, second, & mldr;, and m-th functions is an approximation of the original time series w.r.t. the m-th error bound. For each iteration, Swing-RR is used to generate a Bounded Error Piecewise Linear Approximation (BEPLA). Resolution Reduction (RR) plays an important role. Eight real-world datasets are used to evaluate the proposed method. For each dataset, approximations w.r.t. three typical error bounds, 5%, 1%, and 0.5%, are requested. Three BEPLAs are generated accordingly, which can be summed up to form three approximations w.r.t. the three error bounds. For all datasets, the total data size of the three BEPLAs is almost the same with the size used to store just one version w.r.t. the smallest error bound and significantly smaller than the size used to keep three independent versions. The experiment result shows that the proposed method, referred to as PBEPLA-RR, can achieve very good compression ratios and provide multiple approximations w.r.t. different error bounds.
引用
收藏
页数:18
相关论文
共 24 条
[1]  
Barbarioli Bruno, 2023, Proceedings of the ACM on Management of Data, V1, DOI 10.1145/3588953
[2]   A Vehicle Passive Entry Passive Start System with the Intelligent Internet of Things [J].
Chang, Ray-, I ;
Lin, Tzu-Chieh ;
Lin, Jeng-Wei .
ELECTRONICS, 2024, 13 (13)
[3]  
Chen Q., 2007, P INT C VER LARG DAT, P435
[4]  
Chen Y., 2015, The UCR Time Series Classification Archive
[5]   Classification of short single-lead electrocardiograms (ECGs for atrial fibrillation detection using piecewise linear spline and XGBoost [J].
Chen, Yao ;
Wang, Xiao ;
Jung, Yonghan ;
Abedi, Vida ;
Zand, Ramin ;
Bikak, Marvi ;
Adibuzzaman, Mohammad .
PHYSIOLOGICAL MEASUREMENT, 2018, 39 (10)
[6]   A novel framework for stock trading signals forecasting [J].
Chen, Yingjun ;
Hao, Yijie .
SOFT COMPUTING, 2020, 24 (16) :12111-12130
[7]  
Elmeleegy H, 2009, PROC VLDB ENDOW, V2
[8]  
GNW, 2023, GLOBE NEWSWIRE
[9]   FITTING POLYGONAL FUNCTIONS TO A SET OF POINTS IN THE PLANE [J].
HAKIMI, SL ;
SCHMEICHEL, EF .
CVGIP-GRAPHICAL MODELS AND IMAGE PROCESSING, 1991, 53 (02) :132-136
[10]   Efficient and Flexible Hierarchical Data Layouts for a Unified Encoding of Scalar Field Precision and Resolution [J].
Hoang, Duong ;
Summa, Brian ;
Bhatia, Harsh ;
Lindstrom, Peter ;
Klacansky, Pavol ;
Usher, Will ;
Bremer, Peer-Timo ;
Pascucci, Valerio .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2021, 27 (02) :603-613