Time Series Data Cleaning: A Survey

被引:61
作者
Wang, Xi [1 ]
Wang, Chen [1 ]
机构
[1] Tsinghua Univ, Sch Software, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
Data cleaning; data quality; time series; FIXING NUMERICAL ATTRIBUTES; OF-THE-ART; ANOMALY DETECTION; PARAMETER-ESTIMATION; MAXIMUM-LIKELIHOOD; OUTLIER DETECTION; TEMPORAL DATA; MARKOV MODEL; STATE; DATABASES;
D O I
10.1109/ACCESS.2019.2962152
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Errors are prevalent in time series data, which is particularly common in the industrial field. Data with errors could not be stored in the database, which results in the loss of data assets. At present, to deal with these time series containing errors, besides keeping original erroneous data, discarding erroneous data and manually checking erroneous data, we can also use the cleaning algorithm widely used in the database to automatically clean the time series data. This survey provides a classification of time series data cleaning techniques and comprehensively reviews the state-of-the-art methods of each type. Besides we summarize data cleaning tools, systems and evaluation criteria from research and industry. Finally, we highlight possible directions time series data cleaning.
引用
收藏
页码:1866 / 1881
页数:16
相关论文
共 128 条
[11]   Query-Oriented Data Cleaning with Oracles [J].
Bergman, Moria ;
Milo, Tova ;
Novgorodov, Slava ;
Tan, Wang-Chiew .
SIGMOD'15: PROCEEDINGS OF THE 2015 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2015, :1199-1214
[12]  
Bertossi L, 2005, LECT NOTES COMPUT SC, V3774, P262
[13]   The complexity and approximation of fixing numerical attributes in databases under integrity constraints [J].
Bertossi, Leopoldo ;
Bravo, Loreto ;
Franconi, Enrico ;
Lopatenko, Andrei .
INFORMATION SYSTEMS, 2008, 33 (4-5) :407-434
[14]  
Box G.E.P., 1970, Time Series Analysis: Forecasting and Control, DOI DOI 10.1080/01621459.1970.10481180
[15]   EXACT MAXIMUM-LIKELIHOOD PARAMETER-ESTIMATION OF SUPERIMPOSED EXPONENTIAL SIGNALS IN NOISE [J].
BRESLER, Y ;
MACOVSKI, A .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1986, 34 (05) :1081-1089
[16]   LOF: Identifying density-based local outliers [J].
Breunig, MM ;
Kriegel, HP ;
Ng, RT ;
Sander, J .
SIGMOD RECORD, 2000, 29 (02) :93-104
[17]  
Brillinger David R, 2001, Time series: data analysis and theory, DOI DOI 10.1137/1.9780898719246
[18]  
Brockwell PJ, 2016, SPRINGER TEXTS STAT, P1, DOI 10.1007/978-3-319-29854-2
[19]  
Brown R., 1992, Introduction to random signals and applied kalman filtering
[20]  
Brown R. G., 2004, Smoothing, Forecasting and Prediction of Discrete Time Series