Imputation of Missing Values in Time Series Using an Adaptive-Learned Median-Filled Deep Autoencoder

被引:53
作者
Pan, Zhuofu [1 ,2 ]
Wang, Yalin [1 ]
Wang, Kai [1 ]
Chen, Hongtian [2 ]
Yang, Chunhua [1 ]
Gui, Weihua [1 ]
机构
[1] Cent South Univ, Sch Automat, Changsha 410083, Peoples R China
[2] Univ Alberta, Dept Chem & Mat Engn, Edmonton, AB T6G 1H9, Canada
基金
中国国家自然科学基金;
关键词
Training; Decoding; Feature extraction; Data models; Adaptation models; Supervised learning; Time series analysis; Adaptive-learned median-filled deep autoencoder (AM-DAE); industrial-type missing values; time-series imputation; unsupervised imputation; CLASSIFICATION;
D O I
10.1109/TCYB.2022.3167995
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Missing values are ubiquitous in industrial data sets because of multisampling rates, sensor faults, and transmission failures. The incomplete data obstruct the effective use of data and degrade the performance of data-driven models. Numerous imputation algorithms have been proposed to deal with missing values, primarily based on supervised learning, that is, imputing the missing values by constructing a prediction model with the remaining complete data. They have limited performance when the amount of incomplete data is overwhelming. Moreover, many methods have not considered the autocorrelation of time-series data. Thus, an adaptive-learned median-filled deep autoencoder (AM-DAE) is proposed in this study, aiming to impute missing values of industrial time-series data in an unsupervised manner. It continuously replaces the missing values by the median of the input data and its reconstruction, which allows the imputation information to be transmitted with the training process. In addition, an adaptive learning strategy is adopted to guide the AM-DAE paying more attention to the reconstruction learning of nonmissing values or missing values in different iteration periods. Finally, two industrial examples are used to verify the superior performance of the proposed method compared with other advanced techniques.
引用
收藏
页码:695 / 706
页数:12
相关论文
共 34 条
[1]   Establishing strong imputation performance of a denoising autoencoder in a wide range of missing data problems [J].
Abiri, Najmeh ;
Linse, Bjorn ;
Eden, Patrik ;
Ohlsson, Mattias .
NEUROCOMPUTING, 2019, 365 :137-146
[2]  
Bottou Leon., 2012, NEURAL NETWORKS TRIC, P421
[3]   Data-Driven Fault Diagnosis for Traction Systems in High-Speed Trains: A Survey, Challenges, and Perspectives [J].
Chen, Hongtian ;
Jiang, Bin ;
Ding, Steven X. ;
Huang, Biao .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (03) :1700-1716
[4]   A Review of Fault Detection and Diagnosis for the Traction System in High-Speed Trains [J].
Chen, Hongtian ;
Jiang, Bin .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (02) :450-465
[5]   Automatic Defect Detection of Fasteners on the Catenary Support Device Using Deep Convolutional Neural Network [J].
Chen, Junwen ;
Liu, Zhigang ;
Wang, Hongrui ;
Nunez, Alfredo ;
Han, Zhiwei .
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2018, 67 (02) :257-269
[6]  
Chiang L.H., 2000, Fault Detection and Diagnosis in Industrial Systems
[7]   Self-organizing maps for imputation of missing data in incomplete data matrices [J].
Folguera, Laura ;
Zupan, Jure ;
Cicerone, Daniel ;
Magallanes, Jorge F. .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2015, 143 :146-151
[8]  
Fortuin V., 2019, ARXIV190704155
[9]   Pattern classification with missing data: a review [J].
Garcia-Laencina, Pedro J. ;
Sancho-Gomez, Jose-Luis ;
Figueiras-Vidal, Anibal R. .
NEURAL COMPUTING & APPLICATIONS, 2010, 19 (02) :263-282
[10]   DLIN: Deep Ladder Imputation Network [J].
Hallaji, Ehsan ;
Razavi-Far, Roozbeh ;
Saif, Mehrdad .
IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (09) :8629-8641