Imputation of Missing Values in Time Series Using an Adaptive-Learned Median-Filled Deep Autoencoder

被引:53
作者
Pan, Zhuofu [1 ,2 ]
Wang, Yalin [1 ]
Wang, Kai [1 ]
Chen, Hongtian [2 ]
Yang, Chunhua [1 ]
Gui, Weihua [1 ]
机构
[1] Cent South Univ, Sch Automat, Changsha 410083, Peoples R China
[2] Univ Alberta, Dept Chem & Mat Engn, Edmonton, AB T6G 1H9, Canada
基金
中国国家自然科学基金;
关键词
Training; Decoding; Feature extraction; Data models; Adaptation models; Supervised learning; Time series analysis; Adaptive-learned median-filled deep autoencoder (AM-DAE); industrial-type missing values; time-series imputation; unsupervised imputation; CLASSIFICATION;
D O I
10.1109/TCYB.2022.3167995
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Missing values are ubiquitous in industrial data sets because of multisampling rates, sensor faults, and transmission failures. The incomplete data obstruct the effective use of data and degrade the performance of data-driven models. Numerous imputation algorithms have been proposed to deal with missing values, primarily based on supervised learning, that is, imputing the missing values by constructing a prediction model with the remaining complete data. They have limited performance when the amount of incomplete data is overwhelming. Moreover, many methods have not considered the autocorrelation of time-series data. Thus, an adaptive-learned median-filled deep autoencoder (AM-DAE) is proposed in this study, aiming to impute missing values of industrial time-series data in an unsupervised manner. It continuously replaces the missing values by the median of the input data and its reconstruction, which allows the imputation information to be transmitted with the training process. In addition, an adaptive learning strategy is adopted to guide the AM-DAE paying more attention to the reconstruction learning of nonmissing values or missing values in different iteration periods. Finally, two industrial examples are used to verify the superior performance of the proposed method compared with other advanced techniques.
引用
收藏
页码:695 / 706
页数:12
相关论文
共 34 条
[31]   A comparison study of basic data-driven fault diagnosis and process monitoring methods on the benchmark Tennessee Eastman process [J].
Yin, Shen ;
Ding, Steven X. ;
Haghani, Adel ;
Hao, Haiyang ;
Zhang, Ping .
JOURNAL OF PROCESS CONTROL, 2012, 22 (09) :1567-1581
[32]  
Yoon J, 2018, PR MACH LEARN RES, V80
[33]  
Zhou ZH, 2017, PROCEEDINGS OF 2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), P2329, DOI 10.1109/CompComm.2017.8322951
[34]   Review and big data perspectives on robust data mining approaches for industrial process modeling with outliers and missing data [J].
Zhu, Jinlin ;
Ge, Zhiqiang ;
Song, Zhihuan ;
Gao, Furong .
ANNUAL REVIEWS IN CONTROL, 2018, 46 :107-133