Missing-Data Imputation With Position-Encoding Denoising Autoencoders for Industrial Processes

被引:23
作者
Ou, Chen [1 ,2 ]
Zhu, Hongqiu [1 ]
Shardt, Yuri A. W. [3 ]
Ye, Lingjian [4 ]
Yuan, Xiaofeng [1 ]
Wang, Yalin [1 ]
Yang, Chunhua [1 ]
Gui, Weihua [1 ]
机构
[1] Cent South Univ, Sch Automat, Changsha 410083, Peoples R China
[2] Hunan Univ Technol & Business, Sch Microelect & Phys, Changsha 410205, Peoples R China
[3] Tech Univ Ilmenau, Dept Automat Engn, D-98684 Ilmenau, Germany
[4] Huzhou Univ, Sch Engn, Huzhou 313000, Peoples R China
基金
中国国家自然科学基金;
关键词
Imputation; Data models; Training; Noise reduction; Feature extraction; Autocorrelation; Vectors; Deep learning (DL); denoising autoencoder (DAE); industrial processes; missing-data imputation; NETWORK;
D O I
10.1109/TIM.2024.3443350
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Missing values are a common occurrence in industrial datasets, resulting from the multiple sampling rates, sensor malfunctions, and transmission errors, whose presence can significantly affect the accuracy of data-driven models. An effective method to solve this problem is to impute the missing data in advance. This article proposes a new position-encoding denoising autoencoder (PE-DAE), which is motivated by the advantages of DAE in data reconstruction. To make use of the known information, the autocorrelation of the time-series data and the information on the missing positions are considered. Moreover, a self-paced learning (SPL) training strategy is proposed to improve the imputation performance under different levels of the missing data. The SPL training framework can first learn the knowledge structure of data with low missing rates and then gradually increase the difficulty, transitioning to learning more complex knowledge from data with higher missing rates. Finally, the proposed method is used for missing-data imputation in two real industrial processes. Comparative experiments show that the PE-DAE+SPL achieves the smallest error at all missing rates.
引用
收藏
页数:11
相关论文
共 43 条
[1]   Establishing strong imputation performance of a denoising autoencoder in a wide range of missing data problems [J].
Abiri, Najmeh ;
Linse, Bjorn ;
Eden, Patrik ;
Ohlsson, Mattias .
NEUROCOMPUTING, 2019, 365 :137-146
[2]  
Cao W, 2018, ADV NEUR IN, V31
[3]   Data-Driven Fault Diagnosis for Traction Systems in High-Speed Trains: A Survey, Challenges, and Perspectives [J].
Chen, Hongtian ;
Jiang, Bin ;
Ding, Steven X. ;
Huang, Biao .
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (03) :1700-1716
[4]   Harmonic Phasor Estimation Based on Frequency-Domain Sampling Theorem [J].
Chen, Lei ;
Zhao, Wei ;
Xie, Xiaorong ;
Zhao, Dongfang ;
Huang, Songling .
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2021, 70
[5]   Distributed Robust Process Monitoring Based on Optimized Denoising Autoencoder With Reinforcement Learning [J].
Chen, Shutian ;
Jiang, Qingchao .
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
[6]   Multiscale Modeling Using GAN and Deep Forest Regression With Application to Dioxin Emission Soft Sensor [J].
Cui, Canlin ;
Tang, Jian ;
Xia, Heng ;
Yu, Wen ;
Qiao, Junfei .
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
[7]   Soft sensors for product quality monitoring in debutanizer distillation columns [J].
Fortuna, L ;
Graziani, S ;
Xibilia, MG .
CONTROL ENGINEERING PRACTICE, 2005, 13 (04) :499-508
[8]   Counter propagation auto-associative neural network based data imputation [J].
Gautam, Chandan ;
Ravi, Vadlamani .
INFORMATION SCIENCES, 2015, 325 :288-299
[9]   DLIN: Deep Ladder Imputation Network [J].
Hallaji, Ehsan ;
Razavi-Far, Roozbeh ;
Saif, Mehrdad .
IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (09) :8629-8641
[10]   Multiple imputation in principal component analysis [J].
Josse, Julie ;
Pages, Jerome ;
Husson, Francois .
ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2011, 5 (03) :231-246