A missing data imputation method for industrial soft sensor modeling

被引:0
作者
Jiang, Dongnian [1 ]
Yang, Haowen [1 ]
Cao, Huichao [1 ]
Xu, Dezhi [2 ]
机构
[1] Lanzhou Univ Technol, Coll Electricial & Informat Engn, Lanzhou 730050, Peoples R China
[2] Southeast Univ, Sch Elect Engn, Nanjing 210096, Peoples R China
基金
中国国家自然科学基金;
关键词
Industrial process; Diffusion model; Missing data imputation; Soft sensor; Customized;
D O I
10.1016/j.jprocont.2025.103485
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data on complex industrial processes are often missing, due to sensor or equipment malfunctions; this poses challenges for the prediction of important quality variables and soft sensor applications, and may have a significant impact on production processes and equipment maintenance. Traditional missing data imputation methods face challenges in terms of acquiring data distributions, structures, etc., and are detached from the downstream soft sensor tasks, as they do not consider the close connections and synergistic relationships between missing data imputation and soft sensors. This affects the filling results for important quality variables, and thus reduces the prediction accuracy of the downstream soft sensors. To address these issues, a missing data imputation method for industrial soft sensor modeling, called PFIDM, is proposed that can realize a customized data imputation process with a progressive feedback strategy. The loss function of the improved diffusion model (IDDPM) is rationally designed to introduce KL dispersion between the noise addition process and the data distribution corresponding to the generation process into the next step of noise prediction, which involves predicting and correcting the noise of the current data state. In addition, a dynamic step decay factor related to the noise intensity is defined in the sampling process, and the sampling step span is adaptively adjusted to reduce the number of sampling steps and to accelerate the sampling time. The superiority of the proposed method is verified by comparing several typical methods and instantiating a dataset.
引用
收藏
页数:12
相关论文
共 36 条
[1]  
Bai SJ, 2018, Arxiv, DOI [arXiv:1803.01271, DOI 10.48550/ARXIV.1803.01271]
[2]  
Batista GE., 2002, His, V87, P48
[3]   Optimal Arrangement and Rearrangement of Objects on Shelves to Minimize Robot Retrieval Cost [J].
Chen, Lawrence Yunliang ;
Huang, Huang ;
Goldberg, Ken .
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (03) :2184-2198
[4]  
Dai Y., 2023, IEEE Sens. J.
[5]   Missing data imputation using utility-based regression and sampling approaches [J].
Haliduola, Halimu N. ;
Bretz, Frank ;
Mansmann, Ulrich .
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2022, 226
[6]  
Ho J., 2020, Advances in neural information processing systems, P6840
[7]  
Honghai F., 2005, P INT C KNOWL BAS IN
[8]   Multiple imputation in principal component analysis [J].
Josse, Julie ;
Pages, Jerome ;
Husson, Francois .
ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2011, 5 (03) :231-246
[9]  
Lam M. W., 2022, arXiv, DOI 10.48550/arXiv.2203.13508
[10]  
Li YM, 2023, Arxiv, DOI arXiv:2311.03303