Double-cycle weighted imputation method for wastewater treatment process data with multiple missing patterns

Cited by: 5
Authors
Han Honggui [1, 2, 3]
Sun Meiting [1, 2, 3]
Wu Xiaolong [1, 2, 3]
Li Fangyu [1, 2, 3]
Affiliations
[1] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[2] Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China
[3] Minist Educ, Engn Res Ctr Digital Community, Beijing 100124, Peoples R China
Funding
Beijing Natural Science Foundation; National Natural Science Foundation of China
Keywords
wastewater treatment process; multiple missing patterns; data information; imputation sorting; imputation estimator; MUTUAL INFORMATION; FEATURE-SELECTION; VALUES;
DOI
10.1007/s11431-022-2163-1
Chinese Library Classification number
T [Industrial Technology]
Discipline classification code
08
Abstract
Due to sensor malfunctions and communication faults, multiple missing patterns frequently occur in wastewater treatment process (WWTP) data. However, existing missing-data imputation methods cannot handle multiple missing patterns because they do not make sufficient use of the information in the data. In this article, a double-cycle weighted imputation (DCWI) method is proposed to deal with multiple missing patterns by maximizing the utilization of the available information in variables and instances. The proposed DCWI comprises two components: a double-cycle-based imputation sorting and a weighted K-nearest-neighbor-based imputation estimator. First, the double-cycle mechanism, which combines missing variable sorting and missing instance sorting, is applied to direct the imputation of missing values. Second, the weighted K-nearest-neighbor-based imputation estimator is used to find globally similar instances and capture the volatility in the local region. The estimator preserves the original data characteristics as much as possible and improves imputation accuracy. Finally, experimental results on simulated and real WWTP datasets with non-stationarity and nonlinearity demonstrate that the proposed DCWI produces more accurate imputation results than the comparison methods under different missing patterns and missing ratios.
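The abstract gives no formulas, so the following is only a minimal sketch of the general idea, not the authors' DCWI. It assumes that variables and instances are both visited in order of fewest missing values first, that neighbors are ranked by root-mean-square distance over the features two instances both have, and that neighbor weights are inverse distances. The function name `weighted_knn_impute` and the parameters `k` and `eps` are hypothetical.

```python
import numpy as np

def weighted_knn_impute(X, k=3, eps=1e-12):
    """Fill NaNs sequentially with inverse-distance-weighted K nearest neighbors.

    Assumed ordering (not taken from the paper): columns, then rows, are
    visited from fewest to most originally missing values.
    """
    X = np.array(X, dtype=float)   # copy, so the caller's array is untouched
    missing = np.isnan(X)          # original mask, used only for ordering

    # Outer cycle: variables with the fewest missing values first.
    col_order = [j for j in np.argsort(missing.sum(axis=0), kind="stable")
                 if missing[:, j].any()]
    for j in col_order:
        # Inner cycle: instances with the fewest missing values first.
        rows = np.where(missing[:, j])[0]
        rows = rows[np.argsort(missing[rows].sum(axis=1), kind="stable")]
        for i in rows:
            # Donors are rows whose value in column j is currently known,
            # including values filled in earlier in this loop.
            donors = np.where(~np.isnan(X[:, j]))[0]
            if donors.size == 0:
                continue           # nothing to learn from; leave the gap
            cand, dists = [], []
            for d in donors:
                shared = ~np.isnan(X[i]) & ~np.isnan(X[d])
                shared[j] = False  # never compare on the target column
                if shared.any():
                    diff = X[i, shared] - X[d, shared]
                    dists.append(np.sqrt(np.mean(diff ** 2)))
                    cand.append(d)
            if not cand:
                X[i, j] = np.nanmean(X[:, j])  # fallback: column mean
                continue
            cand, dists = np.array(cand), np.array(dists)
            nearest = np.argsort(dists, kind="stable")[:k]
            w = 1.0 / (dists[nearest] + eps)   # closer neighbors weigh more
            X[i, j] = np.sum(w * X[cand[nearest], j]) / np.sum(w)
    return X

# Toy data: the last row is missing its second feature.
X_demo = np.array([[1.0, 2.0], [2.0, 4.0], [3.0, 6.0], [2.0, np.nan]])
X_filled = weighted_knn_impute(X_demo, k=1)
```

Writing each imputed value back into `X` before moving on lets later imputations draw on it, which is one plausible reading of "maximizing the utilization of the available information"; the paper's actual sorting criteria and weighting scheme may differ.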
Pages: 2967-2978
Page count: 12