Univariate imputation method for recovering missing data in wastewater treatment process

被引:16
作者
Han, Honggui [1 ]
Sun, Meiting
Han, Huayun
Wu, Xiaolong
Qiao, Junfei
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
来源
CHINESE JOURNAL OF CHEMICAL ENGINEERING | 2023年 / 53卷
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Univariate; Self-similarity; Waste water; Algorithm; Integration; FEATURE-SELECTION; MANAGEMENT; VALUES;
D O I
10.1016/j.cjche.2022.01.033
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
High-quality data play a paramount role in monitoring, control, and prediction of wastewater treatment process (WWTP) and can effectively ensure the efficient and stable operation of system. Missing values seriously degrade the accuracy, reliability and completeness of the data quality due to network collapses, connection errors and data transformation failures. In these cases, it is infeasible to recover missing data depending on the correlation with other variables. To tackle this issue, a univariate imputation method (UIM) is proposed for WWTP integrating decomposition method and imputation algorithms. First, the seasonal-trend decomposition based on loess method is utilized to decompose the original time series into the seasonal, trend and remainder components to deal with the nonstationary characteristics of WWTP data. Second, the support vector regression is used to approximate the nonlinearity of the trend and remainder components respectively to provide estimates of its missing values. A self-similarity decomposition is conducted to fill the seasonal component based on its periodic pattern. Third, all the imputed results are merged to obtain the imputation result. Finally, six time series of WWTP are used to evaluate the imputation performance of the proposed UIM by comparing with existing seven methods based on two indicators. The experimental results illustrate that the proposed UIM is effective for WWTP time series under different missing ratios. Therefore, the proposed UIM is a promising method to impute WWTP time series.(c) 2022 The Chemical Industry and Engineering Society of China, and Chemical Industry Press Co., Ltd. All rights reserved.
引用
收藏
页码:201 / 210
页数:10
相关论文
共 50 条
[1]   Filling gaps in evapotranspiration measurements for water budget studies: Evaluation of a Kalman filtering approach [J].
Alavi, Nasim ;
Warland, Jon S. ;
Berg, Aaron A. .
AGRICULTURAL AND FOREST METEOROLOGY, 2006, 141 (01) :57-66
[2]   Multiple imputation for continuous variables using a Bayesian principal component analysis [J].
Audigier, Vincent ;
Husson, Francois ;
Josse, Julie .
JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2016, 86 (11) :2140-2156
[3]   Reconstruction of missing data in multidimensional time series by fuzzy similarity [J].
Baraldi, P. ;
Di Maio, F. ;
Genini, D. ;
Zio, E. .
APPLIED SOFT COMPUTING, 2015, 26 :1-9
[4]   Handling missing data in multivariate time series using a vector autoregressive model-imputation (VAR-IM) algorithm [J].
Bashir, Faraj ;
Wei, Hua-Liang .
NEUROCOMPUTING, 2018, 276 :23-30
[5]   Combination of two methodologies, artificial neural network and linear interpolation, to gap-fill daily nitrous oxide flux measurements [J].
Bigaignon, Laurent ;
Fieuzal, Remy ;
Delon, Claire ;
Tallec, Tiphaine .
AGRICULTURAL AND FOREST METEOROLOGY, 2020, 291
[6]   A novel imputation methodology for time series based on pattern sequence forecasting [J].
Bokde, Neeraj ;
Beck, Marcus W. ;
Martinez Alvarez, Francisco ;
Kulat, Kishore .
PATTERN RECOGNITION LETTERS, 2018, 116 :88-96
[7]  
Chandrasekaran S., 2016, P WORKSH COMP INT DO, P1
[8]   A Method for Improving Imputation and Prediction Accuracy of Highly Seasonal Univariate Data with Large Periods of Missingness [J].
Chaudhry, Aizaz ;
Li, Wei ;
Basri, Amir ;
Patenaude, Francois .
WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2019, 2019
[9]   Imputing incomplete time-series data based on varied-window similarity measure of data sequences [J].
Chiewchanwattana, Sirapat ;
Lursinsap, Chidchanok ;
Chu, Chee-Hung Henry .
PATTERN RECOGNITION LETTERS, 2007, 28 (09) :1091-1103
[10]  
Cleveland RB., 1990, J. Off. Stat, V6, P3