Double-cycle weighted imputation method for wastewater treatment process data with multiple missing patterns
被引:5
作者:
Han Honggui
论文数: 0引用数: 0
h-index: 0
机构:
Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China
Minist Educ, Engn Res Ctr Digital Community, Beijing 100124, Peoples R ChinaBeijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
Han Honggui
[1
,2
,3
]
Sun Meiting
论文数: 0引用数: 0
h-index: 0
机构:
Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China
Minist Educ, Engn Res Ctr Digital Community, Beijing 100124, Peoples R ChinaBeijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
Sun Meiting
[1
,2
,3
]
Wu Xiaolong
论文数: 0引用数: 0
h-index: 0
机构:
Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China
Minist Educ, Engn Res Ctr Digital Community, Beijing 100124, Peoples R ChinaBeijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
Wu Xiaolong
[1
,2
,3
]
Li Fangyu
论文数: 0引用数: 0
h-index: 0
机构:
Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China
Minist Educ, Engn Res Ctr Digital Community, Beijing 100124, Peoples R ChinaBeijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
Li Fangyu
[1
,2
,3
]
机构:
[1] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[2] Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China
[3] Minist Educ, Engn Res Ctr Digital Community, Beijing 100124, Peoples R China
Due to sensor malfunctions and communication faults, multiple missing patterns frequently happen in wastewater treatment process (WWTP). Nevertheless, the existing missing data imputation works cannot stand multiple missing patterns because they have not sufficiently utilized of data information. In this article, a double-cycle weighted imputation (DCWI) method is proposed to deal with multiple missing patterns by maximizing the utilization of the available information in variables and instances. The proposed DCWI is comprised of two components: a double-cycle-based imputation sorting and a weighted K nearest neighbor-based imputation estimator. First, the double-cycle mechanism, associated with missing variable sorting and missing instance sorting, is applied to direct the missing values imputation. Second, the weighted K nearest neighbor-based imputation estimator is used to acquire the global similar instances and capture the volatility in the local region. The estimator preserves the original data characteristics as much as possible and enhances the imputation accuracy. Finally, experimental results on simulated and real WWTP datasets with non-stationarity and nonlinearity demonstrate that the proposed DCWI produces more accurate imputation results than comparison methods under different missing patterns and missing ratios.