Self-Supervised Deep Clustering Method for Detecting Abnormal Data of Wastewater Treatment Process

被引:10
作者
Han, Honggui [1 ,2 ]
Sun, Meiting [3 ,4 ]
Li, Fangyu [3 ,4 ]
Wang, Chen [5 ]
Liu, Zezhong [5 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Comp Intelligence & Intellige, Beijing 100124, Peoples R China
[2] Beijing Univ Technol, Minist Educ, Engn Res Ctr Digital Commun, Beijing 100124, Peoples R China
[3] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[4] Beijing Univ Technol, Beijing Key Lab Comp Intelligence & Intelligence, Beijing 100124, Peoples R China
[5] Beijing Aerosp Smart Mfg Technol Dev Co Ltd, Beijing, Peoples R China
基金
北京市自然科学基金; 美国国家科学基金会;
关键词
Wastewater treatment process; abnormal data detection; self-supervised deep clustering network; adaptive weight objective function; double memory enhanced modules; FEATURE-EXTRACTION; INTELLIGENT; SUPPORT;
D O I
10.1109/TII.2023.3268777
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In wastewater treatment process (WWTP), abnormal data seriously reduce data quality rendering the application techniques impractical. The implementation of abnormal data detection is challenging due to the nonlinear nature of WWTP. Typically, constructing an accurate anomaly detector requires large amounts of labeled data, which is difficult in practice. Thus, a self-supervised memory enhanced deep clustering method (SMEL) is proposed to detect abnormal data without using any labels. First, a self-supervised deep clustering network, combining stacked autoencoders and the clustering algorithm, leverages unlabeled data to extract nonlinear features and capture the normal pattern. Second, an adaptive weight objective function, jointly optimizing the reconstruct error and the clustering error, is designed to obtain a robust clustering structure. Third, double memory enhanced modules, consisting of a centroid memory and a score memory, are presented to enhance training stability and detection accuracy. Finally, experiments on three WWTP datasets show that the proposed SMEL achieves the highest detection accuracy.
引用
收藏
页码:1155 / 1166
页数:12
相关论文
共 35 条
[1]  
An J., 2015, Spec. Lect. IE, V2
[2]   Hierarchical Density Estimates for Data Clustering, Visualization, and Outlier Detection [J].
Campello, Ricardo J. G. B. ;
Moulavi, Davoud ;
Zimek, Arthur ;
Sander, Joerg .
ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2015, 10 (01)
[3]  
Feng C., 2010, ACM SIGKDD INT C KNO
[4]   MPA-RNN: A Novel Attention-Based Recurrent Neural Networks for Total Nitrogen Prediction [J].
Geng, Jingxuan ;
Yang, Chunhua ;
Li, Yonggang ;
Lan, Lijuan ;
Luo, Qiwu .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (10) :6516-6525
[5]  
Guo XF, 2017, PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P1753
[6]   Univariate imputation method for recovering missing data in wastewater treatment process [J].
Han, Honggui ;
Sun, Meiting ;
Han, Huayun ;
Wu, Xiaolong ;
Qiao, Junfei .
CHINESE JOURNAL OF CHEMICAL ENGINEERING, 2023, 53 :201-210
[7]   Double-cycle weighted imputation method for wastewater treatment process data with multiple missing patterns [J].
Han Honggui ;
Sun Meiting ;
Wu Xiaolong ;
Li Fangyu .
SCIENCE CHINA-TECHNOLOGICAL SCIENCES, 2022, 65 (12) :2967-2978
[8]   Cooperative Fuzzy-Neural Control for Wastewater Treatment Process [J].
Han, Honggui ;
Liu, Hongxu ;
Li, Jiaming ;
Qiao, Junfei .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (09) :5971-5981
[9]   Discovering cluster-based local outliers [J].
He, ZY ;
Xu, XF ;
Deng, SC .
PATTERN RECOGNITION LETTERS, 2003, 24 (9-10) :1641-1650
[10]   Data-Driven Hybrid Model for Forecasting Wastewater Influent Loads Based on Multimodal and Ensemble Deep Learning [J].
Heo, SungKu ;
Nam, KiJeon ;
Loy-Benitez, Jorge ;
Yoo, ChangKyoo .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (10) :6925-6934