Turbidity prediction of lake-type raw water using random forest model based on meteorological data: A case study of Tai lake, China

被引:48
作者
Zhang, Yiping [1 ]
Yao, Xinyu [1 ]
Wu, Qiang [2 ]
Huang, Yongbin [2 ]
Zhou, Zhixu [3 ]
Yang, Jun [4 ]
Liu, Xiaowei [1 ]
机构
[1] Zhejiang Univ, Zhejiang Key Lab Drinking Water Safety & Distribu, Hangzhou 310058, Peoples R China
[2] Huzhou Water Grp Co Ltd, Huzhou 313000, Peoples R China
[3] Huzhou Meteorol Bur, Huzhou 313005, Peoples R China
[4] Hangzhou Meteorol Informat Ctr, Hangzhou 310051, Peoples R China
关键词
Machine learning technique; Wind field; Shallow lake; Drinking water source; Data-driven model; NEURAL-NETWORKS; RIVER-BASIN; VARIABLES;
D O I
10.1016/j.jenvman.2021.112657
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Turbidity is an indication of water quality and enables the growth of pathogenic microorganisms. For drinking water treatment plants (DWTPs), violent fluctuations in turbidity are highly disruptive to operational performance due to the lag in process parameter adjustments. Such risks must be carefully managed to guarantee safe drinking water. Machine learning techniques have been proven to be effective for modeling complex nonlinear environmental systems, and this study adopted such a technique to develop a model for predicting source water turbidity for DWTPs to allow DWTPs to make proactive interventions in advance. A random forest (RF) model used preprocessed (empirical mode decomposition and quartile rejecting) meteorological factors (wind speed, wind direction, air temperature, and rainfall) as the input variables, to establish the turbidity prediction of a lake with significant turbidity in China's South Tai Lake. The modeling process included four main stages: (1) source data analysis, (2) raw data preprocessing, (3) modeling and tuning, and (4) model evaluation. The results of the RF model indicated that the correlation coefficient between the predicted and actual sequences is over 0.7, and more than 55% of the predicted values could control the errors within 20% compared to the actual measured values, suggesting that machine learning techniques are suitable for predicting the turbidity of raw source water. It was found that the RF model can provide a modest performance boost because of its stronger capacity to capture nonlinear interactions in the data. The findings of this study can inform the development of turbidity prediction models using readily available meteorological forecast data. The model can be applied to other DWTPs using similar shallow lakes as water sources.
引用
收藏
页数:8
相关论文
共 28 条
[1]   Can machine language and artificial intelligence revolutionize process automation for water treatment and desalination? [J].
Al Aani, Saif ;
Bonny, Talal ;
Hasan, Shadi W. ;
Hilal, Nidal .
DESALINATION, 2019, 458 :84-96
[2]   Predicting Daily Urban Fine Particulate Matter Concentrations Using a Random Forest Model [J].
Brokamp, Cole ;
Jandarov, Roman ;
Hossain, Monir ;
Ryan, Patrick .
ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2018, 52 (07) :4173-4179
[3]   A hybrid evolutionary data driven model for river water quality early warning [J].
Burchard-Levine, Alejandra ;
Liu, Shuming ;
Vince, Francois ;
Li, Mingming ;
Ostfeld, Avi .
JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2014, 143 :8-16
[4]  
Chen Z., 2009, P 4 CHIN LAK FOR ANH
[5]   Boundary-processing-technicque in EMD method and Hilbert transform [J].
Deng, YJ ;
Wang, W ;
Qian, CC ;
Wang, Z ;
Dai, DJ .
CHINESE SCIENCE BULLETIN, 2001, 46 (11) :954-961
[6]   Hybrid PSO-SVM-based method for long-term forecasting of turbidity in the Nalon river basin: A case study in Northern Spain [J].
Garcia Nieto, P. J. ;
Garcia-Gonzalo, E. ;
Alonso Fernandez, J. R. ;
Diaz Muniz, C. .
ECOLOGICAL ENGINEERING, 2014, 73 :192-200
[7]  
Guo J, 2014, PREDICTION ANAL TURB
[8]   Turbidity Prediction in a River Basin by Using Artificial Neural Networks: A Case Study in Northern Spain [J].
Iglesias, C. ;
Torres, J. Martinez ;
Nieto, P. J. Garcia ;
Fernandez, J. R. Alonso ;
Muinz, C. Diaz ;
Pineiro, J. I. ;
Taboada, J. .
WATER RESOURCES MANAGEMENT, 2014, 28 (02) :319-331
[9]   Decision Tree-Based Data Mining and Rule Induction for Identifying High Quality Groundwater Zones to Water Supply Management: a Novel Hybrid Use of Data Mining and GIS [J].
Jeihouni, Mehrdad ;
Toomanian, Ara ;
Mansourian, Ali .
WATER RESOURCES MANAGEMENT, 2020, 34 (01) :139-154
[10]  
Kim Hyunil, 2020, [Journal of The Korean Society of Hazard Mitigation, 한국방재학회논문집], V20, P301, DOI 10.9798/KOSHAM.2020.20.6.301