Appraisal of machine learning techniques for predicting emerging disinfection byproducts in small water distribution networks

被引:13
|
作者
Hu, Guangji [1 ,2 ,4 ]
Mian, Haroon R. [2 ]
Mohammadiun, Saeed [2 ]
Rodriguez, Manuel J. [3 ]
Hewage, Kasun [2 ]
Sadiq, Rehan [2 ]
机构
[1] Qingdao Univ, Sch Environm Sci & Engn, Qingdao 266071, Shandong, Peoples R China
[2] Univ British Columbia Okanagan, Sch Engn, 3333 Univ Way, Kelowna, BC V1V 1V7, Canada
[3] Bibliotheque Univ Laval, Ecole Super Amenagement Terr & Dev Reg ESAD, 2325, Quebec City, PQ G1V 0A6, Canada
[4] Qingdao Univ, Sch Environm Sci & Engn, 308 Ningxia Rd, Qingdao 266071, Shandong, Peoples R China
基金
加拿大自然科学与工程研究理事会;
关键词
Emerging disinfection byproducts; Water quality modeling; Small water distribution networks; Support vector regression; Neural networks; DRINKING-WATER; DBPS; REGRESSION; REGION; MODELS;
D O I
10.1016/j.jhazmat.2022.130633
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Monitoring emerging disinfection byproducts (DBPs) is challenging for many small water distribution networks (SWDNs), and machine learning-based predictive modeling could be an alternative solution. In this study, eleven machine learning techniques, including three multivariate linear regression-based, three regression tree-based, three neural networks-based, and two advanced non-parametric regression techniques, are used to develop models for predicting three emerging DBPs (dichloroacetonitrile, chloropicrin, and trichloropropanone) in SWDNs. Predictors of the models include commonly-measured water quality parameters and two conventional DBP groups. Sampling data of 141 cases were collected from eleven SWDNs in Canada, in which 70 % were randomly selected for model training and the rest were used for validation. The modeling process was reiterated 1000 times for each model. The results show that models developed using advanced regression techniques, including support vector regression and Gaussian process regression, exhibited the best prediction performance. Support vector regression models showed the highest prediction accuracy (R2 = 0.94) and stability for predicting dichloroacetonitrile and trichloropropanone, and Gaussian process regression models are optimal for predicting chloropicrin (R2 = 0.92). The difference is likely due to the much lower concentrations of chloropicrin than dichloroacetonitrile and trichloropropanone. Advanced non-parametric regression techniques, characterized by a probabilistic nature, were identified as most suitable for developing the predictive models, followed by neural network-based (e.g., generalized regression neural network), regression tree-based (e.g., random forest), and multivariate linear regression-based techniques. This study identifies promising machine learning techniques among many commonly-used alternatives for monitoring emerging DBPs in SWDNs under data constraints.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Performance analysis of the water quality index model for predicting water state using machine learning techniques
    Uddin, Md Galal
    Nash, Stephen
    Rahman, Azizur
    Olbert, Agnieszka I.
    PROCESS SAFETY AND ENVIRONMENTAL PROTECTION, 2023, 169 : 808 - 828
  • [22] Predicting and understanding residential water use with interpretable machine learning
    Rachunok, Benjamin
    Verma, Aniket
    Fletcher, Sarah
    ENVIRONMENTAL RESEARCH LETTERS, 2024, 19 (01)
  • [23] Closing the gap of known and unknown halogenated nitrogenous disinfection byproducts in water: Advanced mass spectrometry techniques
    Craven, Caley B.
    Tang, Yanan
    Carroll, Kristin
    An, Lirong
    Chen, Bin
    Li, Xing-Fang
    TRAC-TRENDS IN ANALYTICAL CHEMISTRY, 2022, 153
  • [24] Predicting bank insolvencies using machine learning techniques
    Petropoulos, Anastasios
    Siakoulis, Vasilis
    Stavroulakis, Evangelos
    Vlachogiannakis, Nikolaos E.
    INTERNATIONAL JOURNAL OF FORECASTING, 2020, 36 (03) : 1092 - 1113
  • [25] The occurrence and transformation behaviors of disinfection byproducts in drinking water distribution systems in rural areas of eastern China
    Yu, Ying
    Ma, Xu
    Chen, Ruya
    Li, Guiwei
    Tao, Hui
    Shi, Baoyou
    CHEMOSPHERE, 2019, 228 : 101 - 109
  • [26] Enhanced iodinated disinfection byproducts formation in iodide/ iodate-containing water undergoing UV-chloramine sequential disinfection: Machine learning-aided identification of reaction mechanisms
    Luo, Zhen-Ning
    He, Huan
    Zhang, Tian-Yang
    Wei, Xiu-Li
    Dong, Zheng-Yu
    Xu, Meng-Yuan
    Zhao, Heng-Xuan
    Zheng, Zheng-Xiong
    Pan, Ren-Jie
    Hu, Chen-Yan
    Zeng, Chao
    El-Din, Mohamed Gamal
    Xu, Bin
    WATER RESEARCH, 2025, 272
  • [27] Comparison of machine learning techniques for predicting porosity of chalk
    Nourani, Meysam
    Alali, Najeh
    Samadianfard, Saeed
    Band, Shahab S.
    Chau, Kwok-wing
    Shu, Chi-Min
    JOURNAL OF PETROLEUM SCIENCE AND ENGINEERING, 2022, 209
  • [28] Chloramines in a pilot-scale water distribution system: Transformation of 17β-estradiol and formation of disinfection byproducts
    He, Guilin
    Li, Cong
    Dong, Feilong
    Zhang, Tuqiao
    Chen, Long
    Cizmas, Leslie
    Sharma, Virender K.
    WATER RESEARCH, 2016, 106 : 41 - 50
  • [29] Relative Assessment of Selected Machine Learning Techniques for Predicting Aerodynamic Coefficients of Airfoil
    Ahmed, Shakeel
    Kamal, Khurram
    Ratlamwala, Tahir Abdul Hussain
    IRANIAN JOURNAL OF SCIENCE AND TECHNOLOGY-TRANSACTIONS OF MECHANICAL ENGINEERING, 2024, 48 (04) : 1917 - 1935
  • [30] Predicting Delinquency on Mortgage Loans: An Exhaustive Parametric Comparison of Machine Learning Techniques
    Ali, S. E. Azhar
    Rizvi, S. S. H.
    Lai, F.
    Ali, R. Faizan
    Jan, Ali
    INTERNATIONAL JOURNAL OF INDUSTRIAL ENGINEERING AND MANAGEMENT, 2021, 12 (01): : 1 - 13