Prediction of total organic carbon and E. coli in rivers within the Milwaukee River basin using machine learning methods

被引:7
作者
Nafsin, Nabila [1 ]
Li, Jin [1 ]
机构
[1] Univ Wisconsin Milwaukee, Dept Civil & Environm Engn, Milwaukee, WI 53211 USA
来源
ENVIRONMENTAL SCIENCE-ADVANCES | 2023年 / 2卷 / 02期
关键词
WATER-QUALITY PARAMETERS; INDICATOR BACTERIA; EVENT DETECTION; MODELS;
D O I
10.1039/d2va00285j
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Urban water undergoes physical and chemical changes due to various contaminants from point sources and non-point sources, including organic matter pollution and fecal bacterial contamination. Machine learning (ML) algorithms can be used as potential tools in surface water quality monitoring due to their capacity of finding underlying patterns and non-linear relationships among water quality parameters, unattainable by traditional or process-based water quality analysis. In this study, several standalone ML models such as artificial neural network (ANN), support vector machine (SVM), gradient boosting machine (GBM), random forest (RF) and ensemble-hybrid models such as RF-SVM, ANN-SVM, GBM-SVM, RF-ANN, GBM-ANN, and RF-GBM were developed for predicting total organic carbon (TOC) and E. coli in the Milwaukee River system. The significance of the study is the application of the ensemble-hybrid models for TOC and bacterial contamination prediction for the first time, which provides a reliable and direct approach to complement existing monitoring techniques in the Milwaukee River system with satisfactory prediction accuracies. The ensemble-hybrid models for TOC prediction resulted in R-2 values within a range of 0.95-0.97. However, for E. coli prediction it was difficult to explain the greater amount of unexplained variation in bacterial data based on the physicochemical water quality parameters, resulting in R-2 values within a range of 0.29-0.42. The hybrid model ANN-GBM outperformed others for both TOC and E. coli with prediction accuracies of 97% and 42%, respectively. An attempt was made to explain the variability in living microorganism behavior based on specific physicochemical parameters by developing prediction models for E. coli.
引用
收藏
页码:278 / 293
页数:16
相关论文
共 41 条
  • [1] Machine learning methods for better water quality prediction
    Ahmed, Ali Najah
    Othman, Faridah Binti
    Afan, Haitham Abdulmohsin
    Ibrahim, Rusul Khaleel
    Fai, Chow Ming
    Hossain, Md Shabbir
    Ehteram, Mohammad
    Elshafie, Ahmed
    [J]. JOURNAL OF HYDROLOGY, 2019, 578
  • [2] Efficient Water Quality Prediction Using Supervised Machine Learning
    Ahmed, Umair
    Mumtaz, Rafia
    Anwar, Hirra
    Shah, Asad A.
    Irfan, Rabia
    Garcia-Nieto, Jose
    [J]. WATER, 2019, 11 (11)
  • [3] Nitrogen limitation of decomposition and decay: How can it occur?
    Averill, Colin
    Waring, Bonnie
    [J]. GLOBAL CHANGE BIOLOGY, 2018, 24 (04) : 1417 - 1427
  • [4] Predicting river water quality index using data mining techniques
    Babbar, Richa
    Babbar, Sakshi
    [J]. ENVIRONMENTAL EARTH SCIENCES, 2017, 76 (14)
  • [5] Machine learning methods for imbalanced data set for prediction of faecal contamination in beach waters
    Bourel, Mathias
    Segura, Angel M.
    Crisci, Carolina
    Lopez, Guzman
    Sampognaro, Lia
    Vidal, Victoria
    Kruk, Carla
    Piccini, Claudia
    Perera, Gonzalo
    [J]. WATER RESEARCH, 2021, 202
  • [6] Burzynski M., 2001, PUBL WT 704 2001
  • [7] Comparative analysis of surface water quality prediction performance and identification of key water parameters using different machine learning models based on big data
    Chen, Kangyang
    Chen, Hexia
    Zhou, Chuanlong
    Huang, Yichao
    Qi, Xiangyang
    Shen, Ruqin
    Liu, Fengrui
    Zuo, Min
    Zou, Xinyi
    Wang, Jinfeng
    Zhang, Yan
    Chen, Da
    Chen, Xingguo
    Deng, Yongfeng
    Ren, Hongqiang
    [J]. WATER RESEARCH, 2020, 171 (171)
  • [8] Development of Regression-Based Models to Predict Fecal Bacteria Numbers at Select Sites within the Illinois River Watershed, Arkansas and Oklahoma, USA
    David, Morgan M.
    Haggard, Brian E.
    [J]. WATER AIR AND SOIL POLLUTION, 2011, 215 (1-4) : 525 - 547
  • [9] Improving prediction of water quality indices using novel hybrid machine -learning algorithms
    Duie Tien Bui
    Khosravi, Khabat
    Tiefenbacher, John
    Nguyen, Hoang
    Kazakis, Nerantzis
    [J]. SCIENCE OF THE TOTAL ENVIRONMENT, 2020, 721
  • [10] Prediction of water quality parameters of Karoon River (Iran) by artificial intelligence-based models
    Emamgholizadeh, S.
    Kashi, H.
    Marofpoor, I.
    Zalaghi, E.
    [J]. INTERNATIONAL JOURNAL OF ENVIRONMENTAL SCIENCE AND TECHNOLOGY, 2014, 11 (03) : 645 - 656