Forecasting bacteriological presence in treated drinking water using machine learning

被引:4
作者
Kyritsakas, Grigorios [1 ]
Boxall, Joby [1 ]
Speight, Vanessa [1 ]
机构
[1] Univ Sheffield, Sheffield Water Ctr, Dept Civil & Struct Engn, Sheffield, England
来源
FRONTIERS IN WATER | 2023年 / 5卷
基金
英国工程与自然科学研究理事会;
关键词
drinking water treatment; machine learning; online flow cytometry; total cell counts prediction; forecasting model;
D O I
10.3389/frwa.2023.1199632
中图分类号
TV21 [水资源调查与水利规划];
学科分类号
081501 ;
摘要
A novel data-driven model for the prediction of bacteriological presence, in the form of total cell counts, in treated water exiting drinking water treatment plants is presented. The model was developed and validated using a year of hourly online flow cytometer data from an operational drinking water treatment plant. Various machine learning methods are compared (random forest, support vector machines, k-Nearest Neighbors, Feed-forward Artificial Neural Network, Long Short Term Memory and RusBoost) and different variables selection approaches are used to improve the model's accuracy. Results indicate that the model could accurately predict total cell counts 12 h ahead for both regression and classification-based forecasts-NSE = 0.96 for the best regression model, using the K-Nearest Neighbors algorithm, and Accuracy = 89.33% for the best classification model, using the combined random forest, K-neighbors and RusBoost algorithms. This forecasting horizon is sufficient to enable proactive operational interventions to improve the treatment processes, thereby helping to ensure safe drinking water.
引用
收藏
页数:15
相关论文
共 40 条
  • [1] Emerging evolutionary algorithm integrated with kernel principal component analysis for modeling the performance of a water treatment plant
    Abba, S., I
    Quoc Bao Pham
    Usman, A. G.
    Nguyen Thi Thuy Linh
    Aliyu, D. S.
    Quyen Nguyen
    Quang-Vu Bach
    [J]. JOURNAL OF WATER PROCESS ENGINEERING, 2020, 33
  • [2] Efficient Water Quality Prediction Using Supervised Machine Learning
    Ahmed, Umair
    Mumtaz, Rafia
    Anwar, Hirra
    Shah, Asad A.
    Irfan, Rabia
    Garcia-Nieto, Jose
    [J]. WATER, 2019, 11 (11)
  • [3] Online flow cytometry reveals microbial dynamics influenced by concurrent natural and operational events in groundwater used for drinking water treatment
    Besmer, Michael D.
    Epting, Jannis
    Page, Rebecca M.
    Sigrist, Jurg A.
    Huggenberger, Peter
    Hammes, Frederik
    [J]. SCIENTIFIC REPORTS, 2016, 6
  • [4] The feasibility of automated online flow cytometry for in-situ monitoring of microbial dynamics in aquatic ecosystems
    Besmer, Michael D.
    Weissbrodt, David G.
    Kratochvil, Bradley E.
    Sigrist, Juerg A.
    Weyland, Mathias S.
    Hammes, Frederik
    [J]. FRONTIERS IN MICROBIOLOGY, 2014, 5
  • [5] Bishop C. M., 2007, Pattern Recognition and Machine Learning Information Science and Statistics, V1st
  • [6] Boxall J., 2020, REAL TIME MONITORING
  • [7] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [8] Bryant M.A., 2016, EVALUATION STAT COMP
  • [9] CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
  • [10] Deep learning approach for sustainable WWTP operation: A case study on data-driven influent conditions monitoring
    Dairi, Abdelkader
    Cheng, Tuoyuan
    Harrou, Fouzi
    Sun, Ying
    Leiknes, TorOve
    [J]. SUSTAINABLE CITIES AND SOCIETY, 2019, 50