Integration of Multivariate Adaptive Regression Splines and Weighted Arithmetic Water Quality Index Methods for Drinking Water Quality Analysis

被引:4
|
作者
Jumber, Marshet B. [1 ]
Damtie, Menwagaw T. [1 ]
Tegegne, Desalegn [2 ]
机构
[1] Debre Tabor Univ, Hydraul & Water Resources Engn, Debre Tabor 272, Ethiopia
[2] Int Water Management Inst IWMI, Addis Ababa, Ethiopia
关键词
Water quality index; Machine learning; R programming; Weighted arithmetic WQI; PREDICTION;
D O I
10.1007/s41101-024-00239-x
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The water quality index (WQI) is a widely used tool for assessing water quality of various water bodies, but it has drawn criticism for being transferable globally and taking a more physical approach. The present study aimed to assess the water quality indices using weighted arithmetic method and investigate alternative approach to improve the prediction accuracy of WQI by applying machine learning algorithms. These include artificial neural networks, decision trees, random forest, gradient boosting machine, multivariate adaptive regression splines (MARS), Gaussian process with radial basic function, support vector machine with radial basic function, a hybrid of Bayesian and ridge regression, and K-nearest neighbor; the spatiotemporal assessment of the WQI revealed a considerable fluctuation that requires further research to determine the potential causes. The water quality dataset was split into training (70%) and testing (30%) datasets, and the tenfold cross-validation technique was utilized to compare models and optimize hyperparameters on various subsets of the dataset. The study result revealed that almost all of the deployed machine learning models performed well on the training dataset. The multivariate adaptive regression spline (MARS) model outperformed others during both the training and testing phases (RMSE = 0.044, R2 = 0.89, and MAE = 0.025; RMSE = 0.090, R2 = 0.87, and MAE = 0.061 respectively), with the normalized dataset. The worst prediction performance in the test dataset was attained by kernel-based models such as the Gaussian process and support vector machine, which was possibly the effect of overfitting during the model-building process. A MARS model equation, employing three strongly impacting water quality parameters, including E. coli, free residual chlorine, and turbidity, was finally suggested to predict the water quality index for drinking purposes.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Assessment the Quality of Bottled Drinking Water Through Mamdani Fuzzy Water Quality Index
    Ghorban Asgari
    Ensieh Komijani
    Abdolmotaleb Seid-Mohammadi
    Mohammad Khazaei
    Water Resources Management, 2021, 35 : 5431 - 5452
  • [42] Assessment the Quality of Bottled Drinking Water Through Mamdani Fuzzy Water Quality Index
    Asgari, Ghorban
    Komijani, Ensieh
    Seid-Mohammadi, Abdolmotaleb
    Khazaei, Mohammad
    WATER RESOURCES MANAGEMENT, 2021, 35 (15) : 5431 - 5452
  • [43] Assessment of water quality in Hawkesbury-Nepean River in Sydney using water quality index and multivariate analysis
    Haque, M. M.
    Kader, F.
    Kuruppu, U.
    Rahman, A.
    21ST INTERNATIONAL CONGRESS ON MODELLING AND SIMULATION (MODSIM2015), 2015, : 2493 - 2499
  • [44] DETERMINING OF WATER QUALITY BY USING MULTIVARIATE ANALYSIS TECHNIQUES IN A DRINKING/USING WATER RESERVOIR IN TURKEY
    Elipek, Belgin Camur
    Guher, Huseyin
    Oterler, Burak
    Divrik, Menekse Tas
    Mimiroglu, Pinar Altinoluk
    FRESENIUS ENVIRONMENTAL BULLETIN, 2017, 26 (08): : 5007 - 5012
  • [45] Costs and benefits of the development methods of drinking water quality index: A systematic review
    Han, Xue
    Liu, Xiaohui
    Gao, Datian
    Ma, Bingjie
    Gao, Xiaoyu
    Cheng, Mengke
    ECOLOGICAL INDICATORS, 2022, 144
  • [46] Predicting the Tigris River water quality within Baghdad, Iraq by using water quality index and regression analysis
    Ewaid, Salam Hussein
    Abed, Salwan Ali
    Kadhum, Safaa A.
    ENVIRONMENTAL TECHNOLOGY & INNOVATION, 2018, 11 : 390 - 398
  • [47] Ground Water Quality and Multivariate Statistical Methods
    Viswanath N.C.
    Kumar P.G.D.
    Ammad K.K.
    Kumari E.R.U.
    Environmental Processes, 2015, 2 (2) : 347 - 360
  • [48] The water quality and pollution sources assessment of Surma river, Bangladesh using, hydrochemical, multivariate statistical and water quality index methods
    Howladar, M. Farhad
    Chakma, Elora
    Koley, Nusrat Jahan
    Islam, Sabina
    Al Numanbakth, Md Abdullah
    Ahmed, Zia
    Chowdhury, Tayabur Rashid
    Akter, Shetu
    GROUNDWATER FOR SUSTAINABLE DEVELOPMENT, 2021, 12
  • [49] Use of water quality index and multivariate statistical methods for the evaluation of water quality of a stream affected by multiple stressors: A case study
    Varol, Memet
    ENVIRONMENTAL POLLUTION, 2020, 266
  • [50] Hydrogeochemistry and Water Quality Index in the Assessment of Groundwater Quality for Drinking Uses
    Batabyal, Asit Kumar
    Chakraborty, Surajit
    WATER ENVIRONMENT RESEARCH, 2015, 87 (07) : 607 - 617