Performance analysis of the water quality index model for predicting water state using machine learning techniques

Cited by: 111
Authors
Uddin, Md Galal [1, 2, 3, 6]
Nash, Stephen [1, 2, 3]
Rahman, Azizur [4, 5]
Olbert, Agnieszka I. [1, 2, 3]
Affiliations
[1] Univ Galway, Sch Engn, Galway, Ireland
[2] Univ Galway, Ryan Inst, Galway, Ireland
[3] Univ Galway, MaREI Res Ctr, Galway, Ireland
[4] Charles Sturt Univ, Sch Comp Math & Engn, Wagga Wagga, Australia
[5] Charles Sturt Univ, Gulbali Inst Agr Water & Environm, Wagga Wagga, Australia
[6] Univ Galway, Coll Sci & Engn, Civil Engn, Galway, Ireland
Keywords
Water quality index; Coastal water quality classification; Model uncertainty; Classification algorithm; Cork Harbour; OPERATING CHARACTERISTIC CURVE; SUPPORT VECTOR MACHINES; NEURAL-NETWORKS; ROC; CLASSIFICATION; MULTICLASS;
DOI
10.1016/j.psep.2022.11.073
Chinese Library Classification number
X [Environmental science, safety science]
Discipline classification code
08; 0830
Abstract
Existing water quality index (WQI) models assess water quality using a range of classification schemes. Consequently, different methods give different interpretations of the same water properties, which contributes considerable uncertainty to the correct classification of water quality. The aim of this study was to evaluate the performance of the WQI model in classifying coastal water quality correctly using a completely new classification scheme. The study used Cork Harbour water quality data collected by Ireland's Environmental Protection Agency (EPA). Machine-learning classifier algorithms, including support vector machines (SVM), Naive Bayes (NB), random forest (RF), k-nearest neighbour (KNN), and gradient boosting (XGBoost), were used to identify the best classifier for predicting water quality classes with seven WQI models, comprising widely used models and three completely new models recently proposed by the authors. The KNN (100% correct and 0% wrong) and XGBoost (99.9% correct and 0.1% wrong) algorithms outperformed the other classifiers in predicting water quality accurately for the seven WQI models. The model validation results indicate that the XGBoost classifier performed best in predicting the correct classification of water quality, with accuracy (1.0), precision (0.99), sensitivity (0.99), specificity (1.0), and F1 score (0.99). Moreover, compared with the other WQI models, the weighted quadratic mean (WQM) and unweighted root mean square (RMS) WQI models showed higher prediction accuracy, precision, sensitivity, specificity, and F1 score for each class. These findings show that the WQM and RMS models could be effective and reliable for assessing coastal water quality in terms of correct classification. Therefore, this study could help provide accurate water quality information to researchers, policymakers, and water research personnel, enabling more effective monitoring with the WQI model.
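The validation metrics named in the abstract (accuracy, precision, sensitivity, specificity, and F1 score) can all be derived from a multiclass confusion matrix by treating each class one-vs-rest. The following is a minimal sketch of that calculation, not the authors' code: the function names and the three-class matrix are hypothetical, and the counts are made up purely for illustration.

```python
def accuracy(confusion):
    """Overall accuracy: correctly classified samples / all samples.

    confusion[i][j] = number of samples of true class i predicted as class j.
    """
    total = sum(sum(row) for row in confusion)
    correct = sum(confusion[i][i] for i in range(len(confusion)))
    return correct / total


def per_class_metrics(confusion, k):
    """One-vs-rest precision, sensitivity, specificity and F1 for class k."""
    n = len(confusion)
    total = sum(sum(row) for row in confusion)
    tp = confusion[k][k]                                  # true positives
    fn = sum(confusion[k]) - tp                           # class k missed
    fp = sum(confusion[i][k] for i in range(n)) - tp      # wrongly labelled k
    tn = total - tp - fn - fp                             # everything else

    precision = tp / (tp + fp) if (tp + fp) else 0.0
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0    # also called recall
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    denom = precision + sensitivity
    f1 = 2 * precision * sensitivity / denom if denom else 0.0
    return {
        "precision": precision,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "f1": f1,
    }


# Hypothetical confusion matrix for three water quality classes
# (illustrative counts only, not data from the study).
confusion = [
    [50, 2, 0],
    [3, 40, 1],
    [0, 1, 30],
]

if __name__ == "__main__":
    print("accuracy:", round(accuracy(confusion), 4))
    for k in range(len(confusion)):
        print("class", k, per_class_metrics(confusion, k))
```

Reporting the metrics per class, as the abstract does, shows whether a classifier that scores well overall still confuses particular water quality classes.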
Pages: 808-828
Number of pages: 21
Related papers
50 in total
  • [2] Machine Learning Algorithms for Predicting the Water Quality Index
    Hussein, Enas E.
    Baloch, Muhammad Yousuf Jat
    Nigar, Anam
    Abualkhair, Hussain F.
    Aldawood, Faisal Khaled
    Tageldin, Elsayed
    WATER, 2023, 15 (20)
  • [3] Recognizing Safe Drinking Water and Predicting Water Quality Index using Machine Learning Framework
    Torky, Mohamed
    Bakhiet, Ali
    Bakrey, Mohamed
    Ismail, Ahmed Adel
    EL Seddawy, Ahmed I. B.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (01) : 23 - 33
  • [4] Machine learning models for predicting water quality index: optimization and performance analysis for El Moghra, Egypt
    Elshaarawy, Mohamed Kamel
    Eltarabily, Mohamed Galal
    Water Supply, 2024, 24 (09) : 3269 - 3294
  • [5] Predicting and Analyzing Water Quality using Machine Learning: A Comprehensive Model
    Khan, Yafra
    See, Chai Soo
    2016 IEEE LONG ISLAND SYSTEMS, APPLICATIONS AND TECHNOLOGY CONFERENCE (LISAT), 2016
  • [6] Predicting water quality index using machine learning techniques: a case study of river Ganga in Haridwar, India
    Sumita Lamba
    Ishaan Dawar
    Maanas Singal
    Jabrinder Singh
    Earth Science Informatics, 2025, 18 (2)
  • [7] A novel approach for estimating and predicting uncertainty in water quality index model using machine learning approaches
    Uddin, Md Galal
    Nash, Stephen
    Rahman, Azizur
    Olbert, Agnieszka I.
    WATER RESEARCH, 2023, 229
  • [8] Evaluating the performance of machine learning approaches in predicting Albanian Shkumbini River's waters using water quality index model
    Basha, Lule
    Shyti, Bederiana
    Bekteshi, Lirim
    JOURNAL OF ENVIRONMENTAL ENGINEERING AND LANDSCAPE MANAGEMENT, 2024, 32 (02) : 117 - 127
  • [9] Robust machine learning algorithms for predicting coastal water quality index
    Uddin, Md Galal
    Nash, Stephen
    Diganta, Mir Talas Mahammad
    Rahman, Azizur
    Olbert, Agnieszka I.
    JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2022, 321
  • [10] An advanced deep learning model for predicting water quality index
    Ehteram, Mohammad
    Ahmed, Ali Najah
    Sherif, Mohsen
    El-Shafie, Ahmed
    ECOLOGICAL INDICATORS, 2024, 160