Comparative analysis of ensemble learning algorithms in water quality prediction

被引:0
作者
Shah, Farman Ullah [1 ]
Khan, Afed Ullah [1 ]
Khan, Abdul Waris [1 ]
Ullah, Basir [1 ]
Khan, Muhammad Rashid [1 ]
Javed, Ihrar [1 ]
机构
[1] Univ Engn & Technol Peshawar, Dept Civil Engn, Bannu Campus, Bannu 28100, Pakistan
关键词
AdaBoost; CatBoost; LightGBM; random forest; water quality; XGBoost; SURFACE; MODEL;
D O I
10.2166/hydro.2024.071
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Water is essential for all life forms but is increasingly at risk of contamination. Monitoring water quality is crucial to protect ecosystems and public health. This study evaluates ensemble learning techniques - AdaBoost, Gradient Boost, XGBoost, CatBoost, and LightGBM - for predicting key water quality parameters in the Bara River Basin, Pakistan. Initially, a random forest model identified optimal input-target parameter combinations. Machine learning models were then developed and evaluated using R2, MSE, and MAE, with the best models selected via compromise programming. Results show XGBoost and Gradient Boost outperformed other methods. XGBoost achieved near-perfect R2 values for bicarbonate (HCO3), carbonate (CO3), and magnesium (Mg), while Gradient Boost excelled with parameters like electrical conductivity (EC), sulfate (SO4), temperature, and calcium (Ca). XGBoost demonstrated high training R2 values (0.999) but slightly lower testing R2 (e.g., 0.8636 for HCO3). Gradient Boost exhibited greater stability, maintaining high accuracy in both phases (e.g., Ca testing R2 = 0.9433). AdaBoost and CatBoost showed moderate performance for parameters like chloride (Cl) and pH, while CatBoost and LightGBM performed well for pH and dissolved solids but varied across other indicators. These findings underscore the potential of ensemble methods for accurate water quality prediction, aiding future management and environmental protection efforts.
引用
收藏
页码:3041 / 3059
页数:19
相关论文
共 47 条
[1]   Integrating feature extraction approaches with hybrid emotional neural networks for water quality index modeling [J].
Abba, S., I ;
Abdulkadir, R. A. ;
Sammen, Saad Sh ;
Pham, Quoc Bao ;
Lawan, A. A. ;
Esmaili, Parvaneh ;
Malik, Anurag ;
Al-Ansari, Nadhir .
APPLIED SOFT COMPUTING, 2022, 114
[2]   An Enhanced Water Quality Index for Water Quality Monitoring Using Remote Sensing and Machine Learning [J].
Ahmed, Mehreen ;
Mumtaz, Rafia ;
Anwar, Zahid .
APPLIED SCIENCES-BASEL, 2022, 12 (24)
[3]   Analysis of water quality indices and machine learning techniques for rating water pollution: a case study of Rawal Dam, Pakistan [J].
Ahmed, Mehreen ;
Mumtaz, Rafia ;
Mohammad, Syed .
WATER SUPPLY, 2021, 21 (06) :3225-3250
[4]   Efficient Water Quality Prediction Using Supervised Machine Learning [J].
Ahmed, Umair ;
Mumtaz, Rafia ;
Anwar, Hirra ;
Shah, Asad A. ;
Irfan, Rabia ;
Garcia-Nieto, Jose .
WATER, 2019, 11 (11)
[5]   Proposition of New Ensemble Data-Intelligence Models for Surface Water Quality Prediction [J].
Al-Sulttani, Ali Omran ;
Al-Mukhtar, Mustafa ;
Roomi, Ali B. ;
Farooque, Aitazaz Ahsan ;
Khedher, Khaled Mohamed ;
Yaseen, Zaher Mundher .
IEEE ACCESS, 2021, 9 :108527-108541
[6]   Evolutionary and ensemble machine learning predictive models for evaluation of water quality [J].
Aldrees, Ali ;
Javed, Muhammad Faisal ;
Taha, Abubakr Taha Bakheit ;
Mohamed, Abdeliazim Mustafa ;
Jasinski, Michal ;
Gono, Miroslava .
JOURNAL OF HYDROLOGY-REGIONAL STUDIES, 2023, 46
[7]  
Ali J., 2012, International Journal of Computer Science Issues(IJCSI), V9, P273
[8]  
Ali Z A., 2023, Acad. J. Nawroz Univ, V12, P320, DOI DOI 10.25007/AJNU.V12N2A1612
[9]   Comparative Assessment of Individual and Ensemble Machine Learning Models for Efficient Analysis of River Water Quality [J].
Alqahtani, Abdulaziz ;
Shah, Muhammad Izhar ;
Aldrees, Ali ;
Javed, Muhammad Faisal .
SUSTAINABILITY, 2022, 14 (03)
[10]   Water Quality Management Using Hybrid Machine Learning and Data Mining Algorithms: An Indexing Approach [J].
Aslam, Bilal ;
Maqsoom, Ahsen ;
Cheema, Ali Hassan ;
Ullah, Fahim ;
Alharbi, Abdullah ;
Imran, Muhammad .
IEEE ACCESS, 2022, 10 :119692-119705