Multi-ahead electrical conductivity forecasting of surface water based on machine learning algorithms

被引:18
作者
Kumar, Deepak [1 ]
Singh, Vijay Kumar [2 ]
Abed, Salwan Ali [3 ]
Tripathi, Vinod Kumar [1 ]
Gupta, Shivam [4 ]
Al-Ansari, Nadhir [5 ]
Vishwakarma, Dinesh Kumar [6 ]
Dewidar, Ahmed Z. [7 ,8 ]
Al-Othman, Ahmed A. [8 ]
Mattar, Mohamed A. [7 ,8 ,9 ]
机构
[1] Banaras Hindu Univ, Inst Agr Sci, Dept Agr Engn, BHU, Varanasi 221005, Uttar Pradesh, India
[2] Acharya Narendra Deva Univ Agr & Technol, Dept Soil & Water Conservat Engn, Ayodhya 224229, Uttar Pradesh, India
[3] Univ Al Qadisiyah, Coll Sci, POB 1895, Diwaniyah 58001, Iraq
[4] Acharya Narendra Deva Univ Agr & Technol, Dept Irrigat & Drainage Engn, Ayodhya 224229, Uttar Pradesh, India
[5] Lulea Univ Technol, Dept Civil Environm & Nat Resources Engn, S-97187 Lulea, Sweden
[6] Govind Ballabh Pant Univ Agr & Technol, Dept Irrigat & Drainage Engn, Pantnagar 263145, Uttarakhand, India
[7] King Saud Univ, Prince Sultan Inst Environm Water & Desert Res, Prince Sultan Bin Abdulaziz Int Prize Water Chair, Riyadh 11451, Saudi Arabia
[8] King Saud Univ, Coll Food & Agr Sci, Dept Agr Engn, Riyadh 11451, Saudi Arabia
[9] Agr Res Ctr, Agr Engn Res Inst AEnRI, Giza 12618, Egypt
关键词
Decision tree; Multilayer perceptron; Random forest; Co-adaptive neuro-fuzzy inference system; Electrical conductivity; ARTIFICIAL NEURAL-NETWORKS; DECISION TREE; PREDICTION; RIVER; QUALITY; MODELS; INTELLIGENCE; PARAMETERS; VARIABLES; PATTERN;
D O I
10.1007/s13201-023-02005-1
中图分类号
TV21 [水资源调查与水利规划];
学科分类号
081501 ;
摘要
The present research work focused on predicting the electrical conductivity (EC) of surface water in the Upper Ganga basin using four machine learning algorithms: multilayer perceptron (MLP), co-adaptive neuro-fuzzy inference system (CANFIS), random forest (RF), and decision tree (DT). The study also utilized the gamma test for selecting appropriate input and output combinations. The results of the gamma test revealed that total hardness (TH), magnesium (Mg), and chloride (Cl) parameters were suitable input variables for EC prediction. The performance of the models was evaluated using statistical indices such as Percent Bias (PBIAS), correlation coefficient (R), Willmott's index of agreement (WI), Index of Agreement (PI), root mean square error (RMSE) and Legate-McCabe Index (LMI). Comparing the results of the EC models using these statistical indices, it was observed that the RF model outperformed the other algorithms. During the training period, the RF algorithm has a small positive bias (PBIAS = 0.11) and achieves a high correlation with the observed values (R = 0.956). Additionally, it shows a low RMSE value (360.42), a relatively good coefficient of efficiency (CE = 0.932), PI (0.083), WI (0.908) and LMI (0.083). However, during the testing period, the algorithm's performance shows a small negative bias (PBIAS = - 0.46) and a good correlation (R = 0.929). The RMSE value decreases significantly (26.57), indicating better accuracy, the coefficient of efficiency remains high (CE = 0.915), PI (0.033), WI (0.965) and LMI (- 0.028). Similarly, the performance of the RF algorithm during the training and testing periods in Prayagraj. During the training period, the RF algorithm shows a PBIAS of 0.50, indicating a small positive bias. It achieves an RMSE of 368.3, R of 0.909, CE of 0.872, PI of 0.015, WI of 0.921, and LMI of 0.083. During the testing period, the RF algorithm demonstrates a slight negative bias with a PBIAS of - 0.06. The RMSE reduces significantly to 24.1, indicating improved accuracy. The algorithm maintains a high correlation (R = 0.903) and a good coefficient of efficiency (CE = 0.878). The index of agreement (PI) increases to 0.035, suggesting a better fit. The WI is 0.960, indicating high accuracy compared to the mean value, while the LMI decreases slightly to - 0.038. Based on the comparative results of the machine learning algorithms, it was concluded that RF performed better than DT, CANFIS, and MLP. The study recommended using the current month's total hardness (TH), magnesium (Mg), and chloride (Cl) parameters as input variables for multi-ahead forecasting of electrical conductivity (ECt+1, ECt+2, and ECt+3) in future studies in the Upper Ganga basin. The findings also indicated that RF and DT models had superior performance compared to MLP and CANFIS models. These models can be applied for multi-ahead forecasting of monthly electrical conductivity at both Varanasi and Prayagraj stations in the Upper Ganga basin.
引用
收藏
页数:20
相关论文
共 116 条
[1]   Sandstone groundwater salinization modelling using physicochemical variables in Southern Saudi Arabia: Application of novel data intelligent algorithms [J].
Abba, S. I. ;
Benaafi, Mohammed ;
Usman, A. G. ;
Aljundi, Isam H. .
AIN SHAMS ENGINEERING JOURNAL, 2023, 14 (03)
[2]   Performance of Machine Learning Techniques for Meteorological Drought Forecasting in the Wadi Mina Basin, Algeria [J].
Achite, Mohammed ;
Elshaboury, Nehal ;
Jehanzaib, Muhammad ;
Vishwakarma, Dinesh Kumar ;
Pham, Quoc Bao ;
Anh, Duong Tran ;
Abdelkader, Eslam Mohammed ;
Elbeltagi, Ahmed .
WATER, 2023, 15 (04)
[3]   Prediction of daily global solar radiation using different machine learning algorithms: Evaluation and comparison [J].
Agbulut, Umit ;
Gurel, Ali Etem ;
Bicen, Yunus .
RENEWABLE & SUSTAINABLE ENERGY REVIEWS, 2021, 135
[4]   An improved adaptive neuro fuzzy inference system model using conjoined metaheuristic algorithms for electrical conductivity prediction [J].
Ahmadianfar, Iman ;
Shirvani-Hosseini, Seyedehelham ;
He, Jianxun ;
Samadi-Koucheksaraee, Arvin ;
Yaseen, Zaher Mundher .
SCIENTIFIC REPORTS, 2022, 12 (01)
[5]   A novel Hybrid Wavelet-Locally Weighted Linear Regression (W-LWLR) Model for Electrical Conductivity (EC) Prediction in Surface Water [J].
Ahmadianfar, Iman ;
Jamei, Mehdi ;
Chu, Xuefeng .
JOURNAL OF CONTAMINANT HYDROLOGY, 2020, 232
[6]   Machine learning methods for better water quality prediction [J].
Ahmed, Ali Najah ;
Othman, Faridah Binti ;
Afan, Haitham Abdulmohsin ;
Ibrahim, Rusul Khaleel ;
Fai, Chow Ming ;
Hossain, Md Shabbir ;
Ehteram, Mohammad ;
Elshafie, Ahmed .
JOURNAL OF HYDROLOGY, 2019, 578
[7]   Prediction of wind drift and evaporation losses from sprinkler irrigation using neural network and multiple regression techniques [J].
Al-Ghobari, Hussein M. ;
El-Marazky, Mohamed S. ;
Dewidar, Ahmed Z. ;
Mattar, Mohamed A. .
AGRICULTURAL WATER MANAGEMENT, 2018, 195 :211-221
[8]  
[Anonymous], 2001, Springer Series in Statistics, DOI DOI 10.1007/978-0-387-21606-5
[9]  
[Anonymous], 1992, AI 92 P 5 AUSTR JOIN
[10]   Integrative neural networks model for prediction of sediment rating curve parameters for ungauged basins [J].
Atieh, M. ;
Mehltretter, S. L. ;
Gharabaghi, B. ;
Rudra, R. .
JOURNAL OF HYDROLOGY, 2015, 531 :1095-1107