Integrating machine learning models with cross-validation and bootstrapping for evaluating groundwater quality in Kanchanaburi province, Thailand

被引:9
|
作者
Thanh, Nguyen Ngoc [1 ]
Chotpantarat, Srilert [2 ,3 ]
Ngu, Nguyen Huu [1 ]
Thunyawatcharakul, Pongsathorn [4 ]
Kaewdum, Narongsak [5 ]
机构
[1] Hue Univ, Univ Agr & Forestry, 102 Phung Hung Str, Hue City 53000, Thua Thien Hue, Vietnam
[2] Chulalongkorn Univ, Fac Sci, Dept Geol, Bangkok 10330, Thailand
[3] Chulalongkorn Univ, Environm Res Inst, Ctr Excellence Environm Innovat & Management Met E, Phayathai Rd, Bangkok 10330, Thailand
[4] Chulalongkorn Univ, Grad Sch, Int Postgrad Program Hazardous Subst & Environm Ma, Bangkok 10330, Thailand
[5] Mahidol Univ, Geosci Program, Kanchanaburi Campus, Kanchanaburi 71150, Thailand
关键词
Groundwater quality; Random forest; Artificial neural network; Kanchanaburi province; Thailand; ARTIFICIAL NEURAL-NETWORK; RANDOM FOREST; LAND-USE; WATER; GIS; PREDICTION; MANAGEMENT; SURFACE; REGION; INDEX;
D O I
10.1016/j.envres.2024.118952
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Exploring the potential of new models for mapping groundwater quality presents a major challenge in water resource management, particularly in Kanchanaburi Province, Thailand, where groundwater faces contamination risks. This study aimed to explore the applicability of random forest (RF) and artificial neural networks (ANN) models to predict groundwater quality. Particularly, these two models were integrated into crossvalidation (CV) and bootstrapping (B) techniques to build predictive models, including RF-CV, RF-B, ANN-CV, and ANN-B. Entropy groundwater quality index (EWQI) was converted to normalized EWQI which was then classified into five levels from very poor to very good. A total of twelve physicochemical parameters from 180 groundwater wells, including potassium, sodium, calcium, magnesium, chloride, sulfate, bicarbonate, nitrate, pH, electrical conductivity, total dissolved solids, and total hardness, were investigated to decipher groundwater quality in the eastern part of Kanchanaburi Province, Thailand. Our results indicated that groundwater quality in the study area was primarily polluted by calcium, magnesium, and bicarbonate and that the RF-CV model (RMSE = 0.06, R2 = 0.87, MAE = 0.04) outperformed the RF-B (RMSE = 0.07, R2 = 0.80, MAE = 0.04), ANN-CV (RMSE = 0.09, R2 = 0.70, MAE = 0.06), and ANN-B (RMSE = 0.10, R2 = 0.67, MAE = 0.06). Our findings highlight the superiority of the RF models over the ANN models based on the CV and B techniques. In addition, the role of groundwater parameters to the normalized EWQI in various machine learning models was found. The groundwater quality map created by the RF-CV model can be applied to orient groundwater use.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Random Cross-Validation Produces Biased Assessment of Machine Learning Performance in Regional Landslide Susceptibility Prediction
    Kumar, Chandan
    Walton, Gabriel
    Santi, Paul
    Luza, Carlos
    REMOTE SENSING, 2025, 17 (02)
  • [42] Comparative Assessment of Machine Learning Models for Groundwater Quality Prediction Using Various ParametersComparative Assessment of Machine Learning Models for Groundwater Quality Prediction Using Various ParametersNiazkar et al.
    Majid Niazkar
    Reza Piraei
    Mohammad Reza Goodarzi
    Mohammad Javad Abedi
    Environmental Processes, 2025, 12 (1)
  • [43] Machine learning predictive insight of water pollution and groundwater quality in the Eastern Province of Saudi Arabia
    Jibrin, Abdulhayat M.
    Al-Suwaiyan, Mohammad
    Aldrees, Ali
    Dan'azumi, Salisu
    Usman, Jamilu
    Abba, Sani I.
    Yassin, Mohamed A.
    Scholz, Miklas
    Sammen, Saad Sh.
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [44] Prediction quality of cattle behavior traits evaluated through different cross-validation strategies using wearable sensor data and machine learning algorithms
    Ribeiro, Leonardo Augusto Coelho
    Bresolin, Tiago
    Rosa, Guilherme J. M.
    Casagrande, Daniel Rume
    Camargo Danes, Marina De Arruda
    Dorea, Joao R.
    JOURNAL OF ANIMAL SCIENCE, 2020, 98 : 383 - 383
  • [45] Spatial uncertainty of groundwater-vulnerability predictions assessed by a cross-validation strategy: an application to nitrate concentrations in the Province of Milan, northern Italy
    Fabbri, A. G.
    Cavallin, A.
    Masetti, M.
    Poli, S.
    Sterlacchini, S.
    Chung, C. J.
    RISK ANALYSIS VII: SIMULATION AND HAZARD MITIGATION & BROWNFIELDS V: PREVENTION, ASSESSMENT, REHABILITATION AND DEVELOPMENT OF BROWNFIELD SITES, 2010, : PI497 - PI514
  • [46] Applying Multivariate Analysis and Machine Learning Approaches to Evaluating Groundwater Quality on the Kairouan Plain, Tunisia
    Salem, Sarra Bel Haj
    Gaagai, Aissam
    Ben Slimene, Imed
    Moussa, Amor Ben
    Zouari, Kamel
    Yadav, Krishna Kumar
    Eid, Mohamed Hamdy
    Abukhadra, Mostafa R.
    El-Sherbeeny, Ahmed M.
    Gad, Mohamed
    Farouk, Mohamed
    Elsherbiny, Osama
    Elsayed, Salah
    Bellucci, Stefano
    Ibrahim, Hekmat
    WATER, 2023, 15 (19)
  • [47] A novel way to use cross-validation to measure connectivity by machine learning allows epilepsy surgery outcome prediction
    Ivankovic, Karla
    Principe, Alessandro
    Montoya-Galvez, Justo
    Manubens-Gil, Linus
    Zucca, Riccardo
    Villoslada, Pablo
    Dierssen, Mara
    Rocamora, Rodrigo
    NEUROIMAGE, 2025, 306
  • [48] Theranostic markers for personalized therapy of spider phobia: Methods of a bicentric external cross-validation machine learning approach
    Schwarzmeier, Hanna
    Leehr, Elisabeth Johanna
    Boehnlein, Joscha
    Seeger, Fabian Reinhard
    Roesmann, Kati
    Gathmann, Bettina
    Herrmann, Martin J.
    Siminski, Niklas
    Junghoefer, Markus
    Straube, Thomas
    Grotegerd, Dominik
    Dannlowski, Udo
    INTERNATIONAL JOURNAL OF METHODS IN PSYCHIATRIC RESEARCH, 2020, 29 (02)
  • [49] Environmental Cross-Validation of NLOS Machine Learning Classification/Mitigation with Low-Cost UWB Positioning Systems
    Barral, Valentin
    Escudero, Carlos J.
    Garcia-Naya, Jose A.
    Suarez-Casal, Pedro
    SENSORS, 2019, 19 (24)
  • [50] Theranostic markers for personalized therapy of spider phobia: Methods of a bicentric external cross-validation machine learning approach
    Leehr, E.
    Schwarzmeier, H.
    Boehnlein, J.
    Seeger, F.
    Roesmann, K.
    Gathmann, B.
    Herrmann, M.
    Siminski, N.
    Junghoefer, M.
    Straube, T.
    Lueken, U.
    Dannlowski, U.
    JOURNAL OF NEURAL TRANSMISSION, 2019, 126 (11) : 1575 - 1575