Integrating machine learning models with cross-validation and bootstrapping for evaluating groundwater quality in Kanchanaburi province, Thailand

被引:9
|
作者
Thanh, Nguyen Ngoc [1 ]
Chotpantarat, Srilert [2 ,3 ]
Ngu, Nguyen Huu [1 ]
Thunyawatcharakul, Pongsathorn [4 ]
Kaewdum, Narongsak [5 ]
机构
[1] Hue Univ, Univ Agr & Forestry, 102 Phung Hung Str, Hue City 53000, Thua Thien Hue, Vietnam
[2] Chulalongkorn Univ, Fac Sci, Dept Geol, Bangkok 10330, Thailand
[3] Chulalongkorn Univ, Environm Res Inst, Ctr Excellence Environm Innovat & Management Met E, Phayathai Rd, Bangkok 10330, Thailand
[4] Chulalongkorn Univ, Grad Sch, Int Postgrad Program Hazardous Subst & Environm Ma, Bangkok 10330, Thailand
[5] Mahidol Univ, Geosci Program, Kanchanaburi Campus, Kanchanaburi 71150, Thailand
关键词
Groundwater quality; Random forest; Artificial neural network; Kanchanaburi province; Thailand; ARTIFICIAL NEURAL-NETWORK; RANDOM FOREST; LAND-USE; WATER; GIS; PREDICTION; MANAGEMENT; SURFACE; REGION; INDEX;
D O I
10.1016/j.envres.2024.118952
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Exploring the potential of new models for mapping groundwater quality presents a major challenge in water resource management, particularly in Kanchanaburi Province, Thailand, where groundwater faces contamination risks. This study aimed to explore the applicability of random forest (RF) and artificial neural networks (ANN) models to predict groundwater quality. Particularly, these two models were integrated into crossvalidation (CV) and bootstrapping (B) techniques to build predictive models, including RF-CV, RF-B, ANN-CV, and ANN-B. Entropy groundwater quality index (EWQI) was converted to normalized EWQI which was then classified into five levels from very poor to very good. A total of twelve physicochemical parameters from 180 groundwater wells, including potassium, sodium, calcium, magnesium, chloride, sulfate, bicarbonate, nitrate, pH, electrical conductivity, total dissolved solids, and total hardness, were investigated to decipher groundwater quality in the eastern part of Kanchanaburi Province, Thailand. Our results indicated that groundwater quality in the study area was primarily polluted by calcium, magnesium, and bicarbonate and that the RF-CV model (RMSE = 0.06, R2 = 0.87, MAE = 0.04) outperformed the RF-B (RMSE = 0.07, R2 = 0.80, MAE = 0.04), ANN-CV (RMSE = 0.09, R2 = 0.70, MAE = 0.06), and ANN-B (RMSE = 0.10, R2 = 0.67, MAE = 0.06). Our findings highlight the superiority of the RF models over the ANN models based on the CV and B techniques. In addition, the role of groundwater parameters to the normalized EWQI in various machine learning models was found. The groundwater quality map created by the RF-CV model can be applied to orient groundwater use.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Cross-validation and cross-study validation of kidney cancer with machine learning and whole exome sequences from the National Cancer Institute
    Aljouie, Abdulrhman
    Patel, Nihir
    Roshan, Usman
    2018 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY (CIBCB), 2018, : 61 - 66
  • [32] A combination of time-scale calculus and a cross-validation technique used in fitting and evaluating fractional models
    Almusharff, Amera
    Ngoc Nguyen
    APPLIED MATHEMATICS LETTERS, 2012, 25 (03) : 550 - 554
  • [33] Challenges of machine learning model validation using correlated behaviour data: Evaluation of cross-validation strategies and accuracy measures
    Ferdinandy, Bence
    Gerencser, Linda
    Corrieri, Luca
    Perez, Paula
    Ujvary, Dora
    Csizmadia, Gabor
    Miklosi, Adam
    PLOS ONE, 2020, 15 (07):
  • [34] Optimal learning of p-layer additive F0 models with cross-validation
    Sakai, Shinsuke
    Kawahara, Tatsuya
    Shimizu, Tohru
    Nakamura, Satoshi
    ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2009, : 4245 - 4248
  • [35] OPTIMAL LEARNING OF P-LAYER ADDITIVE F0 MODELS WITH CROSS-VALIDATION
    Sakai, Shinsuke
    Kawahara, Tatsuya
    Shimizu, Tohru
    Nakamura, Satoshi
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4245 - 4248
  • [36] Application of machine learning models in groundwater quality assessment and prediction: progress and challenges
    Huang, Yanpeng
    Wang, Chao
    Wang, Yuanhao
    Lyu, Guangfeng
    Lin, Sijie
    Liu, Weijiang
    Niu, Haobo
    Hu, Qing
    FRONTIERS OF ENVIRONMENTAL SCIENCE & ENGINEERING, 2024, 18 (03)
  • [37] Evaluation and prediction of groundwater quality for irrigation using regression and machine learning models
    Shaw, Souvick Kumar
    Sharma, Anurag
    WATER QUALITY RESEARCH JOURNAL, 2025, 60 (01) : 260 - 297
  • [38] Groundwater Quality Prediction and Analysis Using Machine Learning Models and Geospatial Technology
    Rammohan, Bommi
    Partheeban, Pachaivannan
    Ranganathan, Ranihemamalini
    Balaraman, Sundarambal
    SUSTAINABILITY, 2024, 16 (22)
  • [39] Comparative Analysis of Full Training Set and Cross-Validation for Machine Learning Approach to Run Smart Wheelchair
    Badajena, Jenamani Chandrakanta
    Sethi, Srinivas
    Sahoo, Ramesh Kumar
    INTELLIGENT SYSTEMS AND APPLICATIONS, ICISA 2022, 2023, 959 : 121 - 132
  • [40] Advanced Machine Learning and Water Quality Index (WQI) Assessment: Evaluating Groundwater Quality at the Yopurga Landfill
    Zheng, Hongmei
    Hou, Shiwei
    Liu, Jing
    Xiong, Yanna
    Wang, Yuxin
    WATER, 2024, 16 (12)