An optimization framework with dimensionality reduction using Markov Chain Monte Carlo and genetic algorithms for groundwater potential assessment

被引:0
作者
Wang, Zitao [1 ,2 ,3 ]
Yue, Chao [1 ,2 ,3 ]
Wang, Jianping [1 ,2 ]
机构
[1] Chinese Acad Sci, Qinghai Inst Salt Lakes, Key Lab Comprehens & Highly Efficient Utilizat Sal, Xining 810008, Peoples R China
[2] Qinghai Prov Key Lab Geol & Environm Salt Lakes, Xining 810008, Peoples R China
[3] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
Groundwater potential assessment; Dimensionality reduction; Genetic algorithm; MCMC; Automated machine learning; JIANGHAN PLAIN; LOGISTIC-REGRESSION; RANDOM FOREST; GIS; VARIABILITY; WEIGHTS; MACHINE; MODELS;
D O I
10.1016/j.asoc.2024.111991
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Limited samples and high-dimensional feature spaces often hinder the accuracy of machine learning (ML) models in regional groundwater potential assessment (GPA). This study proposes a novel framework, the GPA with Dimensionality Optimization (GPADO), that optimizes feature dimension reduction to enhance prediction performance. Taking the Jianghan Basin as an example, data on nine continuous variables and five categorical variables influencing the region's GPA were gathered, expanding the feature set to 37 through One-hot encoding for categorical variables. Three scenarios were devised to assess prediction outcomes following various dimensionality reduction approaches. Comparative analysis revealed that a hybrid dimension reduction method, incorporating both continuous and categorical variables, yielded the highest validation set accuracy. Consequently, genetic algorithm and Markov Chain Monte Carlo methods were employed to determine the optimal solution and uncertainties associated with four unknown parameters: the chosen dimension reduction method for continuous and categorical variables, and the number of dimensions retained. Results indicated that utilizing singular value decomposition to reduce categorical variables to three dimensions, coupled with principal component analysis reducing continuous variables to three dimensions, produced the highest model validation accuracy of 0.834 within the GPADO framework. This optimal configuration facilitated automated ML training, resulting in a final validation set accuracy of 0.851 and a test set accuracy of 0.836. The resulting model provided a more precise spatial distribution of groundwater potential and demonstrated the GPADO framework's effectiveness in improving GPA accuracy, particularly in data-scarce regions. The GPADO framework offers a valuable approach for enhancing GPA studies.
引用
收藏
页数:15
相关论文
共 100 条
  • [1] Predicting groundwater recharge potential zones using geospatial technique
    Akter, Aysha
    Uddin, Abir Md Humam
    Ben Wahid, Khalid
    Ahmed, Shoukat
    [J]. SUSTAINABLE WATER RESOURCES MANAGEMENT, 2020, 6 (02)
  • [2] Space-time modelling of groundwater level and salinity
    Akter, Farzina
    Bishop, Thomas F. A.
    Vervoort, Willem
    [J]. SCIENCE OF THE TOTAL ENVIRONMENT, 2021, 776
  • [3] Spatial mapping of groundwater springs potentiality using grid search-based and genetic algorithm-based support vector regression
    Al-Fugara, A'kif
    Ahmadlou, Mohammad
    Al-Shabeeb, Abdel Rahman
    AlAyyash, Saad
    Al-Amoush, Hani
    Al-Adamat, Rida
    [J]. GEOCARTO INTERNATIONAL, 2022, 37 (01) : 284 - 303
  • [4] Assessment of groundwater potential modeling using support vector machine optimization based on Bayesian multi-objective hyperparameter algorithm
    Anh, Duong Tran
    Pandey, Manish
    Mishra, Varun Narayan
    Singh, Kiran Kumari
    Ahmadi, Kourosh
    Janizadeh, Saeid
    Tran, Thanh Thai
    Linh, Nguyen Thi Thuy
    Dang, Nguyen Mai
    [J]. APPLIED SOFT COMPUTING, 2023, 132
  • [5] Modeling groundwater potential using novel GIS-based machine-learning ensemble techniques
    Arabameri, Alireza
    Pal, Subodh Chandra
    Rezaie, Fatemeh
    Nalivan, Omid Asadi
    Chowdhuri, Indrajit
    Saha, Asish
    Lee, Saro
    Moayedi, Hossein
    [J]. JOURNAL OF HYDROLOGY-REGIONAL STUDIES, 2021, 36
  • [6] An Assessment of Geospatial Analysis Combined with AHP Techniques to Identify Groundwater Potential Zones in the Pudukkottai District, Tamil Nadu, India
    Arumugam, Muruganantham
    Kulandaisamy, Prabakaran
    Karthikeyan, Sivakumar
    Thangaraj, Kongeswaran
    Senapathi, Venkatramanan
    Chung, Sang Yong
    Muthuramalingam, Subagunasekar
    Rajendran, Muthuramalingam
    Sugumaran, Sathish
    Manimuthu, Siva
    [J]. WATER, 2023, 15 (06)
  • [7] Evaluation of Sanliurfa Karakopru basin groundwater potential for sustainability with GIS-based AHP and TOPSIS methods
    Aslan, Veysel
    [J]. SUSTAINABLE WATER RESOURCES MANAGEMENT, 2023, 9 (03)
  • [8] Groundwater Potential Mapping in Hubei Region of China Using Machine Learning, Ensemble Learning, Deep Learning and AutoML Methods
    Bai, Zhigang
    Liu, Qimeng
    Liu, Yu
    [J]. NATURAL RESOURCES RESEARCH, 2022, 31 (05) : 2549 - 2569
  • [9] Application of e-TOPSIS for Ground Water Potentiality Zonation using Morphometric Parameters and Geospatial Technology of Vanvate Lui Basin, Mizoram, NE India
    Barman, Jonmenjoy
    Biswas, Brototi
    [J]. JOURNAL OF THE GEOLOGICAL SOCIETY OF INDIA, 2022, 98 (10) : 1385 - 1394
  • [10] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32