An ensemble optimizer with a stacking ensemble surrogate model for identification of groundwater contamination source

被引:2
作者
Zhu, Liuzhi [1 ,2 ,3 ]
Lu, Wenxi [1 ,2 ,3 ]
Luo, Chengming [1 ,2 ,3 ]
Xu, Yaning [1 ,2 ,3 ]
Wang, Zibo [1 ,2 ,3 ]
机构
[1] Jilin Univ, Key Lab Groundwater Resources & Environm, Minist Educ, Changchun 130021, Peoples R China
[2] Jilin Univ, Jilin Prov Key Lab Water Resources & Environm, Changchun 130021, Peoples R China
[3] Jilin Univ, Coll New Energy & Environm, Changchun 130021, Peoples R China
关键词
Groundwater contamination source; identification; Ensemble optimizer; Stacking ensemble surrogate; Deep learning model; Swarm intelligence optimizer; POLLUTION;
D O I
10.1016/j.jconhyd.2024.104437
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The application of the simulation-optimization method for groundwater contamination source identification (GCSI) encounters two main challenges: the substantial time cost of calling the simulation model, and the limitations on the accuracy of identification results due to the complexity, nonlinearity, and ill-posed nature of the inverse problem. To address these issues, we have innovatively developed an inversion framework based on ensemble learning strategies. This framework comprises a stacking ensemble model (SEM), which integrates three distinct machine learning models (Extremely Randomized Trees, Adaptive Boosting, and Bidirectional Gated Recurrent Unit), and an ensemble optimizer (E-GKSEEFO), which combines two newly proposed swarm intelligence optimizers (Genghis Khan Shark Optimizer and Electric Eel Foraging Optimizer). Specifically, the SEM serves as a surrogate model for the groundwater numerical simulation model. Compared to the original simulation model, it significantly reduces time cost while maintaining accuracy. The E-GKSEEFO, functioning as the search strategy for the optimization model, greatly enhances the accuracy of the optimization results. We have verified the performance of the SEM-E-GKSEEFO ensemble inversion framework through two hypothetical scenarios derived from an actual coal gangue pile. The results are as follows. (1) The SEM exhibits improved fitting performance compared to single machine learning models when dealing with high-dimensional nonlinear data from GCSI. (2) The E-GKSEEFO achieves significantly higher accuracy in the identification results of GCSI than individual optimizers. These findings affirm the effectiveness and superiority of the proposed SEM-EGKSEEFO ensemble inversion framework.
引用
收藏
页数:15
相关论文
共 45 条
[1]  
Bagtzoglou A., 1991, Water Resources Engineering Risk Assessment, P189, DOI DOI 10.1007/978-3-642-76971-9_12
[2]   Dynamic Groundwater Contamination Vulnerability Assessment Techniques: A Systematic Review [J].
Banerjee, Arghadyuti ;
Creedon, Leo ;
Jones, Noelle ;
Gill, Laurence ;
Gharbia, Salem .
HYDROLOGY, 2023, 10 (09)
[3]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[4]  
Cho K., 2014, ARXIV14061078, P1724, DOI [10.3115/v1/D14-1179, DOI 10.3115/V1/D14-1179]
[5]   An extensive experimental survey of regression methods [J].
Fernandez-Delgado, M. ;
Sirsat, M. S. ;
Cernadas, E. ;
Alawadi, S. ;
Barro, S. ;
Febrero-Bande, M. .
NEURAL NETWORKS, 2019, 111 :11-34
[6]  
Freund Y., 1996, Machine Learning. Proceedings of the Thirteenth International Conference (ICML '96), P148
[7]   Ensemble deep learning: A review [J].
Ganaie, M. A. ;
Hu, Minghui ;
Malik, A. K. ;
Tanveer, M. ;
Suganthan, P. N. .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 115
[8]   Extremely randomized trees [J].
Geurts, P ;
Ernst, D ;
Wehenkel, L .
MACHINE LEARNING, 2006, 63 (01) :3-42
[9]   IDENTIFYING SOURCES OF GROUNDWATER POLLUTION - AN OPTIMIZATION APPROACH [J].
GORELICK, SM ;
EVANS, B ;
REMSON, I .
WATER RESOURCES RESEARCH, 1983, 19 (03) :779-790
[10]   Application of machine learning in groundwater quality modeling-A comprehensive review [J].
Haggerty, Ryan ;
Sun, Jianxin ;
Yu, Hongfeng ;
Li, Yusong .
WATER RESEARCH, 2023, 233