Groundwater contaminant source identification based on an ensemble learning search framework associated with an auto xgboost surrogate

被引:18
作者
Pan, Zidong [1 ,2 ,3 ]
Lu, Wenxi [1 ,2 ,3 ]
Wang, Han [1 ,2 ,3 ]
Bai, Yukun [1 ,2 ,3 ]
机构
[1] Jilin Univ, Key Lab Groundwater Resources & Environm, Minist Educ, Changchun 130021, Peoples R China
[2] Jilin Univ, Jilin Prov Key Lab Water Resources & Environm, Changchun 130021, Peoples R China
[3] Jilin Univ, Coll New Energy & Environm, Changchun 130021, Peoples R China
关键词
Auto xgboost; Ensemble learning search; Iterative ensemble smoother; Particle filter; Differential evolution; Swarm evolution algorithm; DATA ASSIMILATION; NEURAL-NETWORKS; KALMAN FILTER; OPTIMIZATION; ALGORITHMS; EVOLUTION; DESIGN; MODELS;
D O I
10.1016/j.envsoft.2022.105588
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Groundwater contaminant source identification (GCSI) is commonly accompanied by search process which tweaks the unknown contaminant source information to match the simulation model outputs with the mea-surements. When solving identification task, search accuracy and time cost have always been challenges that must be tackled. In the present study, a novel ensemble learning search framework associated with auto extreme gradient boosting tree (xgboost) was proposed to solve GCSI. In particular, auto xgboost was employed to reduce the calculation burden caused by repeatedly running simulation model. To promote search efficiency, boosting strategy (BOS) was employed to sequentially concatenate iterative ensemble smoother, differential evolution particle filter (DEPF), and swarm evolution algorithm. The identification results indicated that: 1. Auto xgboost could substitute a numerical simulation model with desired accuracy and expeditious running speed. 2. BOS could achieve better search accuracy, but with the sacrifice of infinitesimal calculated time cost, when compared with bagging strategy.
引用
收藏
页数:14
相关论文
共 57 条
[1]   A State-of-the-Art Survey on Deep Learning Theory and Architectures [J].
Alom, Md Zahangir ;
Taha, Tarek M. ;
Yakopcic, Chris ;
Westberg, Stefan ;
Sidike, Paheding ;
Nasrin, Mst Shamima ;
Hasan, Mahmudul ;
Van Essen, Brian C. ;
Awwal, Abdul A. S. ;
Asari, Vijayan K. .
ELECTRONICS, 2019, 8 (03)
[2]   A hybrid simulation-optimization approach for solving the areal groundwater pollution source identification problems [J].
Ayvaz, M. Tamer .
JOURNAL OF HYDROLOGY, 2016, 538 :161-176
[3]   A linked simulation-optimization model for solving the unknown groundwater pollution source identification problems [J].
Ayvaz, M. Tamer .
JOURNAL OF CONTAMINANT HYDROLOGY, 2010, 117 (1-4) :46-59
[4]   Improving GALDIT-based groundwater vulnerability predictive mapping using coupled resampling algorithms and machine learning models [J].
Barzegar, Rahim ;
Razzagh, Siamak ;
Quilty, John ;
Adamowski, Jan ;
Pour, Homa Kheyrollah ;
Booij, Martijn J. .
JOURNAL OF HYDROLOGY, 2021, 598
[5]   Characterizing an unknown pollution source in groundwater resources systems using PSVM and PNN [J].
Bashi-Azghadi, Seyyed Nasser ;
Kerachian, Reza ;
Bazargan-Lari, Mohammad Reza ;
Solouki, Kazem .
EXPERT SYSTEMS WITH APPLICATIONS, 2010, 37 (10) :7154-7161
[6]   Prediction of sediment heavy metal at the Australian Bays using newly developed hybrid artificial intelligence models [J].
Bhagat, Suraj Kumar ;
Tiyasha, Tiyasha ;
Awadh, Salih Muhammad ;
Tran Minh Tung ;
Jawad, Ali H. ;
Yaseen, Zaher Mundher .
ENVIRONMENTAL POLLUTION, 2021, 268
[7]   XGBoost: A Scalable Tree Boosting System [J].
Chen, Tianqi ;
Guestrin, Carlos .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :785-794
[8]  
Chiu YC, 2014, HYDROGEOL J, V22, P1731, DOI 10.1007/s10040-014-1172-7
[9]   A survey on ensemble learning [J].
Dong, Xibin ;
Yu, Zhiwen ;
Cao, Wenming ;
Shi, Yifan ;
Ma, Qianli .
FRONTIERS OF COMPUTER SCIENCE, 2020, 14 (02) :241-258
[10]   Analysis of iterative ensemble smoothers for solving inverse problems [J].
Evensen, Geir .
COMPUTATIONAL GEOSCIENCES, 2018, 22 (03) :885-908