Improving the model robustness of flood hazard mapping based on hyperparameter optimization of random forest

被引:20
作者
Liao, Mingyong [1 ]
Wen, Haijia [1 ]
Yang, Ling [2 ]
Wang, Guilin [1 ]
Xiang, Xuekun [1 ,3 ]
Liang, Xiaowen [1 ]
机构
[1] Chongqing Univ, Natl Joint Engn Res Ctr Geohazards Prevent Reservo, Key Lab New Technol Construct Cities Mt Area, Minist Educ,Sch Civil Engn, Chongqing 400045, Peoples R China
[2] Nanjing Normal Univ, Sch Geog, Nanjing 210023, Peoples R China
[3] Chongqing Inst Geol & Mineral Resources, Minist Nat Resources, Technol Innovat Ctr Geohazards Automat Monitoring, Chongqing 401120, Peoples R China
关键词
Flood hazard mapping; Random forest; Hyperparameteroptimization; SHAP; Robust; REMOTE-SENSING DATA; SPATIAL PREDICTION; RISK-ASSESSMENT; SUSCEPTIBILITY ASSESSMENT; CONDITIONING FACTORS; REGRESSION; RESOLUTION; BIVARIATE; COUNTY; CHINA;
D O I
10.1016/j.eswa.2023.122682
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Traditional machine learning algorithms face challenges in assessing flood susceptibility reliably due to their low robustness and the inherent 'black-box' nature. This paper utilizes five hyperparameter optimization algoirthms (HPO), namely grid search (GS), random search (RS), gauss process (GP), tree-structured parzen estimator (TPE) and simulated annealing (SA), to tune the traditional random forest's (RF) hyperparameters to improve the robustness of flood hazard mapping (FHM) models at Ningxiang City Hunan Province, China. Additionally, SHapley Additive exPlanations (SHAP) method were used to interpret the decision-mechanisms of these flood hazard models. This study considers 19 pluvial flood influencing factors and 2064 flood locations to create a geospatial database. The performance of each hybrid model was evaluated by area under the receiver operating characteristic (ROC) curve (AUC) and several validation methods. The results demonstrate that the developed hybrid models demonstrated good performance, with RF-TPE achieving the highest AUC (0.9660), followed by RF-GP (0.9648), RF-SA (0.9624), RF-GS (0.9612), RF-RS (0.9600), and RF (0.9539). The RF-TPE model exhibits superior robustness than other models, and the FHM constructed using it is more reliable. HPO is an effective approach to improve the predictive accuracy and robustness of FHM models. When considering limited computational resources, Bayesian optimization (TPE) should be prioritized for optimizing FHM models, followed by metaheuristic algorithms and model-free algorithms. Moreover, the study revealed that distance from river, peak rainfall intensity, continuous rainfall, antecedent effective rainfall, and terrain relief, are the most significant for pluvial FHM modeling in this region.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] Random forest forest solar power forecast based on classification optimization
    Liu, Da
    Sun, Kun
    ENERGY, 2019, 187
  • [32] Comparing performance of random forest and adaptive neuro-fuzzy inference system data mining models for flood susceptibility mapping
    Vafakhah, Mehdi
    Loor, Sajad Mohammad Hasani
    Pourghasemi, Hamidreza
    Katebikord, Azadeh
    ARABIAN JOURNAL OF GEOSCIENCES, 2020, 13 (11)
  • [33] A Novel HashedNets Model Based on the Efficient Hyperparameter Optimization
    Fang, Qin
    Chen, Jianxia
    Ma, Zhongbao
    Li, Chao
    Zhang, Jie
    Chen, Yixin
    Lv, Qiang
    2017 4TH INTERNATIONAL CONFERENCE ON SYSTEMS AND INFORMATICS (ICSAI), 2017, : 1146 - 1151
  • [34] A new strategy for spatial predictive mapping of mineral prospectivity: Automated hyperparameter tuning of random forest approach
    Daviran, Mehrdad
    Maghsoudi, Abbas
    Ghezelbash, Reza
    Pradhan, Biswajeet
    COMPUTERS & GEOSCIENCES, 2021, 148
  • [35] ROBUSTNESS ANALYSIS AND IMPROVEMENT OF FAULT DIAGNOSIS MODEL FOR NUCLEAR POWER PLANTS BASED ON RANDOM FOREST
    Li, Jiangkuan
    Lin, Meng
    PROCEEDINGS OF 2021 28TH INTERNATIONAL CONFERENCE ON NUCLEAR ENGINEERING (ICONE28), VOL 4, 2021,
  • [36] Sustainability-Based Flood Hazard Mapping of the Swannanoa River Watershed
    Ahmadisharaf, Ebrahim
    Kalyanapu, Alfred J.
    Chung, Eun-Sung
    SUSTAINABILITY, 2017, 9 (10)
  • [37] A Super-Resolution land cover mapping Based on a Random Forest and Markov Random Field model
    Sanpayao, Manatsawee
    Kasetkasem, Teerasit
    Isshiki, Tsuyoshi
    Rakwatin, Preesan
    Chanwimaluang, Thitiporn
    2017 14TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING/ELECTRONICS, COMPUTER, TELECOMMUNICATIONS AND INFORMATION TECHNOLOGY (ECTI-CON), 2017, : 553 - 556
  • [38] Improving flood hazard datasets using a low-complexity, probabilistic floodplain mapping approach
    Diehl, Rebecca M.
    Gourevitch, Jesse D.
    Drago, Stephanie
    Wemple, Beverley C.
    PLOS ONE, 2021, 16 (03):
  • [39] Flood risk assessment and mapping based on a modified multi-parameter flood hazard index model in the Guanzhong Urban Area, China
    Xinyi Dou
    Jinxi Song
    Liping Wang
    Bin Tang
    Shaofeng Xu
    Feihe Kong
    Xiaohui Jiang
    Stochastic Environmental Research and Risk Assessment, 2018, 32 : 1131 - 1146
  • [40] Forest fire mapping: a comparison between GIS-based random forest and Bayesian models
    Noroozi, Farzaneh
    Ghanbarian, Gholamabbas
    Safaeian, Roja
    Pourghasemi, Hamid Reza
    NATURAL HAZARDS, 2024, 120 (07) : 6569 - 6592