Improving the model robustness of flood hazard mapping based on hyperparameter optimization of random forest

被引:20
|
作者
Liao, Mingyong [1 ]
Wen, Haijia [1 ]
Yang, Ling [2 ]
Wang, Guilin [1 ]
Xiang, Xuekun [1 ,3 ]
Liang, Xiaowen [1 ]
机构
[1] Chongqing Univ, Natl Joint Engn Res Ctr Geohazards Prevent Reservo, Key Lab New Technol Construct Cities Mt Area, Minist Educ,Sch Civil Engn, Chongqing 400045, Peoples R China
[2] Nanjing Normal Univ, Sch Geog, Nanjing 210023, Peoples R China
[3] Chongqing Inst Geol & Mineral Resources, Minist Nat Resources, Technol Innovat Ctr Geohazards Automat Monitoring, Chongqing 401120, Peoples R China
关键词
Flood hazard mapping; Random forest; Hyperparameteroptimization; SHAP; Robust; REMOTE-SENSING DATA; SPATIAL PREDICTION; RISK-ASSESSMENT; SUSCEPTIBILITY ASSESSMENT; CONDITIONING FACTORS; REGRESSION; RESOLUTION; BIVARIATE; COUNTY; CHINA;
D O I
10.1016/j.eswa.2023.122682
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Traditional machine learning algorithms face challenges in assessing flood susceptibility reliably due to their low robustness and the inherent 'black-box' nature. This paper utilizes five hyperparameter optimization algoirthms (HPO), namely grid search (GS), random search (RS), gauss process (GP), tree-structured parzen estimator (TPE) and simulated annealing (SA), to tune the traditional random forest's (RF) hyperparameters to improve the robustness of flood hazard mapping (FHM) models at Ningxiang City Hunan Province, China. Additionally, SHapley Additive exPlanations (SHAP) method were used to interpret the decision-mechanisms of these flood hazard models. This study considers 19 pluvial flood influencing factors and 2064 flood locations to create a geospatial database. The performance of each hybrid model was evaluated by area under the receiver operating characteristic (ROC) curve (AUC) and several validation methods. The results demonstrate that the developed hybrid models demonstrated good performance, with RF-TPE achieving the highest AUC (0.9660), followed by RF-GP (0.9648), RF-SA (0.9624), RF-GS (0.9612), RF-RS (0.9600), and RF (0.9539). The RF-TPE model exhibits superior robustness than other models, and the FHM constructed using it is more reliable. HPO is an effective approach to improve the predictive accuracy and robustness of FHM models. When considering limited computational resources, Bayesian optimization (TPE) should be prioritized for optimizing FHM models, followed by metaheuristic algorithms and model-free algorithms. Moreover, the study revealed that distance from river, peak rainfall intensity, continuous rainfall, antecedent effective rainfall, and terrain relief, are the most significant for pluvial FHM modeling in this region.
引用
收藏
页数:20
相关论文
共 50 条
  • [21] Landslide susceptibility mapping using hybrid random forest with GeoDetector and RFE for factor optimization
    Zhou, Xinzhi
    Wen, Haijia
    Zhang, Yalan
    Xu, Jiahui
    Zhang, Wengang
    GEOSCIENCE FRONTIERS, 2021, 12 (05)
  • [22] Mapping Landslide Hazard Risk Using Random Forest Algorithm in Guixi, Jiangxi, China
    Zhang, Yang
    Wu, Weicheng
    Qin, Yaozu
    Lin, Ziyu
    Zhang, Guiliang
    Chen, Renxiang
    Song, Yong
    Lang, Tao
    Zhou, Xiaoting
    Huangfu, Wenchao
    Ou, Penghui
    Xie, Lifeng
    Huang, Xiaolan
    Peng, Shanling
    Shao, Chongjian
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2020, 9 (11)
  • [23] Flood Hazard Rating Prediction for Urban Areas Using Random Forest and LSTM
    Hyun Il Kim
    Byung Hyun Kim
    KSCE Journal of Civil Engineering, 2020, 24 : 3884 - 3896
  • [24] IDD-HPO: A Proposed Model for Improving Diabetic Detection using Hyperparameter Optimization and Cloud Mapping Storage
    Zaky, Eman H.
    Soliman, Mona M.
    Elkholy, A. K.
    Ghali, Neveen, I
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (08) : 352 - 362
  • [25] Flood susceptibility mapping using AutoML and a deep learning framework with evolutionary algorithms for hyperparameter optimization
    Vincent, Amala Mary
    Parthasarathy, K. S. S.
    Jidesh, P.
    APPLIED SOFT COMPUTING, 2023, 148
  • [26] Reliability Analysis Based on Optimization Random Forest Model and MCMC
    Yang, Fan
    Ren, Jianwei
    CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2020, 125 (02): : 801 - 814
  • [27] Flood Risk Mapping by Remote Sensing Data and Random Forest Technique
    Farhadi, Hadi
    Najafzadeh, Mohammad
    WATER, 2021, 13 (21)
  • [28] Flood hazard mapping in Southern Brazil: a combination of flow frequency analysis and the HAND model
    Speckhann, Gustavo Andrei
    Borges Chaffe, Pedro Luiz
    Goerl, Roberto Fabris
    de Abreu, Janete Josina
    Altamirano Flores, Juan Antonio
    HYDROLOGICAL SCIENCES JOURNAL, 2018, 63 (01) : 87 - 100
  • [29] Forecasting of flash flood susceptibility mapping using random forest regression model and geographic information systems
    Wahba, Mohamed
    Essam, Radwa
    El-Rawy, Mustafa
    Al-Arifi, Nassir
    Abdalla, Fathy
    Elsadek, Wael M.
    HELIYON, 2024, 10 (13)
  • [30] Sampling design optimization for soil mapping with random forest
    Wadoux, Alexandre M. J-C.
    Brus, Dick J.
    Heuvelink, Gerard B. M.
    GEODERMA, 2019, 355