Improving the model robustness of flood hazard mapping based on hyperparameter optimization of random forest

被引:20
|
作者
Liao, Mingyong [1 ]
Wen, Haijia [1 ]
Yang, Ling [2 ]
Wang, Guilin [1 ]
Xiang, Xuekun [1 ,3 ]
Liang, Xiaowen [1 ]
机构
[1] Chongqing Univ, Natl Joint Engn Res Ctr Geohazards Prevent Reservo, Key Lab New Technol Construct Cities Mt Area, Minist Educ,Sch Civil Engn, Chongqing 400045, Peoples R China
[2] Nanjing Normal Univ, Sch Geog, Nanjing 210023, Peoples R China
[3] Chongqing Inst Geol & Mineral Resources, Minist Nat Resources, Technol Innovat Ctr Geohazards Automat Monitoring, Chongqing 401120, Peoples R China
关键词
Flood hazard mapping; Random forest; Hyperparameteroptimization; SHAP; Robust; REMOTE-SENSING DATA; SPATIAL PREDICTION; RISK-ASSESSMENT; SUSCEPTIBILITY ASSESSMENT; CONDITIONING FACTORS; REGRESSION; RESOLUTION; BIVARIATE; COUNTY; CHINA;
D O I
10.1016/j.eswa.2023.122682
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Traditional machine learning algorithms face challenges in assessing flood susceptibility reliably due to their low robustness and the inherent 'black-box' nature. This paper utilizes five hyperparameter optimization algoirthms (HPO), namely grid search (GS), random search (RS), gauss process (GP), tree-structured parzen estimator (TPE) and simulated annealing (SA), to tune the traditional random forest's (RF) hyperparameters to improve the robustness of flood hazard mapping (FHM) models at Ningxiang City Hunan Province, China. Additionally, SHapley Additive exPlanations (SHAP) method were used to interpret the decision-mechanisms of these flood hazard models. This study considers 19 pluvial flood influencing factors and 2064 flood locations to create a geospatial database. The performance of each hybrid model was evaluated by area under the receiver operating characteristic (ROC) curve (AUC) and several validation methods. The results demonstrate that the developed hybrid models demonstrated good performance, with RF-TPE achieving the highest AUC (0.9660), followed by RF-GP (0.9648), RF-SA (0.9624), RF-GS (0.9612), RF-RS (0.9600), and RF (0.9539). The RF-TPE model exhibits superior robustness than other models, and the FHM constructed using it is more reliable. HPO is an effective approach to improve the predictive accuracy and robustness of FHM models. When considering limited computational resources, Bayesian optimization (TPE) should be prioritized for optimizing FHM models, followed by metaheuristic algorithms and model-free algorithms. Moreover, the study revealed that distance from river, peak rainfall intensity, continuous rainfall, antecedent effective rainfall, and terrain relief, are the most significant for pluvial FHM modeling in this region.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] A random forest model of landslide susceptibility mapping based on hyperparameter optimization using Bayes algorithm
    Sun, Deliang
    Wen, Haijia
    Wang, Danzhou
    Xu, Jiahui
    GEOMORPHOLOGY, 2020, 362
  • [2] Flood Hazard Risk Mapping Using a Pseudo Supervised Random Forest
    Esfandiari, Morteza
    Abdi, Ghasem
    Jabari, Shabnam
    McGrath, Heather
    Coleman, David
    REMOTE SENSING, 2020, 12 (19) : 1 - 23
  • [3] Flood hazard risk assessment model based on random forest
    Wang, Zhaoli
    Lai, Chengguang
    Chen, Xiaohong
    Yang, Bing
    Zhao, Shiwei
    Bai, Xiaoyan
    JOURNAL OF HYDROLOGY, 2015, 527 : 1130 - 1141
  • [4] An interpretable model for landslide susceptibility assessment based on Optuna hyperparameter optimization and Random Forest
    Xiao, Xin
    Zou, Yi
    Huang, Jiangcheng
    Luo, Xuan
    Yang, Luyi
    Li, Meng
    Yang, Pengwu
    Ji, Xuan
    Li, Yungang
    GEOMATICS NATURAL HAZARDS & RISK, 2024, 15 (01)
  • [5] Application of Bayesian Hyperparameter Optimized Random Forest and XGBoost Model for Landslide Susceptibility Mapping
    Wang, Shibao
    Zhuang, Jianqi
    Zheng, Jia
    Fan, Hongyu
    Kong, Jiaxu
    Zhan, Jiewei
    FRONTIERS IN EARTH SCIENCE, 2021, 9
  • [6] Prediction for TBM Penetration Rate Using Four Hyperparameter Optimization Methods and Random Forest Model
    Zhang W.
    Tang L.
    Chen F.
    Yang J.
    Yingyong Jichu yu Gongcheng Kexue Xuebao/Journal of Basic Science and Engineering, 2021, 29 (05): : 1186 - 1200
  • [7] A Model-Based Flood Hazard Mapping on the Southern Slope of Himalaya
    Aryal, Dibit
    Wang, Lei
    Adhikari, Tirtha Raj
    Zhou, Jing
    Li, Xiuping
    Shrestha, Maheswor
    Wang, Yuanwei
    Chen, Deliang
    WATER, 2020, 12 (02)
  • [8] Assessment of landslide susceptibility mapping based on Bayesian hyperparameter optimization: A comparison between logistic regression and random forest
    Sun, Deliang
    Xu, Jiahui
    Wen, Haijia
    Wang, Danzhou
    ENGINEERING GEOLOGY, 2021, 281
  • [9] Flood Hazard Rating Prediction for Urban Areas Using Random Forest and LSTM
    Kim, Hyun Il
    Kim, Byung Hyun
    KSCE JOURNAL OF CIVIL ENGINEERING, 2020, 24 (12) : 3884 - 3896
  • [10] An Enhanced Proximal Policy Optimization-Based Reinforcement Learning Method with Random Forest for Hyperparameter Optimization
    Ma, Zhixin
    Cui, Shengmin
    Joe, Inwhee
    APPLIED SCIENCES-BASEL, 2022, 12 (14):