Improving the model robustness of flood hazard mapping based on hyperparameter optimization of random forest

被引:20
作者
Liao, Mingyong [1 ]
Wen, Haijia [1 ]
Yang, Ling [2 ]
Wang, Guilin [1 ]
Xiang, Xuekun [1 ,3 ]
Liang, Xiaowen [1 ]
机构
[1] Chongqing Univ, Natl Joint Engn Res Ctr Geohazards Prevent Reservo, Key Lab New Technol Construct Cities Mt Area, Minist Educ,Sch Civil Engn, Chongqing 400045, Peoples R China
[2] Nanjing Normal Univ, Sch Geog, Nanjing 210023, Peoples R China
[3] Chongqing Inst Geol & Mineral Resources, Minist Nat Resources, Technol Innovat Ctr Geohazards Automat Monitoring, Chongqing 401120, Peoples R China
关键词
Flood hazard mapping; Random forest; Hyperparameteroptimization; SHAP; Robust; REMOTE-SENSING DATA; SPATIAL PREDICTION; RISK-ASSESSMENT; SUSCEPTIBILITY ASSESSMENT; CONDITIONING FACTORS; REGRESSION; RESOLUTION; BIVARIATE; COUNTY; CHINA;
D O I
10.1016/j.eswa.2023.122682
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Traditional machine learning algorithms face challenges in assessing flood susceptibility reliably due to their low robustness and the inherent 'black-box' nature. This paper utilizes five hyperparameter optimization algoirthms (HPO), namely grid search (GS), random search (RS), gauss process (GP), tree-structured parzen estimator (TPE) and simulated annealing (SA), to tune the traditional random forest's (RF) hyperparameters to improve the robustness of flood hazard mapping (FHM) models at Ningxiang City Hunan Province, China. Additionally, SHapley Additive exPlanations (SHAP) method were used to interpret the decision-mechanisms of these flood hazard models. This study considers 19 pluvial flood influencing factors and 2064 flood locations to create a geospatial database. The performance of each hybrid model was evaluated by area under the receiver operating characteristic (ROC) curve (AUC) and several validation methods. The results demonstrate that the developed hybrid models demonstrated good performance, with RF-TPE achieving the highest AUC (0.9660), followed by RF-GP (0.9648), RF-SA (0.9624), RF-GS (0.9612), RF-RS (0.9600), and RF (0.9539). The RF-TPE model exhibits superior robustness than other models, and the FHM constructed using it is more reliable. HPO is an effective approach to improve the predictive accuracy and robustness of FHM models. When considering limited computational resources, Bayesian optimization (TPE) should be prioritized for optimizing FHM models, followed by metaheuristic algorithms and model-free algorithms. Moreover, the study revealed that distance from river, peak rainfall intensity, continuous rainfall, antecedent effective rainfall, and terrain relief, are the most significant for pluvial FHM modeling in this region.
引用
收藏
页数:20
相关论文
共 50 条
  • [41] FLOOD SUSCEPTIBILITY MAPPING AND ASSESSMENT USING REGULARIZED RANDOM FOREST AND NAIVE BAYES ALGORITHMS
    Habibi, A.
    Delavar, M. R.
    Sadeghian, M. S.
    Nazari, B.
    ISPRS GEOSPATIAL CONFERENCE 2022, JOINT 6TH SENSORS AND MODELS IN PHOTOGRAMMETRY AND REMOTE SENSING, SMPR/4TH GEOSPATIAL INFORMATION RESEARCH, GIRESEARCH CONFERENCES, VOL. 10-4, 2023, : 241 - 248
  • [42] An LP-based hyperparameter optimization model for language modeling
    Rahnama, Amir Hossein Akhavan
    Toloo, Mehdi
    Zaidenberg, Nezer Jacob
    JOURNAL OF SUPERCOMPUTING, 2018, 74 (05) : 2151 - 2160
  • [43] Prediction of respiratory diseases based on random forest model
    Yang, Xiaotong
    Li, Yi
    Liu, Lang
    Zang, Zengliang
    FRONTIERS IN PUBLIC HEALTH, 2025, 13
  • [44] Flood risk assessment and mapping based on a modified multi-parameter flood hazard index model in the Guanzhong Urban Area, China
    Dou, Xinyi
    Song, Jinxi
    Wang, Liping
    Tang, Bin
    Xu, Shaofeng
    Kong, Feihe
    Jiang, Xiaohui
    STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2018, 32 (04) : 1131 - 1146
  • [45] Slope-Unit Scale Landslide Susceptibility Mapping Based on the Random Forest Model in Deep Valley Areas
    Deng, Hui
    Wu, Xiantan
    Zhang, Wenjiang
    Liu, Yansong
    Li, Weile
    Li, Xiangyu
    Zhou, Ping
    Zhuo, Wenhao
    REMOTE SENSING, 2022, 14 (17)
  • [46] Optimizing Public Grievance Detection Accuracy Through Hyperparameter Tuning of Random Forest and Hybrid Model
    Shah, Khushboo
    Joshi, Hardik
    Joshi, Hiren
    SOFT COMPUTING AND ITS ENGINEERING APPLICATIONS, ICSOFTCOMP 2022, 2023, 1788 : 463 - 476
  • [47] An LP-based hyperparameter optimization model for language modeling
    Amir Hossein Akhavan Rahnama
    Mehdi Toloo
    Nezer Jacob Zaidenberg
    The Journal of Supercomputing, 2018, 74 : 2151 - 2160
  • [48] An empirical flood fatality model for Italy using random forest algorithm
    Yazdani, Mina
    Gencarelli, Christian N.
    Salvati, Paola
    Molinari, Daniela
    INTERNATIONAL JOURNAL OF DISASTER RISK REDUCTION, 2023, 98
  • [49] A new approach based on biology-inspired metaheuristic algorithms in combination with random forest to enhance the flood susceptibility mapping
    Razavi-Termeh, Seyed Vahid
    Sadeghi-Niaraki, Abolghasem
    Choi, Soo-Mi
    JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2023, 345
  • [50] Flood Mapping Based on Multiple Endmember Spectral Mixture Analysis and Random Forest Classifier-The Case of Yuyao, China
    Feng, Quanlong
    Gong, Jianhua
    Liu, Jiantao
    Li, Yi
    REMOTE SENSING, 2015, 7 (09) : 12539 - 12562