Flood Susceptibility Assessment with Random Sampling Strategy in Ensemble Learning (RF and XGBoost)

被引:22
作者
Ren, Hancheng [1 ,2 ]
Pang, Bo [1 ,2 ]
Bai, Ping [3 ]
Zhao, Gang [4 ]
Liu, Shu [5 ,6 ]
Liu, Yuanyuan [5 ,6 ]
Li, Min [5 ,6 ]
机构
[1] Beijing Normal Univ, Coll Water Sci, Beijing 100875, Peoples R China
[2] Beijing Key Lab Urban Hydrol Cycle & Sponge City T, Beijing 100875, Peoples R China
[3] Kunming Flood Control & Drought Relief Headquarter, Kunming 650000, Peoples R China
[4] Univ Tokyo, Inst Ind Sci, Tokyo 1538505, Japan
[5] China Inst Water Resources & Hydropower Res, Beijing 100038, Peoples R China
[6] Minist Water Resources, Res Ctr Flood & Drought Disaster Reduct, Beijing 100038, Peoples R China
关键词
flood susceptibility; ensemble learning; random sampling strategies; mountainous urban areas; Artificial Neural Network (ANN); Support Vector Machine (SVM); IMPACT; AREAS; MODELS; INDEX;
D O I
10.3390/rs16020320
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Due to the complex interaction of urban and mountainous floods, assessing flood susceptibility in mountainous urban areas presents a challenging task in environmental research and risk analysis. Data-driven machine learning methods can evaluate flood susceptibility in mountainous urban areas lacking essential hydrological data, utilizing remote sensing data and limited historical inundation records. In this study, two ensemble learning algorithms, Random Forest (RF) and XGBoost, were adopted to assess the flood susceptibility of Kunming, a typical mountainous urban area prone to severe flood disasters. A flood inventory was created using flood observations from 2018 to 2022. The spatial database included 10 explanatory factors, encompassing climatic, geomorphic, and anthropogenic factors. Artificial Neural Network (ANN) and Support Vector Machine (SVM) were selected for model comparison. To minimize the influence of expert opinions on model training, this study employed a strategy of uniformly random sampling in historically non-flooded areas for negative sample selection. The results demonstrated that (1) ensemble learning algorithms offer higher accuracy than other machine learning methods, with RF achieving the highest accuracy, evidenced by an area under the curve (AUC) of 0.87, followed by XGBoost at 0.84, surpassing both ANN (0.83) and SVM (0.82); (2) the interpretability of ensemble learning highlighted the differences in the potential distribution of the training data's positive and negative samples. Feature importance in ensemble learning can be utilized to minimize human bias in the collection of flooded-site samples, more targeted flood susceptibility maps of the study area's road network were obtained; and (3) ensemble learning algorithms exhibited greater stability and robustness in datasets with varied negative samples, as evidenced by their performance in F1-Score, Kappa, and AUC metrics. This paper further substantiates the superiority of ensemble learning in flood susceptibility assessment tasks from the perspectives of accuracy, interpretability, and robustness, enhances the understanding of the impact of negative samples on such assessments, and optimizes the specific process for urban flood susceptibility assessment using data-driven methods.
引用
收藏
页数:18
相关论文
共 66 条
  • [31] A new approach to flood susceptibility assessment in data-scarce and ungauged regions based on GIS-based hybrid multi criteria decision-making method
    Kanani-Sadat, Yousef
    Arabsheibani, Reza
    Karimipour, Farid
    Nasseri, Mohsen
    [J]. JOURNAL OF HYDROLOGY, 2019, 572 : 17 - 31
  • [32] A GIS-Based Artificial Neural Network Model for Flood Susceptibility Assessment
    Khoirunisa, Nanda
    Ku, Cheng-Yu
    Liu, Chih-Yu
    [J]. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2021, 18 (03) : 1 - 20
  • [33] A comparative assessment of flood susceptibility modeling using Multi-Criteria Decision-Making Analysis and Machine Learning Methods
    Khosravi, Khabat
    Shahabi, Himan
    Binh Thai Pham
    Adamowski, Jan
    Shirzadi, Ataollah
    Pradhan, Biswajeet
    Dou, Jie
    Ly, Hai-Bang
    Grof, Gyula
    Huu Loc Ho
    Hong, Haoyuan
    Chapi, Kamran
    Prakash, Indra
    [J]. JOURNAL OF HYDROLOGY, 2019, 573 : 311 - 323
  • [34] Environmental disaster and public rescue: A social media perspective
    Li, Lei
    Du, Yufei
    Ma, Shaojun
    Ma, Xiaoyu
    Zheng, Yilin
    Han, Xu
    [J]. ENVIRONMENTAL IMPACT ASSESSMENT REVIEW, 2023, 100
  • [35] Visual Classification: Expert Knowledge Guides Machine Learning
    MacInnes, Joseph
    Santosa, Stephanie
    Wright, William
    [J]. IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2010, 30 (01) : 8 - 14
  • [36] Application of machine learning algorithms for flood susceptibility assessment and risk management
    Madhuri, R.
    Sistla, S.
    Srinivasa Raju, K.
    [J]. JOURNAL OF WATER AND CLIMATE CHANGE, 2021, 12 (06) : 2608 - 2623
  • [37] Mallick R B., 2018, Geotechnics for Natural and Engineered Sustainable Technologies, DOI DOI 10.1007/978-981-10-7721-0_23
  • [38] TOPOGRAPHIC DISTANCE AND WATERSHED LINES
    MEYER, F
    [J]. SIGNAL PROCESSING, 1994, 38 (01) : 113 - 125
  • [39] The impacts of urbanisation and climate change on urban flooding and urban water quality: A review of the evidence concerning the United Kingdom
    Miller, James D.
    Hutchins, Michael
    [J]. JOURNAL OF HYDROLOGY-REGIONAL STUDIES, 2017, 12 : 345 - 362
  • [40] Performance of the flood models in different topographies
    Moghim, Sanaz
    Gharehtoragh, Mohammad Ahmadi
    Safaie, Ammar
    [J]. JOURNAL OF HYDROLOGY, 2023, 620