XGBoost-based method for flash flood risk assessment

被引:236
作者
Ma, Meihong [1 ]
Zhao, Gang [2 ]
He, Bingshun [3 ]
Li, Qing [3 ]
Dong, Haoyue [1 ]
Wang, Shenggang [4 ]
Wang, Zhongliang [1 ,5 ]
机构
[1] Tianjin Normal Univ, Sch Geog & Environm Sci, Tianjin 300387, Peoples R China
[2] Univ Bristol, Sch Geog Sci, Bristol BS8 1SS, Avon, England
[3] China Inst Water Resources & Hydropower Res, Beijing 100038, Peoples R China
[4] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100101, Peoples R China
[5] Tianjin Chengjian Univ, Tianjin Key Lab Aquat Sci & Technol, Tianjin, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Flash flood; Risk assessment; XGBoost; Yunnan; SUPPORT VECTOR REGRESSION; REMOTE-SENSING DATA; SUSCEPTIBILITY; MULTISENSOR; MODELS;
D O I
10.1016/j.jhydrol.2021.126382
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Flash flood risk assessment, a widely applied technology in preventing catastrophic flash flood disasters, has become the current research hotspot. However, most existing machine learning methods for assessing flash flood risk rely on a single classifier, which is suitable for processing small sets of sample data, but the resulting prediction accuracy and generalization ability are insufficient. Meanwhile, machine learning methods that integrate multiple classifiers are thus far unknown. Extreme Gradient Boosting (XGBoost) is an excellent algorithm for ensemble learning methods which has achieved remarkable results in many fields. It not only optimizes the algorithm but also automatically applies the CPU's multi-threading to perform parallel calculations, thus greatly improving the model training speed and prediction accuracy. Therefore, this article introduces the XGBoost model for the assessment of flash flood risk, and then combines the two input strategies and the Least squares support vector machine (LSSVM) model to verify its optimal effect, thus proposing the XGBoost-based method for flash flood risk assessment. Subsequently, an attribution analysis was implemented to assess the possible errors of this approach; and finally, a county-level flash flood risk map for Yunnan Province, China, was generated based on the proposed method. The results demonstrate that: (1) XGBoost performs well, with an accuracy of 0.84 in the testing period, and its five indices (precision, recall, accuracy, kappa, and F-score) are all higher than those of LSSVM. (2) The XGBoost-based approach provided the reliable flash flood risk maps, which were validated by another flash flood inventory, although some errors may be attributed to critical environmental factors and statistical disaster location accuracy. (3) The high-risk counties (including high-risk and highest-risk) accounted for 40.3%, with the highest-risk counties mainly concentrated in southeastern Yunnan. This article further addresses the limitations of XGBoost (e.g., as a time-consuming greedy algorithm, the non-necessity of multi-threaded optimization). All of the above results indicate that the XGBoost-based method is an effective method for obtaining high-quality county-level flash flood risk maps, which contributes to the theoretical basis for ongoing county-level flash flood prevention in China.
引用
收藏
页数:12
相关论文
共 51 条
  • [1] Abhishek G., 2018, NAT HAZARDS, P1
  • [2] Testing empirical and synthetic flood damage models: the case of Italy
    Amadio, Mattia
    Scorzini, Anna Rita
    Carisi, Francesca
    Essenfelder, Arthur H.
    Domeneghetti, Alessio
    Mysiak, Jaroslav
    Castellarin, Attilio
    [J]. NATURAL HAZARDS AND EARTH SYSTEM SCIENCES, 2019, 19 (03) : 661 - 678
  • [3] Baily B., 2016, NATURAL HAZARDS EART
  • [4] Berkhahn S., 2019, J HYDROL
  • [5] Changing climate both increases and decreases European river floods
    Bloeschl, Guenter
    Hall, Julia
    Viglione, Alberto
    Perdigao, Rui A. P.
    Parajka, Juraj
    Merz, Bruno
    Lun, David
    Arheimer, Berit
    Aronica, Giuseppe T.
    Bilibashi, Ardian
    Bohac, Milon
    Bonacci, Ognjen
    Borga, Marco
    Canjevac, Ivan
    Castellarin, Attilio
    Chirico, Giovanni B.
    Claps, Pierluigi
    Frolova, Natalia
    Ganora, Daniele
    Gorbachova, Liudmyla
    Gul, Ali
    Hannaford, Jamie
    Harrigan, Shaun
    Kireeva, Maria
    Kiss, Andrea
    Kjeldsen, Thomas R.
    Kohnova, Silvia
    Koskela, Jarkko J.
    Ledvinka, Ondrej
    Macdonald, Neil
    Mavrova-Guirguinova, Maria
    Mediero, Luis
    Merz, Ralf
    Molnar, Peter
    Montanari, Alberto
    Murphy, Conor
    Osuch, Marzena
    Ovcharuk, Valeryia
    Radevski, Ivan
    Salinas, Jose L.
    Sauquet, Eric
    Sraj, Mojca
    Szolgay, Jan
    Volpi, Elena
    Wilson, Donna
    Zaimi, Klodian
    Zivkovic, Nenad
    [J]. NATURE, 2019, 573 (7772) : 108 - +
  • [6] Flash flood forecasting, warning and risk management: the HYDRATE project
    Borga, M.
    Anagnostou, E. N.
    Bloeschl, G.
    Creutin, J. -D.
    [J]. ENVIRONMENTAL SCIENCE & POLICY, 2011, 14 (07) : 834 - 844
  • [7] Budholiya K., 2020, J KING SAUD UNIV-COM
  • [8] Multi-geohazards susceptibility mapping based on machine learning-a case study in Jiuzhaigou, China
    Cao, Juan
    Zhang, Zhao
    Du, Jie
    Zhang, Liangliang
    Song, Yun
    Sun, Geng
    [J]. NATURAL HAZARDS, 2020, 102 (03) : 851 - 871
  • [9] XGBoost: A Scalable Tree Boosting System
    Chen, Tianqi
    Guestrin, Carlos
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 785 - 794
  • [10] CONUS-Wide Evaluation of National Weather Service Flash Flood Guidance Products
    Clark, Robert A.
    Gourley, Jonathan J.
    Flamig, Zachary L.
    Hong, Yang
    Clark, Edward
    [J]. WEATHER AND FORECASTING, 2014, 29 (02) : 377 - 392