Prediction of flood sensitivity based on Logistic Regression, eXtreme Gradient Boosting, and Random Forest modeling methods

被引:7
|
作者
Wu, Ying [1 ]
Zhang, Zhiming [2 ]
Qi, Xiaotian [1 ]
Hu, Wenhan [1 ]
Si, Shuai [1 ]
机构
[1] Beijing Univ Civil Engn & Architecture, Dept Environm & Energy Engn, 1 Zhanlanguan Rd, Beijing 100044, Peoples R China
[2] Beijing Univ Civil Engn & Architecture, Beijing Climate Change Response Res & Educ Ctr, Sch Environm & Energy Engn, Beijing 100044, Peoples R China
基金
国家重点研发计划;
关键词
eXtreme Gradient Boosting (XGBoost); flood sensitivity assessment; Logistic Regression (LR); Random Forest (RF); DECISION TREE; SUSCEPTIBILITY; ALGORITHMS;
D O I
10.2166/wst.2024.146
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Floods are one of the most destructive disasters that cause loss of life and property worldwide every year. In this study, the aim was to find the best-performing model in flood sensitivity assessment and analyze key characteristic factors, the spatial pattern of flood sensitivity was evaluated using three machine learning (ML) models: Logistic Regression (LR), eXtreme Gradient Boosting (XGBoost), and Random Forest (RF). Suqian City in Jiangsu Province was selected as the study area, and a random sample dataset of historical flood points was constructed. Fifteen different meteorological, hydrological, and geographical spatial variables were considered in the flood sensitivity assessment, 12 variables were selected based on the multi-collinearity study. Among the results of comparing the selected ML models, the RF method had the highest AUC value, accuracy, and comprehensive evaluation effect, and is a reliable and effective flood risk assessment model. As the main output of this study, the flood sensitivity map is divided into five categories, ranging from very low to very high sensitivity. Using the RF model (i.e., the highest accuracy of the model), the high-risk area covers about 44% of the study area, mainly concentrated in the central, eastern, and southern parts of the old city area.
引用
收藏
页码:2605 / 2624
页数:20
相关论文
共 50 条
  • [41] Random Forest and Logistic Regression algorithms for prediction of groundwater contamination using ammonia concentration
    Ahmed Madani
    Mohammed Hagage
    Salwa F. Elbeih
    Arabian Journal of Geosciences, 2022, 15 (20)
  • [42] Comparison of Accuracy Rate in Prediction of Cardiovascular Disease using Random Forest with Logistic Regression
    Vishnuvardhan, Talluri
    Rama, A.
    CARDIOMETRY, 2022, (25): : 1526 - 1531
  • [43] Methods for Identifying SNP Interactions: A Review on Variations of Logic Regression, Random Forest and Bayesian Logistic Regression
    Chen, Carla Chia-Ming
    Schwender, Holger
    Keith, Jonathan
    Nunkesser, Robin
    Mengersen, Kerrie
    Macrossan, Paula
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2011, 8 (06) : 1580 - 1591
  • [44] Prediction and classification of solar photovoltaic power generation using extreme gradient boosting regression model
    Rinesh, S.
    Deepa, S.
    Nandan, R. T.
    Sachin, R. S.
    Thamil, S., V
    Akash, R.
    Arun, M.
    Prajitha, C.
    Kumar, A. P. Senthil
    INTERNATIONAL JOURNAL OF LOW-CARBON TECHNOLOGIES, 2024, 19 : 2420 - 2430
  • [45] Logistic Regression Analysis for LncRNA-Disease Association Prediction Based on Random Forest and Clinical Stage Data
    Wang, Bo
    Zhang, Jing
    IEEE ACCESS, 2020, 8 (08): : 35004 - 35017
  • [46] The Comparative Performance of Logistic Regression and Random Forest in Propensity Score Methods: a Simulation Study
    Ali, M. Sanni
    Khalid, Sara
    Collins, Gary S.
    Prieto-Alhambra, Daniel
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2017, 26 : 489 - 489
  • [47] CLASSIFYING HIGH MEDICAL EXPENDITURE PATIENTS USING LOGISTIC REGRESSION AND RANDOM FOREST METHODS
    Menon, J.
    VALUE IN HEALTH, 2021, 24 : S188 - S189
  • [48] Modeling Road Accident Severity with Comparisons of Logistic Regression, Decision Tree and Random Forest
    Chen, Mu-Ming
    Chen, Mu-Chen
    INFORMATION, 2020, 11 (05)
  • [49] Scenario-Based Real-Time Flood Prediction with Logistic Regression
    Lee, Jaeyeong
    Kim, Byunghyun
    WATER, 2021, 13 (09)
  • [50] Uncertainty Reduction in Flood Susceptibility Mapping Using Random Forest and eXtreme Gradient Boosting Algorithms in Two Tropical Desert Cities, Shibam and Marib, Yemen
    Al-Aizari, Ali R.
    Alzahrani, Hassan
    Althuwaynee, Omar F.
    Al-Masnay, Yousef A.
    Ullah, Kashif
    Park, Hyuck-Jin
    Al-Areeq, Nabil M.
    Rahman, Mahfuzur
    Hazaea, Bashar Y.
    Liu, Xingpeng
    REMOTE SENSING, 2024, 16 (02)