Prediction of flood risk levels of urban f looded points though using machine learning with unbalanced data

被引:8
|
作者
Wang, Hongfa [1 ,2 ]
Meng, Yu [1 ,2 ]
Xu, Hongshi [1 ,2 ]
Wang, Huiliang [1 ,2 ]
Guan, Xinjian [1 ,2 ]
Liu, Yuan [1 ,2 ]
Liu, Meng [1 ,2 ]
Wu, Zening [1 ,2 ]
机构
[1] Zhengzhou Univ, Sch Water Conservancy & Transportat, Zhengzhou 450001, Henan, Peoples R China
[2] Zhengzhou Univ, Yellow River Lab, Zhengzhou 450001, Henan, Peoples R China
关键词
Flood risk; Borderline-SMOTE; K-Means; Mahalanobis distance; Genetic Algorithm; Random Forest; DECISION-MAKING; CLASSIFICATION; SMOTE;
D O I
10.1016/j.jhydrol.2024.130742
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
With the emphasis on preventing urban flooding and the enhancement of rational urban development, data related to urban flooding are also collected with unbalanced sample size that is a widespread phenomenon in other world fields. The performance of the classification model is compromised by unbalanced datasets, therefore, minority-class samples, floods with higher risk, are often missing alerted or incorrectly warned. To solve this problem, a novel hybrid resampling proposal is proposed in this research proved to be effective for balancing data. First, it optimizes an imbalanced dataset by the Borderline-SMOTE algorithm. Next, alternative datasets are synthesized through under-sampling techniques, whose qualities are evaluated by using information entropy and calculated rely on the k-nearest neighbor entropy estimator. The suggested method not only makes full use of the original data information, but also avoids under-fitting due to the single under-sampling utilization. A practical application in the central area of Zhengzhou, China, combining the resampling proposal and the Random Forest classification model optimized by Genetic Algorithm, the results show that significantly better results are yielded compared without any treatment in terms of all assessment indicators (Accuracy, Recall, G -mean and F1 -score) have been improved.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Urban Flood-Risk Assessment: Integration of Decision-Making and Machine Learning
    Taromideh, Fereshteh
    Fazloula, Ramin
    Choubin, Bahram
    Emadi, Alireza
    Berndtsson, Ronny
    SUSTAINABILITY, 2022, 14 (08)
  • [2] Evaluating urban flood risk using hybrid method of TOPSIS and machine learning
    Rafiei-Sardooi, Elham
    Azareh, Ali
    Choubin, Bahram
    Mosavi, Amir H.
    Clague, John J.
    INTERNATIONAL JOURNAL OF DISASTER RISK REDUCTION, 2021, 66
  • [3] Quantifying urban flood extent using satellite imagery and machine learning
    Composto, Rebecca W.
    Tulbure, Mirela G.
    Tiwari, Varun
    Gaines, Mollie D.
    Caineta, Julio
    NATURAL HAZARDS, 2025, 121 (01) : 175 - 199
  • [4] Urban flood risk mapping using the GARP and QUEST models: A comparative study of machine learning techniques
    Darabi, Hamid
    Choubin, Bahram
    Rahmati, Omid
    Haghighi, Ali Torabi
    Pradhan, Biswajeet
    Klove, Bjorn
    JOURNAL OF HYDROLOGY, 2019, 569 : 142 - 154
  • [5] Flood prediction in urban areas based on machine learning considering the statistical characteristics of rainfall
    Jang, Se-Dong
    Yoo, Jae-Hwan
    Lee, Yeon-Su
    Kim, Byunghyun
    PROGRESS IN DISASTER SCIENCE, 2025, 26
  • [6] A Data Science Methodology Based on Machine Learning Algorithms for Flood Severity Prediction
    Khalaf, Mohammed
    Hussain, Abir Jaafar
    Al-Jumeily, Dhiya
    Baker, Thar
    Keight, Robert
    Lisboa, Paulo
    Fergus, Paul
    Al Kafri, Ala S.
    2018 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2018, : 230 - 237
  • [7] Urban flood hazard mapping using machine learning models: GARP, RF, MaxEnt and NB
    Norallahi, Mahya
    Kaboli, Hesam Seyed
    NATURAL HAZARDS, 2021, 106 (01) : 119 - 137
  • [8] Mercury Prediction in Urban Soils by Remote Sensing and Relief Data Using Machine Learning Techniques
    Suleymanov, Azamat
    Suleymanov, Ruslan
    Kulagin, Andrey
    Yurkevich, Marija
    REMOTE SENSING, 2023, 15 (12)
  • [9] Assessing Surface Water Flood Risks in Urban Areas Using Machine Learning
    Li, Zhufeng
    Liu, Haixing
    Luo, Chunbo
    Fu, Guangtao
    WATER, 2021, 13 (24)
  • [10] Obesity disease risk prediction using machine learning
    Dutta, Raja Ram
    Mukherjee, Indrajit
    Chakraborty, Chinmay
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2024, 19 (4) : 709 - 718