A Comparative Flood Susceptibility Assessment in a Norwegian Coastal City Using Feature Selection Methods and Machine Learning Algorithms

被引:1
作者
Lam Van Nguyen [1 ,2 ]
Dieu Tien Bui [3 ]
Seidu, Razak [1 ]
机构
[1] Norwegian Univ Sci & Technol, Smart Water & Environm Engn Grp, Dept Ocean Operat & Civil Engn, Fac Engn, N-6025 Alesund, Norway
[2] Hanoi Univ Min & Geol, Fac Geomat & Land Adm, Dept Geodesy, 18 Vien St, Hanoi 100000, Vietnam
[3] Univ South Eastern Norway, Dept Business & IT, GIS Grp, Gullbringvegen 36, N-3800 Bo In Telemark, Norway
来源
ADVANCES IN RESEARCH ON WATER RESOURCES AND ENVIRONMENTAL SYSTEMS | 2023年
关键词
Flood risk assessment; Logistic regression; Naive Bayes; Support vector machine; Bayesian model averaging; Machine learning; Alesund; Norway; SUPPORT VECTOR MACHINE; HOA BINH PROVINCE; CLIMATE-CHANGE; STEPWISE REGRESSION; PROJECTED INCREASES; SPATIAL PREDICTION; DECISION TREE; EL-NINO; FREQUENCY; RAINFALL;
D O I
10.1007/978-3-031-17808-5_36
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Identification of significant input factors plays a crucial role in optimizing the predictive model performance of floods. A good predictive model is a model that can predict the outcome with the best accuracy but using the least input parameters. In this study, the Bayesian Model Averaging (BMA) method was applied instead of the traditional stepwise regression method for determining flood-conditioning factors. In general, the BMA method had higher accuracy than the stepwise regression method. Fourteen flood conditioning factors were used to compare feature selection capacity using the LR and BMA methods, including altitude, slope, aspect, curvature, normalized difference vegetation index (NDVI), normalized difference snow index (NDSI), rainfall, stream power index (SPI), topographic wetness index (TWI), modified normalized difference water index (MNDWI), distance from rivers (DFR), soil types, land cover, and geology. By applying the BMA method, the input parameters were reduced from 14 to 4 (altitude, slope, rainfall, and NDVI) but the accuracy of predicted models was maintained. This research applied three machine learning algorithms namely Logistic Regression (LR), Naive Bayes (NB), and Support Vector Machine (SVM) using the above significant flood-conditioning factors to produce flood susceptibility maps for the coastal city of Alesund, Norway. A total of 182 flood events were used to develop and evaluate the machine learning models, and the accuracy and performance of predicted models were tested by employing the area under the curve (AUC) and some statistical indices. Among the input factors, altitude and rainfall in Alesund were the most important factors that significantly influenced flood events. The studied results indicated that SVM (AUC = 1.0) was the best algorithm for building flood susceptibility in Alesund, followed by the NB (AUC = 0.996) and LR (AUC = 0.984) algorithms, respectively.
引用
收藏
页码:591 / 618
页数:28
相关论文
共 100 条
[1]   A robust and efficient stepwise regression method for building sparse polynomial chaos expansions [J].
Abraham, Simon ;
Raisee, Mehrdad ;
Ghorbaniasl, Ghader ;
Contino, Francesco ;
Lacor, Chris .
JOURNAL OF COMPUTATIONAL PHYSICS, 2017, 332 :461-474
[2]   Evaluation of flood susceptibility mapping using logistic regression and GIS conditioning factors [J].
Al-Juaidi, Ahmed E. M. ;
Nassar, Ayman M. ;
Al-Juaidi, Omar E. M. .
ARABIAN JOURNAL OF GEOSCIENCES, 2018, 11 (24)
[3]  
[Anonymous], About Us
[4]   What you see may not be what you get: A brief, nontechnical introduction to overfitting in regression-type models [J].
Babyak, MA .
PSYCHOSOMATIC MEDICINE, 2004, 66 (03) :411-421
[5]   GIS Based Hybrid Computational Approaches for Flash Flood Susceptibility Assessment [J].
Binh Thai Pham ;
Avand, Mohammadtaghi ;
Janizadeh, Saeid ;
Tran Van Phong ;
Al-Ansari, Nadhir ;
Lanh Si Ho ;
Das, Sumit ;
Hiep Van Le ;
Amini, Ata ;
Bozchaloei, Saeid Khosrobeigi ;
Jafari, Faeze ;
Prakash, Indra .
WATER, 2020, 12 (03)
[6]   Fuzzy sigmoid kernel for support vector classifiers [J].
Camps-Valls, G ;
Martín-Guerrero, JD ;
Rojo-Alvarez, JL ;
Soria-Olivas, E .
NEUROCOMPUTING, 2004, 62 :501-506
[7]   RESEARCH ON GEOGRAPHICAL ENVIRONMENT UNIT DIVISION BASED ON THE METHOD OF NATURAL BREAKS (JENKS) [J].
Chen, Jian ;
Yang, Shengtian ;
Li, Hongwei ;
Zhang, Bin ;
Lv, Junrong .
ISPRS/IGU/ICA JOINT WORKSHOP ON BORDERLANDS MODELLING AND UNDERSTANDING FOR GLOBAL SUSTAINABILITY 2013, 2013, 40-4-W3 :47-50
[8]   Modeling flood susceptibility using data-driven approaches of naive Bayes tree, alternating decision tree, and random forest methods [J].
Chen, Wei ;
Li, Yang ;
Xue, Weifeng ;
Shahabi, Himan ;
Li, Shaojun ;
Hong, Haoyuan ;
Wang, Xiaojing ;
Bian, Huiyuan ;
Zhang, Shuai ;
Pradhan, Biswajeet ;
Bin Ahmad, Baharin .
SCIENCE OF THE TOTAL ENVIRONMENT, 2020, 701
[9]   An ensemble prediction of flood susceptibility using multivariate discriminant analysis, classification and regression trees, and support vector machines [J].
Choubin, Bahram ;
Moradi, Ehsan ;
Golshan, Mohammad ;
Adamowski, Jan ;
Sajedi-Hosseini, Farzaneh ;
Mosavi, Amir .
SCIENCE OF THE TOTAL ENVIRONMENT, 2019, 651 :2087-2096
[10]   Eigenvector selection with stepwise regression techniques to construct eigenvector spatial filters [J].
Chun, Yongwan ;
Griffith, Daniel A. ;
Lee, Monghyeon ;
Sinha, Parmanand .
JOURNAL OF GEOGRAPHICAL SYSTEMS, 2016, 18 (01) :67-85