A Comparative Flood Susceptibility Assessment in a Norwegian Coastal City Using Feature Selection Methods and Machine Learning Algorithms

被引:1
作者
Lam Van Nguyen [1 ,2 ]
Dieu Tien Bui [3 ]
Seidu, Razak [1 ]
机构
[1] Norwegian Univ Sci & Technol, Smart Water & Environm Engn Grp, Dept Ocean Operat & Civil Engn, Fac Engn, N-6025 Alesund, Norway
[2] Hanoi Univ Min & Geol, Fac Geomat & Land Adm, Dept Geodesy, 18 Vien St, Hanoi 100000, Vietnam
[3] Univ South Eastern Norway, Dept Business & IT, GIS Grp, Gullbringvegen 36, N-3800 Bo In Telemark, Norway
来源
ADVANCES IN RESEARCH ON WATER RESOURCES AND ENVIRONMENTAL SYSTEMS | 2023年
关键词
Flood risk assessment; Logistic regression; Naive Bayes; Support vector machine; Bayesian model averaging; Machine learning; Alesund; Norway; SUPPORT VECTOR MACHINE; HOA BINH PROVINCE; CLIMATE-CHANGE; STEPWISE REGRESSION; PROJECTED INCREASES; SPATIAL PREDICTION; DECISION TREE; EL-NINO; FREQUENCY; RAINFALL;
D O I
10.1007/978-3-031-17808-5_36
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Identification of significant input factors plays a crucial role in optimizing the predictive model performance of floods. A good predictive model is a model that can predict the outcome with the best accuracy but using the least input parameters. In this study, the Bayesian Model Averaging (BMA) method was applied instead of the traditional stepwise regression method for determining flood-conditioning factors. In general, the BMA method had higher accuracy than the stepwise regression method. Fourteen flood conditioning factors were used to compare feature selection capacity using the LR and BMA methods, including altitude, slope, aspect, curvature, normalized difference vegetation index (NDVI), normalized difference snow index (NDSI), rainfall, stream power index (SPI), topographic wetness index (TWI), modified normalized difference water index (MNDWI), distance from rivers (DFR), soil types, land cover, and geology. By applying the BMA method, the input parameters were reduced from 14 to 4 (altitude, slope, rainfall, and NDVI) but the accuracy of predicted models was maintained. This research applied three machine learning algorithms namely Logistic Regression (LR), Naive Bayes (NB), and Support Vector Machine (SVM) using the above significant flood-conditioning factors to produce flood susceptibility maps for the coastal city of Alesund, Norway. A total of 182 flood events were used to develop and evaluate the machine learning models, and the accuracy and performance of predicted models were tested by employing the area under the curve (AUC) and some statistical indices. Among the input factors, altitude and rainfall in Alesund were the most important factors that significantly influenced flood events. The studied results indicated that SVM (AUC = 1.0) was the best algorithm for building flood susceptibility in Alesund, followed by the NB (AUC = 0.996) and LR (AUC = 0.984) algorithms, respectively.
引用
收藏
页码:591 / 618
页数:28
相关论文
共 100 条
[71]  
Singh Gurinder, 2019, 2019 International Conference on Automation, Computational and Technology Management (ICACTM). Proceedings, P593, DOI 10.1109/ICACTM.2019.8776800
[72]   Comparative analysis of regression and machine learning methods for predicting fault proneness models [J].
Singh, Yogesh ;
Kaur, Arvinder ;
Malhotra, Ruchika .
INTERNATIONAL JOURNAL OF COMPUTER APPLICATIONS IN TECHNOLOGY, 2009, 35 (2-4) :183-193
[73]   Rain-on-snow induced flood events in Southern Germany [J].
Sui, J ;
Koehler, G .
JOURNAL OF HYDROLOGY, 2001, 252 (1-4) :205-220
[74]   Improving Estimation of Cropland Evapotranspiration by the Bayesian Model Averaging Method with Surface Energy Balance Models [J].
Sun, Huaiwei ;
Yang, Yong ;
Wu, Ruiying ;
Gui, Dongwei ;
Xue, Jie ;
Liu, Yi ;
Yan, Dong .
ATMOSPHERE, 2019, 10 (04)
[75]   Least squares support vector machine classifiers [J].
Suykens, JAK ;
Vandewalle, J .
NEURAL PROCESSING LETTERS, 1999, 9 (03) :293-300
[76]  
Talha S., 2019, INT ARCH PHOTOGRAMME, V42, P407, DOI [https://doi.org/10.5194/isprs-archives-XLII-4-W19-407-2019, DOI 10.5194/ISPRS-ARCHIVES-XLII-4-W19-407-2019, 10.5194/isprsarchives-XLII-4, DOI 10.5194/ISPRS-ARCHIVES]
[77]  
Tang R, 2020, 2020 5TH IEEE INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS (IEEE ICBDA 2020), P80, DOI 10.1109/ICBDA49040.2020.9101199
[78]   Flood susceptibility assessment based on a novel random Naive Bayes method: A comparison between different factor discretization methods [J].
Tang, Xianzhe ;
Li, Jiufeng ;
Liu, Minnan ;
Liu, Wei ;
Hong, Haoyuan .
CATENA, 2020, 190
[79]   Identifying the essential flood conditioning factors for flood prone area mapping using machine learning techniques [J].
Tehrany, Mahyat Shafapour ;
Jones, Simon ;
Shabani, Farzin .
CATENA, 2019, 175 :174-192
[80]   Flood susceptibility assessment using GIS-based support vector machine model with different kernel types [J].
Tehrany, Mahyat Shafapour ;
Pradhan, Biswajeet ;
Mansor, Shattri ;
Ahmad, Noordin .
CATENA, 2015, 125 :91-101