Extreme flash flood susceptibility mapping using a novel PCA-based model stacking approach

被引:8
作者
Shojaeian, Amirreza [1 ]
Shafizadeh-Moghadam, Hossein [2 ]
Sharafati, Ahmad [1 ]
Shahabi, Himan [3 ,4 ]
机构
[1] Islamic Azad Univ, Dept Civil Engn, Sci & Res Branch, Tehran, Iran
[2] Tarbiat Modares Univ, Dept Water Engn & Management, Tehran, Iran
[3] Univ Kurdistan, Fac Nat Resources, Dept Geomorphol, Sanandaj 6617715175, Iran
[4] Silesian Tech Univ, Inst Phys, Div Geochronol & Environm Isotopes, PL-44100 Gliwice, Poland
关键词
Machine learning; Meta-model; Model integration; PCA; Karkheh Basin; REGRESSION; CLASSIFICATION; PREDICTION; NETWORKS; AREAS;
D O I
10.1016/j.asr.2024.08.004
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
This study introduces an efficient methodology for model stacking, incorporating six diverse machine learning and statistical models alongside principal component analysis (PCA). The approach is applied for the flash flood susceptibility mapping within the Karkheh Basin in Iran. The selected models include random forest (RF), boosted regression trees (BRT), support vector machine (SVM), artificial neural networks (ANN), generalized additive model (GAM), and the least absolute shrinkage and selection operator (Lasso), with RF also serving as the meta-model for the stacking. The results revealed significant correlations among the predictions of the individual models, which could potentially impact the meta-model's efficacy. To address this, PCA was applied to the model predictions to generate de- correlated components as inputs for the meta-model, thereby enhancing prediction accuracy and robustness. Evaluation based on the area under the receiver operating characteristic (AUROC) curve demonstrated that the GAM outperformed all other individual models with the highest accuracy score of 0.924. In contrast, the RF and ANN models had the lowest accuracy, both registering at 0.872. However, the performance disparity across models was minimal. Notably, the PCA-based stacking approach (0.936) surpassed both traditional model stacking (0.912) and the performances of all individual models, advocating for its use in enhancing predictive accuracy. These findings endorse the PCA-stacking method over conventional stacking techniques. Nonetheless, further research across varied applications is warranted to generalize its efficacy. (c) 2024 COSPAR. Published by Elsevier B.V. All rights are reserved, including those for text and data mining, AI training, and similar technologies.
引用
收藏
页码:5371 / 5382
页数:12
相关论文
共 71 条
[1]   Principal component analysis [J].
Abdi, Herve ;
Williams, Lynne J. .
WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2010, 2 (04) :433-459
[2]  
Abedi H., 2019, J. Comput. Vision Imag. Syst., V5, P2, DOI [10.1109/Bhi.2018.8333371, DOI 10.1109/BHI.2018.8333371]
[3]   Flash-flood susceptibility mapping based on XGBoost, random forest and boosted regression trees [J].
Abedi, Rahebeh ;
Costache, Romulus ;
Shafizadeh-Moghadam, Hossein ;
Pham, Quoc Bao .
GEOCARTO INTERNATIONAL, 2022, 37 (19) :5479-5496
[4]   Unsupervised Rapid Flood Mapping Using Sentinel-1 GRD SAR Images [J].
Amitrano, Donato ;
Di Martino, Gerardo ;
Iodice, Antonio ;
Riccio, Daniele ;
Ruello, Giuseppe .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2018, 56 (06) :3290-3299
[5]   Flood susceptibility mapping using meta-heuristic algorithms [J].
Arabameri, Alireza ;
Danesh, Amir Seyed ;
Santosh, M. ;
Cerda, Artemi ;
Pal, Subodh Chandra ;
Ghorbanzadeh, Omid ;
Roy, Paramita ;
Chowdhuri, Indrajit .
GEOMATICS NATURAL HAZARDS & RISK, 2022, 13 (01) :949-974
[6]   Flash flood susceptibility modelling using functional tree and hybrid ensemble techniques [J].
Arabameri, Alireza ;
Saha, Sunil ;
Chen, Wei ;
Roy, Jagabandhu ;
Pradhan, Biswajeet ;
Bui, Dieu Tien .
JOURNAL OF HYDROLOGY, 2020, 587
[7]   Ensemble forecasting of species distributions [J].
Araujo, Miguel B. ;
New, Mark .
TRENDS IN ECOLOGY & EVOLUTION, 2007, 22 (01) :42-47
[8]  
Bezak N., 2019, WATER-SUI, V11, P2167, DOI DOI 10.3390/w11102167
[9]  
Bishop C., 2006, Pattern Recognition and Machine Learning
[10]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32