Extreme flash flood susceptibility mapping using a novel PCA-based model stacking approach

被引:8
作者
Shojaeian, Amirreza [1 ]
Shafizadeh-Moghadam, Hossein [2 ]
Sharafati, Ahmad [1 ]
Shahabi, Himan [3 ,4 ]
机构
[1] Islamic Azad Univ, Dept Civil Engn, Sci & Res Branch, Tehran, Iran
[2] Tarbiat Modares Univ, Dept Water Engn & Management, Tehran, Iran
[3] Univ Kurdistan, Fac Nat Resources, Dept Geomorphol, Sanandaj 6617715175, Iran
[4] Silesian Tech Univ, Inst Phys, Div Geochronol & Environm Isotopes, PL-44100 Gliwice, Poland
关键词
Machine learning; Meta-model; Model integration; PCA; Karkheh Basin; REGRESSION; CLASSIFICATION; PREDICTION; NETWORKS; AREAS;
D O I
10.1016/j.asr.2024.08.004
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
This study introduces an efficient methodology for model stacking, incorporating six diverse machine learning and statistical models alongside principal component analysis (PCA). The approach is applied for the flash flood susceptibility mapping within the Karkheh Basin in Iran. The selected models include random forest (RF), boosted regression trees (BRT), support vector machine (SVM), artificial neural networks (ANN), generalized additive model (GAM), and the least absolute shrinkage and selection operator (Lasso), with RF also serving as the meta-model for the stacking. The results revealed significant correlations among the predictions of the individual models, which could potentially impact the meta-model's efficacy. To address this, PCA was applied to the model predictions to generate de- correlated components as inputs for the meta-model, thereby enhancing prediction accuracy and robustness. Evaluation based on the area under the receiver operating characteristic (AUROC) curve demonstrated that the GAM outperformed all other individual models with the highest accuracy score of 0.924. In contrast, the RF and ANN models had the lowest accuracy, both registering at 0.872. However, the performance disparity across models was minimal. Notably, the PCA-based stacking approach (0.936) surpassed both traditional model stacking (0.912) and the performances of all individual models, advocating for its use in enhancing predictive accuracy. These findings endorse the PCA-stacking method over conventional stacking techniques. Nonetheless, further research across varied applications is warranted to generalize its efficacy. (c) 2024 COSPAR. Published by Elsevier B.V. All rights are reserved, including those for text and data mining, AI training, and similar technologies.
引用
收藏
页码:5371 / 5382
页数:12
相关论文
共 71 条
[21]   Detection of areas prone to flood risk using state-of-the-art machine learning models [J].
Costache, Romulus ;
Arabameri, Alireza ;
Elkhrachy, Ismail ;
Ghorbanzadeh, Omid ;
Quoc Bao Pham .
GEOMATICS NATURAL HAZARDS & RISK, 2021, 12 (01) :1488-1507
[23]   Combining satellite data and appropriate objective functions for improved spatial pattern performance of a distributed hydrologic model [J].
Demirel, Mehmet C. ;
Mai, Juliane ;
Mendiguren, Gorka ;
Koch, Julian ;
Samaniego, Luis ;
Stisen, Simon .
HYDROLOGY AND EARTH SYSTEM SCIENCES, 2018, 22 (02) :1299-1315
[24]   A working guide to boosted regression trees [J].
Elith, J. ;
Leathwick, J. R. ;
Hastie, T. .
JOURNAL OF ANIMAL ECOLOGY, 2008, 77 (04) :802-813
[25]   A spatial model for coastal flood susceptibility assessment using the 2D-SPR method with complex network theory: A case study of a reclamation island in Zhoushan, China [J].
Fang, Xin ;
Zhang, Yifei ;
Xiang, Yunyun ;
Zou, Jiaqi ;
Li, Xiaoyan ;
Hao, Chunling ;
Wang, Jingchen .
ENVIRONMENTAL IMPACT ASSESSMENT REVIEW, 2023, 98
[26]   Satellite-supported flood forecasting in river networks: A real case study [J].
Garcia-Pintado, Javier ;
Mason, David C. ;
Dance, Sarah L. ;
Cloke, Hannah L. ;
Neal, Jeff C. ;
Freer, Jim ;
Bates, Paul D. .
JOURNAL OF HYDROLOGY, 2015, 523 :706-724
[27]  
Ghosh A, 2018, NAT HAZARDS, V94, P349, DOI 10.1007/s11069-018-3392-y
[28]   Assimilating SAR-derived water level data into a hydraulic model: a case study [J].
Giustarini, L. ;
Matgen, P. ;
Hostache, R. ;
Montanari, M. ;
Plaza, D. ;
Pauwels, V. R. N. ;
De Lannoy, G. J. M. ;
De Keyser, R. ;
Pfister, L. ;
Hoffmann, L. ;
Savenije, H. H. G. .
HYDROLOGY AND EARTH SYSTEM SCIENCES, 2011, 15 (07) :2349-2365
[29]   Evaluating machine learning and statistical prediction techniques for landslide susceptibility modeling [J].
Goetz, J. N. ;
Brenning, A. ;
Petschko, H. ;
Leopold, P. .
COMPUTERS & GEOSCIENCES, 2015, 81 :1-11
[30]   Investigation of the random forest framework for classification of hyperspectral data [J].
Ham, J ;
Chen, YC ;
Crawford, MM ;
Ghosh, J .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2005, 43 (03) :492-501