Explainable step-wise binary classification for the susceptibility assessment of geo-hydrological hazards

被引:33
作者
Ekmekcioglu, Omer [1 ]
Koc, Kerim [2 ]
机构
[1] Istanbul Tech Univ, Civil Engn Dept, Istanbul, Turkey
[2] Yildiz Tech Univ, Civil Engn Dept, Istanbul, Turkey
关键词
Flooding; Landslides; Disaster management; Explainable artificial intelligence; Geo-hydrological hazards; SHAP; LANDSLIDE SUSCEPTIBILITY; BIVARIATE STATISTICS; ARTIFICIAL-INTELLIGENCE; FLOOD; MACHINE; PREDICTION; ALGORITHM; TREES; PRONE; GIS;
D O I
10.1016/j.catena.2022.106379
中图分类号
P [天文学、地球科学];
学科分类号
07 ;
摘要
This research proposes a novel step-wise binary prediction framework for the susceptibility assessment of geo-hydrological hazards specific to floods and landslides. The framework of the study comprises two major steps: prediction of geo-hydrological hazard-prone locations (Step-1: hazard/non-hazard), and classification of geo-hydrological hazards by identifying the locations of floods and landslides separately (Step-2: floods/land-slides). We used 1326 historically experienced hazard locations (i.e., 726 for floods and 690 for landslides) in the Kentucky River basin, United States, along with the 13 hazard conditioning factors. Extremely randomized trees (ERT) coupled with the particle swarm optimization (PSO) was adopted to provide an effective classification scheme. Based on the predictions of the ERT-PSO in the first step, correctly classified hazard instances were used in the second step of the prediction task to further deepen the machine learning application. The results revealed a strong agreement between the predicted and observed hazard locations with an AUROC of 0.8032 and 0.8845 for geo-hydrological hazard (Step-1) and flood/landslide classifications (Step-2), respectively. The proposed hybrid prediction framework introduced considerably accurate performance as 73.78% and 72.91% of the hazard and non-hazard classes were correctly identified at Step-1, respectively, while at Step-2, 72.31% of the flooding points and 84.85% of the landslide points were ascertained accurately. Overall findings emerged from Step-1 illustrated that nearly 10% of the entire basin is susceptible to geo-hydrological hazards with very high probability, whereas very low susceptible areas cover only 20% of the basin. A model-agnostic game-theory based SHapley Additive explanations (SHAP) algorithm was employed to anatomize the contribution of hazard conditioning factors on the incident outcome predictions aiding to increase the interpretability of the adopted methodology. The holistic approach adopted in the present research has significant potential in providing insights into the practical and theoretical grounds of the literature.
引用
收藏
页数:18
相关论文
共 106 条
[1]   Flash-flood susceptibility mapping based on XGBoost, random forest and boosted regression trees [J].
Abedi, Rahebeh ;
Costache, Romulus ;
Shafizadeh-Moghadam, Hossein ;
Pham, Quoc Bao .
GEOCARTO INTERNATIONAL, 2022, 37 (19) :5479-5496
[2]   Performance Comparison of Support Vector Machine, Random Forest, and Extreme Learning Machine for Intrusion Detection [J].
Ahmad, Iftikhar ;
Basheri, Mohammad ;
Iqbal, Muhammad Javed ;
Rahim, Aneel .
IEEE ACCESS, 2018, 6 :33789-33795
[3]   Mapping flood susceptibility in an arid region of southern Iraq using ensemble machine learning classifiers: a comparative study [J].
Al-Abadi, Alaa M. .
ARABIAN JOURNAL OF GEOSCIENCES, 2018, 11 (09)
[4]   Integration of hard and soft supervised machine learning for flood susceptibility mapping [J].
Andaryani, Soghra ;
Nourani, Vahid ;
Haghighi, Ali Torabi ;
Keesstra, Saskia .
JOURNAL OF ENVIRONMENTAL MANAGEMENT, 2021, 291
[5]  
Anderson MG, 2013, COMMUNITY-BASED LANDSLIDE RISK REDUCTION: MANAGING DISASTERS IN SMALL STEPS, P1, DOI 10.1596/978-0-8213-9456-4
[6]  
[Anonymous], 2021, Crops
[7]  
[Anonymous], 2001, Geo-hydrological Hazards BT-Anatomy of an Orogen: the Apennines and Adjacent Mediterranean Basins, DOI DOI 10.1007/978-94-015-9829-328
[8]   Computational intelligence optimization approach based on particle swarm optimizer and neutrosophic set for abdominal CT liver tumor segmentation [J].
Anter, Ahmed M. ;
Hassenian, Aboul Ella .
JOURNAL OF COMPUTATIONAL SCIENCE, 2018, 25 :376-387
[9]   Optimization of state-of-the-art fuzzy-metaheuristic ANFIS-based machine learning models for flood susceptibility prediction mapping in the Middle Ganga Plain, India [J].
Arora, Aman ;
Arabameri, Alireza ;
Pandey, Manish ;
Siddiqui, Masood A. ;
Shukla, U. K. ;
Dieu Tien Bui ;
Mishra, Varun Narayan ;
Bhardwaj, Anshuman .
SCIENCE OF THE TOTAL ENVIRONMENT, 2021, 750
[10]   Copula-based IDF curves and empirical rainfall thresholds for flash floods and rainfall-induced landslides [J].
Bezak, Nejc ;
Sraj, Mojca ;
Mikos, Matjaz .
JOURNAL OF HYDROLOGY, 2016, 541 :272-284