Explainable machine learning for the prediction and assessment of complex drought impacts

被引:31
作者
Zhang, Beichen [1 ,2 ]
Abu Salem, Fatima K. [3 ]
Hayes, Michael J.
Smith, Kelly Helm [1 ,2 ]
Tadesse, Tsegaye [1 ,2 ]
Wardlow, Brian D. [1 ,4 ]
机构
[1] Univ Nebraska, Sch Nat Resources, Lincoln, NE 68583 USA
[2] Univ Nebraska, Natl Drought Mitigat Ctr, Lincoln, NE 68583 USA
[3] Amer Univ Beirut, Comp Sci Dept, Beirut, Lebanon
[4] Univ Nebraska, Ctr Adv Land Management Informat Technol, Lincoln, NE 68583 USA
关键词
Drought; Machine learning; Explainable AI; Impact assessment; IMBALANCED DATA; CLIMATE-CHANGE; INDICATORS; MODEL; CHALLENGES; INDEX; RISK;
D O I
10.1016/j.scitotenv.2023.165509
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Drought is a common and costly natural disaster with broad social, economic, and environmental impacts. Machine learning (ML) has been widely applied in scientific research because of its outstanding performance on predictive tasks. However, for practical applications like disaster monitoring and assessment, the cost of the models failure, especially false negative predictions, might significantly affect society. Stakeholders are not satisfied with or do not "trust" the predictions from a so-called black box. The explainability of ML models becomes progressively crucial in studying drought and its impacts. In this work, we propose an explainable ML pipeline using the XGBoost model and SHAP model based on a comprehensive database of drought impacts in the U.S. The XGBoost models significantly outperformed the baseline models in predicting the occurrence of multidimensional drought impacts derived from the text-based Drought Impact Reporter, attaining an average F2 score of 0.883 at the national level and 0.942 at the state level. The interpretation of the models at the state scale indicates that the Standardized Precipitation Index (SPI) and Standardized Temperature Index (STI) contribute significantly to predicting multi-dimensional drought impacts. The time scalar, importance, and relationships of the SPI and STI vary depending on the types of drought impacts and locations. The patterns between the SPI variables and drought impacts indicated by the SHAP values reveal an expected relationship in which negative SPI values positively contribute to complex drought impacts. The explainability based on the SPI variables improves the trustworthiness of the XGBoost models. Overall, this study reveals promising results in accurately predicting complex drought impacts and rendering the relationships between the impacts and indicators more interpretable. This study also reveals the potential of utilizing explainable ML for the general social good to help stakeholders better understand the multi-dimensional drought impacts at the regional level and motivate appropriate responses.
引用
收藏
页数:17
相关论文
共 93 条
[1]   Heat Waves in the United States: Mortality Risk during Heat Waves and Effect Modification by Heat Wave Characteristics in 43 U.S. Communities [J].
Anderson, G. Brooke ;
Bell, Michelle L. .
ENVIRONMENTAL HEALTH PERSPECTIVES, 2011, 119 (02) :210-218
[2]   Flood Hazard and Risk Mapping by Applying an Explainable Machine Learning Framework Using Satellite Imagery and GIS Data [J].
Antzoulatos, Gerasimos ;
Kouloglou, Ioannis-Omiros ;
Bakratsas, Marios ;
Moumtzidou, Anastasia ;
Gialampoukidis, Ilias ;
Karakostas, Anastasios ;
Lombardo, Francesca ;
Fiorin, Roberto ;
Norbiato, Daniele ;
Ferri, Michele ;
Symeonidis, Andreas ;
Vrochidis, Stefanos ;
Kompatsiaris, Ioannis .
SUSTAINABILITY, 2022, 14 (06)
[3]   Forest fires and climate change: causes, consequences and management options [J].
Aponte, Cristina ;
de Groot, William J. ;
Wotton, B. Mike .
INTERNATIONAL JOURNAL OF WILDLAND FIRE, 2016, 25 (08) :I-II
[4]   Deep Learning and Machine Learning in Hydrological Processes Climate Change and Earth Systems a Systematic Review [J].
Ardabili, Sina ;
Mosavi, Amir ;
Dehghani, Majid ;
Varkonyi-Koczy, Annamaria R. .
ENGINEERING FOR SUSTAINABLE FUTURE, 2020, 101 :52-62
[5]   A quantitative analysis to objectively appraise drought indicators and model drought impacts [J].
Bachmair, S. ;
Svensson, C. ;
Hannaford, J. ;
Barker, L. J. ;
Stahl, K. .
HYDROLOGY AND EARTH SYSTEM SCIENCES, 2016, 20 (07) :2589-2609
[6]   Exploring the link between drought indicators and impacts [J].
Bachmair, S. ;
Kohn, I. ;
Stahl, K. .
NATURAL HAZARDS AND EARTH SYSTEM SCIENCES, 2015, 15 (06) :1381-1397
[7]   Developing drought impact functions for drought risk management [J].
Bachmair, Sophie ;
Svensson, Cecilia ;
Prosdocimi, Ilaria ;
Hannaford, Jamie ;
Stahl, Kerstin .
NATURAL HAZARDS AND EARTH SYSTEM SCIENCES, 2017, 17 (11) :1947-1960
[8]   Drought indicators revisited: the need for a wider consideration of environment and society [J].
Bachmair, Sophie ;
Stahl, Kerstin ;
Collins, Kevin ;
Hannaford, Jamie ;
Acreman, Mike ;
Svoboda, Mark ;
Knutson, Cody ;
Smith, Kelly Helm ;
Wall, Nicole ;
Fuchs, Brian ;
Crossman, Neville D. ;
Overton, Ian C. .
WILEY INTERDISCIPLINARY REVIEWS-WATER, 2016, 3 (04) :516-536
[9]  
Begueria Santiago, 2023, CRAN
[10]   Towards pan-European drought risk maps: quantifying the link between drought indices and reported drought impacts [J].
Blauhut, Veit ;
Gudmundsson, Lukas ;
Stahl, Kerstin .
ENVIRONMENTAL RESEARCH LETTERS, 2015, 10 (01)