Integration of shapley additive explanations with random forest model for quantitative precipitation estimation of mesoscale convective systems

被引:17
作者
He, Zheyu [1 ]
Yang, Yuanjian [1 ]
Fang, Runzhuo [1 ]
Zhou, Shaohui [1 ]
Zhao, Wenchuan [1 ]
Bai, Yingjie [2 ]
Li, Junsheng [1 ]
Wang, Bo [3 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Collaborat Innovat Ctr Forecast & Evaluat Meteorol, Sch Atmospher Phys, Key Lab Aerosol Cloud Precipitat,China Meteorol Ad, Nanjing, Peoples R China
[2] China Flight Test Estab, Xian, Peoples R China
[3] Boyan Informat Technol Xian Co Ltd, Xian, Peoples R China
基金
中国国家自然科学基金;
关键词
shapley additive explanations (SHAP); interpretability; random forest; quantitative precipitation estimation (QPE); himawari-8; RAINFALL; TEMPERATURE; PREDICTION; FEATURES; SUMMER; CLOUDS; CHINA;
D O I
10.3389/fenvs.2022.1057081
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Mesoscale convective cloud systems have a small horizontal scale and a short lifetime, which brings great challenges to quantitative precipitation estimation (QPE) by satellite remote sensing. Combining machine learning models and geostationary satellite spectral information is an effective method for the QPE of mesoscale convective cloud, while the interpretability of machine learning model outputs remains unclear. In this study, based on Himawari-8 data, high-density automatic weather station observations, and reanalysis data over the North China Plain, a random forest (RF) machine learning model of satellite-based QPE was established and verified. The interpretation of the output of the RF model of satellite-based QPE was further explored by using the Shapley Additive Explanations (SHAP) algorithm. Results showed that the correlation coefficient between the predicted and observed precipitation intensity of the RF model was .64, with a root-mean-square error of .27 mm/h. The importance ranking obtained by SHAP model is completely consistent with the outputs of random forest importance function. This SHAP method can display the importance ranking of global features with positive/negative contribution values (e.g., current precipitation, column water vapor/black body temperature, cloud base height), and can visualize the marginal contribution values of local features under interaction. Therefore, combining the RF and SHAP methods provides a valuable way to interpret the output of machine learning models for satellite-based QPE, as well as an important basis for the selection of input variables for satellite-based QPE.
引用
收藏
页数:15
相关论文
共 71 条
[1]  
ADLER RF, 1984, J CLIM APPL METEOROL, V23, P280, DOI 10.1175/1520-0450(1984)023<0280:TCHRRF>2.0.CO
[2]  
2
[4]   Heavy precipitation in a changing climate: Does short-term summer precipitation increase faster? [J].
Ban, Nikolina ;
Schmidli, Juerg ;
Schaer, Christoph .
GEOPHYSICAL RESEARCH LETTERS, 2015, 42 (04) :1165-1172
[5]   Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI [J].
Barredo Arrieta, Alejandro ;
Diaz-Rodriguez, Natalia ;
Del Ser, Javier ;
Bennetot, Adrien ;
Tabik, Siham ;
Barbado, Alberto ;
Garcia, Salvador ;
Gil-Lopez, Sergio ;
Molina, Daniel ;
Benjamins, Richard ;
Chatila, Raja ;
Herrera, Francisco .
INFORMATION FUSION, 2020, 58 :82-115
[6]   An Interpretable Prediction Model for Identifying N7-Methylguanosine Sites Based on XGBoost and SHAP [J].
Bi, Yue ;
Xiang, Dongxu ;
Ge, Zongyuan ;
Li, Fuyi ;
Jia, Cangzhi ;
Song, Jiangning .
MOLECULAR THERAPY-NUCLEIC ACIDS, 2020, 22 :362-372
[7]   Machine Learning in Weather Prediction and Climate Analyses-Applications and Perspectives [J].
Bochenek, Bogdan ;
Ustrnul, Zbigniew .
ATMOSPHERE, 2022, 13 (02)
[8]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[9]  
Brenner IS, 2004, WEATHER FORECAST, V19, P286, DOI 10.1175/1520-0434(2004)019<0286:TRBMPA>2.0.CO
[10]  
2