SurvSHAP(t): Time-dependent explanations of machine learning survival models

被引:63
作者
Krzyzinski, Mateusz [1 ]
Spytek, Mikolaj [1 ]
Baniecki, Hubert [1 ,2 ]
Biecek, Przemyslaw [1 ,2 ]
机构
[1] Warsaw Univ Technol, Fac Math & Informat Sci, Warsaw, Poland
[2] Univ Warsaw, Fac Math Informat & Mech, Warsaw, Poland
关键词
Survival analysis; Cox Proportional Hazards model; Random Survival Forest; Interpretability; Explainable AI; EXPRESSION; FAILURE;
D O I
10.1016/j.knosys.2022.110234
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Machine and deep learning survival models demonstrate similar or even improved time-to-event prediction capabilities compared to classical statistical learning methods yet are too complex to be interpreted by humans. Several model-agnostic explanations are available to overcome this issue; how-ever, none directly explain the survival function prediction. In this paper, we introduce SurvSHAP(t), the first time-dependent explanation that allows for interpreting survival black-box models. It is based on SHapley Additive exPlanations with solid theoretical foundations and a broad adoption among machine learning practitioners. The proposed methods aim to enhance precision diagnostics and support domain experts in making decisions. Experiments on synthetic and medical data confirm that SurvSHAP(t) can detect variables with a time-dependent effect, and its aggregation is a better deter-minant of the importance of variables for a prediction than SurvLIME. SurvSHAP(t) is model-agnostic and can be applied to all models with functional output. We provide an accessible implementation of time-dependent explanations in Python at https://github.com/MI2DataLab/survshap. (c) 2023 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页数:13
相关论文
共 71 条
[1]   NONPARAMETRIC INFERENCE FOR A FAMILY OF COUNTING PROCESSES [J].
AALEN, O .
ANNALS OF STATISTICS, 1978, 6 (04) :701-726
[2]   Explaining individual predictions when features are dependent: More accurate approximations to Shapley values [J].
Aas, Kjersti ;
Jullum, Martin ;
Loland, Anders .
ARTIFICIAL INTELLIGENCE, 2021, 298
[3]  
Agarwal Rishabh, 2021, ADV NEURAL INFORM PR, V34
[4]   Current Challenges and Future Opportunities for XAI in Machine Learning-Based Clinical Decision Support Systems: A Systematic Review [J].
Antoniadi, Anna Markella ;
Du, Yuhan ;
Guendouz, Yasmine ;
Wei, Lan ;
Mazo, Claudia ;
Becker, Brett A. ;
Mooney, Catherine .
APPLIED SCIENCES-BASEL, 2021, 11 (11)
[5]  
Baniecki H, 2021, J MACH LEARN RES, V22
[6]   Simple Models in Complex Worlds: Occam's Razor and Statistical Learning Theory [J].
Bargagli Stoffi, Falco J. ;
Cevolani, Gustavo ;
Gnecco, Giorgio .
MINDS AND MACHINES, 2022, 32 (01) :13-42
[7]   Generating survival times to simulate Cox proportional hazards models [J].
Bender, R ;
Augustin, T ;
Blettner, M .
STATISTICS IN MEDICINE, 2005, 24 (11) :1713-1723
[8]   TimeSHAP: Explaining Recurrent Models through Sequence Perturbations [J].
Bento, Joao ;
Saleiro, Pedro ;
Cruz, Andre F. ;
Figueiredo, Mario A. T. ;
Bizarro, Pedro .
KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, :2565-2573
[9]  
Biecek PL, 2018, J MACH LEARN RES, V19
[10]   Variational Bayes survival analysis for unemployment modelling [J].
Boskoski, Pavle ;
Perne, Matija ;
Ramesa, Martina ;
Boshkoska, Biljana Mileva .
KNOWLEDGE-BASED SYSTEMS, 2021, 229