MS-CPFI: A model-agnostic Counterfactual Perturbation Feature Importance algorithm for interpreting black-box Multi-State models

被引:4
作者
Cottin, Aziliz [1 ,2 ,3 ]
Zulian, Marine [1 ]
Pecuchet, Nicolas [1 ]
Guilloux, Agathe [3 ]
Katsahian, Sandrine [2 ,3 ,4 ,5 ]
机构
[1] Healthcare & Life Sci Res, Dassault Syst, Velizy Villacoublay, France
[2] Univ Paris Cite, Paris, France
[3] INRIA, HeKa team, Paris, France
[4] Georges Pompidou Assistance Publ Hop Paris, Med Informat Biostat & Publ Hlth Dept, Paris, France
[5] Epidemiol Clin, Inserm, Ctr Invest Clin 1418 CIC1418, Paris, France
关键词
Multi-state model; Deep learning; Interpretability; Feature importance; Medical decision; COMPETING RISKS;
D O I
10.1016/j.artmed.2023.102741
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-state processes (Webster, 2019) are commonly used to model the complex clinical evolution of diseases where patients progress through different states. In recent years, machine learning and deep learning algorithms have been proposed to improve the accuracy of these models' predictions (Wang et al., 2019). However, acceptability by patients and clinicians, as well as for regulatory compliance, require interpretability of these algorithms's predictions. Existing methods, such as the Permutation Feature Importance algorithm, have been adapted for interpreting predictions in black-box models for 2-state processes (corresponding to survival analysis). For generalizing these methods to multi-state models, we introduce a novel model-agnostic interpretability algorithm called Multi-State Counterfactual Perturbation Feature Importance (MS-CPFI) that computes feature importance scores for each transition of a general multi-state model, including survival, competing-risks, and illness-death models. MS-CPFI uses a new counterfactual perturbation method that allows interpreting feature effects while capturing the non-linear effects and potentially capturing time-dependent effects. Experimental results on simulations show that MS-CPFI increases model interpretability in the case of non-linear effects. Additionally, results on a real-world dataset for patients with breast cancer confirm that MSCPFI can detect clinically important features and provide information on the disease progression by displaying features that are protective factors versus features that are risk factors for each stage of the disease. Overall, MS-CPFI is a promising model-agnostic interpretability algorithm for multi-state models, which can improve the interpretability of machine learning and deep learning algorithms in healthcare.
引用
收藏
页数:17
相关论文
共 66 条
[1]  
Agarwal Rishabh, 2021, Advances in Neural Information Processing Systems, V34
[2]   Estrogen and Progesterone Receptor Testing in Breast Cancer: ASCO/CAP Guideline Update [J].
Allison, Kimberly H. ;
Hammond, M. Elizabeth H. ;
Dowsett, Mitchell ;
McKernin, Shannon E. ;
Carey, Lisa A. ;
Fitzgibbons, Patrick L. ;
Hayes, Daniel F. ;
Lakhani, Sunil R. ;
Chavez-MacGregor, Mariana ;
Perlmutter, Jane ;
Perou, Charles M. ;
Regan, Meredith M. ;
Rimm, David L. ;
Symmans, W. Fraser ;
Torlakovic, Emina E. ;
Varella, Leticia ;
Viale, Giuseppe ;
Weisberg, Tracey F. ;
McShane, Lisa M. ;
Wolff, Antonio C. .
JOURNAL OF CLINICAL ONCOLOGY, 2020, 38 (12) :1346-+
[3]  
Ancona M, 2018, Arxiv, DOI arXiv:1711.06104
[4]  
Ancona Marco., 2019, Explainable AI: Interpreting, explaining and visualizing deep learning, P169, DOI [10.1007/978-3-030-28954-6_9, DOI 10.1007/978-3-030-28954-6_9]
[5]  
Andersen P.K., 1997, STAT MODELS BASED CO
[6]  
[Anonymous], 2017, Spline terms in a Cox model
[7]   Visualizing the effects of predictor variables in black box supervised learning models [J].
Apley, Daniel W. ;
Zhu, Jingyu .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2020, 82 (04) :1059-1086
[8]   Generating survival times to simulate Cox proportional hazards models [J].
Bender, R ;
Augustin, T ;
Blettner, M .
STATISTICS IN MEDICINE, 2005, 24 (11) :1713-1723
[9]  
Breiman L, 2001, MACH LEARN, V45, P5, DOI [10.1186/s12859-018-2419-4, 10.3322/caac.21834]
[10]   Focus on an infrequently used quantity in the context of competing risks: The conditional probability function [J].
Cabarrou, Bastien ;
Dalenc, Florence ;
Leconte, Eve ;
Boher, Jean-Marie ;
Filleron, Thomas .
COMPUTERS IN BIOLOGY AND MEDICINE, 2018, 101 :70-81