Using Bayesian Neural Networks to Select Features and Compute Credible Intervals for Personalized Survival Prediction

被引:3
作者
Qi, Shi-ang [1 ]
Kumar, Neeraj [2 ,3 ]
Verma, Ruchika [3 ]
Xu, Jian-Yi [4 ]
Shen-Tu, Grace [4 ]
Greiner, Russell [2 ,3 ]
机构
[1] Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2R3, Canada
[2] Univ Alberta, Dept Comp Sci, Edmonton, AB, Canada
[3] Alberta Machine Intelligence Inst, Edmonton, AB, Canada
[4] Alberta Hlth Serv, Albertas Tomorrow Project, Canc Care Alberta, Edmonton, AB, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Machine Learning; Bayesian neural networks; credible intervals; feature selection; precision medicine; survival analysis; VARIABLE SELECTION; HEART-FAILURE; REGRESSION; REGULARIZATION; ISSUES; MODEL;
D O I
10.1109/TBME.2023.3287514
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
An Individual Survival Distribution (ISD) models a patient's personalized survival probability at all future time points. Previously, ISD models have been shown to produce accurate and personalized survival estimates (for example, time to relapse or to death) in several clinical applications. However, off-the-shelf neural-network-based ISD models are usually opaque models due to their limited support for meaningful feature selection and uncertainty estimation, which hinders their wide clinical adoption. Here, we introduce a Bayesian-neural-network-based ISD (BNN-ISD) model that produces accurate survival estimates but also quantifies the uncertainty in model's parameter estimation, which can be used to (1) rank the importance of the input features to support feature selection and (2) compute credible intervals around ISDs for clinicians to assess the model's confidence in its prediction. Our BNN-ISD model utilized sparsity-inducing priors to learn a sparse set of weights to enable feature selection. We provide empirical evidence, on 2 synthetic and 3 real-world clinical datasets, that BNN-ISD system can effectively select meaningful features and compute trustworthy credible intervals of the survival distribution for each patient. We observed that our approach accurately recovers feature importance in the synthetic datasets and selects meaningful features for the real-world clinical data as well, while also achieving state-of-the-art survival prediction performance. We also show that these credible regions can aid in clinical decision-making by providing a gauge of the uncertainty of the estimated ISD curves.
引用
收藏
页码:3389 / 3400
页数:12
相关论文
共 56 条
[1]   Guidance on the management of diarrhoea during cancer chemotherapy [J].
Andreyev, Jervoise ;
Ross, Paul ;
Donnellan, Clare ;
Lennan, Elaine ;
Leonard, Pauline ;
Waters, Caroline ;
Wedlake, Linda ;
Bridgewater, John ;
Glynne-Jones, Rob ;
Allum, William ;
Chau, Ian ;
Wilson, Richard ;
Ferry, David .
LANCET ONCOLOGY, 2014, 15 (10) :E447-E460
[2]  
Blundell C, 2015, PR MACH LEARN RES, V37, P1613
[3]   ANALYSIS OF SURVIVAL DATA UNDER PROPORTIONAL HAZARDS MODEL [J].
BRESLOW, NE .
INTERNATIONAL STATISTICAL REVIEW, 1975, 43 (01) :45-58
[4]  
Carvalho CM., 2009, ARTIF INTELL, P73
[5]  
Chapfuwa P, 2018, PR MACH LEARN RES, V80
[6]   Calibration and Uncertainty in Neural Time-to-Event Modeling [J].
Chapfuwa, Paidamoyo ;
Tao, Chenyang ;
Li, Chunyuan ;
Khan, Irfan ;
Chandross, Karen J. ;
Pencina, Michael J. ;
Carin, Lawrence ;
Henao, Ricardo .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (04) :1666-1680
[7]  
COX DR, 1975, BIOMETRIKA, V62, P269, DOI 10.1093/biomet/62.2.269
[8]  
COX DR, 1972, J R STAT SOC B, V34, P187
[9]  
Ding Chris, 2005, Journal of Bioinformatics and Computational Biology, V3, P185, DOI 10.1142/S0219720005001004
[10]  
Fotso S, 2018, Arxiv, DOI arXiv:1801.05512