An online framework for survival analysis: reframing Cox proportional hazards model for large data sets and neural networks

被引:3
作者
Tarkhan, Aliasghar [1 ]
Simon, Noah [1 ]
机构
[1] Univ Washington, Hans Rosling Ctr Populat Hlth, Dept Biostat, Box 351617, Seattle, WA 98195 USA
基金
美国国家卫生研究院;
关键词
Cox proportional hazards model; Inference; Large survival data sets; Neural networks; Stochastic gradient descent; Streaming data; REGRESSION; ESTIMATORS;
D O I
10.1093/biostatistics/kxac039
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In many biomedical applications, outcome is measured as a "time-to-event" (e.g., disease progression or death). To assess the connection between features of a patient and this outcome, it is common to assume a proportional hazards model and fit a proportional hazards regression (or Cox regression). To fit this model, a log-concave objective function known as the "partial likelihood" is maximized. For moderate-sized data sets, an efficient Newton-Raphson algorithm that leverages the structure of the objective function can be employed. However, in large data sets this approach has two issues: (i) The computational tricks that leverage structure can also lead to computational instability; (ii) The objective function does not naturally decouple: Thus, if the data set does not fit in memory, the model can be computationally expensive to fit. This additionally means that the objective is not directly amenable to stochastic gradient-based optimization methods. To overcome these issues, we propose a simple, new framing of proportional hazards regression: This results in an objective function that is amenable to stochastic gradient descent. We show that this simple modification allows us to efficiently fit survival models with very large data sets. This also facilitates training complex, for example, neural-network-based, models with survival data.
引用
收藏
页码:134 / 153
页数:20
相关论文
共 48 条
[1]  
Aggarwal Charu C., 2018, Neural Networks and Deep Learning
[2]   Interpreting the concordance statistic of a logistic regression model: relation to the variance and odds ratio of a continuous explanatory variable [J].
Austin, Peter C. ;
Steyerberg, Ewout W. .
BMC MEDICAL RESEARCH METHODOLOGY, 2012, 12
[3]  
Bach Francis., 2011, Advances in Neural Information Processing Systems, P451
[4]   Generating survival times to simulate Cox proportional hazards models [J].
Bender, R ;
Augustin, T ;
Blettner, M .
STATISTICS IN MEDICINE, 2005, 24 (11) :1713-1723
[5]   SOME PROPERTIES OF INCOMPLETE U-STATISTICS [J].
BLOM, G .
BIOMETRIKA, 1976, 63 (03) :573-580
[6]   Large-Scale Machine Learning with Stochastic Gradient Descent [J].
Bottou, Leon .
COMPSTAT'2010: 19TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL STATISTICS, 2010, :177-186
[7]  
Boyd Stephen., 2010, Foundations and Trends R in Machine Learning, V3, P1, DOI DOI 10.1561/2200000016
[8]  
Breslow N E, 1980, IARC Sci Publ, P5
[9]  
Chapfuwa Paidamoyo, 2018, Proc Mach Learn Res, V80, P735
[10]   Cox-nnet: An artificial neural network method for prognosis prediction of high-throughput omics data [J].
Ching, Travers ;
Zhu, Xun ;
Garmire, Lana X. .
PLOS COMPUTATIONAL BIOLOGY, 2018, 14 (04)