Fitting survival data with penalized Poisson regression

Cited by: 2
Authors
Perperoglou, Aris [1]
Affiliations
[1] Univ E Anglia, Fac Hlth, Norwich NR4 7TJ, Norfolk, England
Keywords
Proportional hazards; Penalized likelihood; Ridge regression; Efficient computation; RIDGE-REGRESSION; LIKELIHOOD; SELECTION; MODELS;
DOI
10.1007/s10260-011-0172-1
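Chinese Library Classification number
O21 [Probability theory and mathematical statistics]; C8 [Statistics]
Discipline classification code
020208; 070103; 0714
Abstract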
Cox's proportional hazards model is the most common way to analyze survival data. The model can be extended with a ridge penalty in the presence of collinearity, or when a very large number of coefficients has to be estimated (e.g. with microarray data). To maximize the penalized likelihood, an optimal weight for the ridge penalty has to be obtained. However, there is no definite rule for choosing the penalty weight. One approach chooses the weight by maximizing the leave-one-out cross-validated partial likelihood; however, this is time-consuming and computationally expensive, especially in large datasets. We suggest modelling survival data through a Poisson model. Using this approach, the log-likelihood of a Poisson model is maximized by standard iteratively reweighted least squares. We illustrate this simple approach, which includes smoothing of the hazard function, and move on to include a ridge term in the likelihood. We then maximize the likelihood by considering tools from generalized linear mixed models. We show that the optimal value of the penalty is found simply by computing the hat matrix of the system of linear equations and dividing its trace by a product of the estimated coefficients.
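The abstract's recipe can be sketched in a few lines of NumPy. This is a minimal illustration under stated assumptions, not the paper's own implementation: follow-up time is split into intervals with a piecewise-constant baseline hazard (the paper mentions smoothing the hazard, which is omitted here), the ridge penalty applies to the covariate coefficients only, and the penalty weight is updated as lambda = ED / (beta' beta), where ED is the part of the hat-matrix trace belonging to the penalized coefficients. That update is one reading of "dividing its trace by a product of the estimated coefficients". The function names, the simulated data and the interval cut points are all hypothetical.

```python
import numpy as np

def expand_to_intervals(time, event, X, cuts):
    """Split each subject's follow-up at `cuts` into one row per interval at risk."""
    edges = np.concatenate(([0.0], cuts, [np.inf]))
    ys, expos, ints, xs = [], [], [], []
    for t, d, x in zip(time, event, X):
        for j in range(len(edges) - 1):
            lo, hi = edges[j], edges[j + 1]
            if t <= lo:                      # subject no longer at risk
                break
            expos.append(min(t, hi) - lo)    # time spent in interval j
            ys.append(1.0 if (d == 1 and t <= hi) else 0.0)
            ints.append(j)
            xs.append(x)
    return np.array(ys), np.array(expos), np.array(ints), np.array(xs)

def fit_ridge_poisson(y, expo, interval, X, lam, n_iter=50, tol=1e-8):
    """Penalized IRLS for a Poisson model with offset log(exposure)."""
    n_int, p = interval.max() + 1, X.shape[1]
    Z = np.hstack([np.eye(n_int)[interval], X])        # baseline indicators + covariates
    P = np.diag(np.r_[np.zeros(n_int), np.ones(p)])    # penalize covariates only
    offset = np.log(expo)
    theta = np.r_[np.full(n_int, np.log(y.sum() / expo.sum())), np.zeros(p)]
    for _ in range(n_iter):
        lin = Z @ theta
        mu = np.exp(np.clip(lin + offset, -30, 30))
        G = Z.T @ (mu[:, None] * Z)                    # Z' W Z with W = diag(mu)
        # Right-hand side is Z' W z for the working response z = lin + (y - mu) / mu
        new = np.linalg.solve(G + lam * P, Z.T @ (mu * lin + y - mu))
        converged = np.max(np.abs(new - theta)) < tol
        theta = new
        if converged:
            break
    mu = np.exp(np.clip(Z @ theta + offset, -30, 30))
    G = Z.T @ (mu[:, None] * Z)
    H = np.linalg.solve(G + lam * P, G)                # hat matrix in coefficient space
    ed = np.trace(H[n_int:, n_int:])                   # effective dimension of penalized part
    return theta, ed

def select_lambda(y, expo, interval, X, lam=1.0, n_outer=100, tol=1e-6):
    """Fixed-point update lam = ED / (beta' beta); an assumed mixed-model style rule."""
    p = X.shape[1]
    for _ in range(n_outer):
        theta, ed = fit_ridge_poisson(y, expo, interval, X, lam)
        beta = theta[-p:]
        new_lam = ed / (beta @ beta)
        converged = abs(new_lam - lam) < tol * (1 + lam)
        lam = new_lam
        if converged:
            break
    theta, ed = fit_ridge_poisson(y, expo, interval, X, lam)
    return lam, theta, ed

# Simulated example data (hypothetical, for illustration only)
rng = np.random.default_rng(0)
n, p = 200, 5
X = rng.normal(size=(n, p))
true_beta = np.array([0.5, -0.5, 0.0, 0.0, 0.0])
t_event = rng.exponential(1.0 / (0.1 * np.exp(X @ true_beta)))
t_cens = rng.exponential(20.0, size=n)
time = np.minimum(t_event, t_cens)
event = (t_event <= t_cens).astype(int)
cuts = np.quantile(time[event == 1], [0.25, 0.5, 0.75])

y, expo, interval, Xr = expand_to_intervals(time, event, X, cuts)
lam_hat, theta_hat, ed_hat = select_lambda(y, expo, interval, Xr)
beta_hat = theta_hat[-p:]
```

Each outer step costs only one penalized Poisson fit, in contrast to leave-one-out cross-validation, which needs many refits per candidate weight. The fixed-point iteration can drift towards a very large weight when the covariates carry little signal, so a cap on the number of outer iterations is a sensible safeguard.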
Pages: 451-462
Number of pages: 12