Regularized Estimation for the Accelerated Failure Time Model

被引:76
作者
Cai, T. [1 ]
Huang, J. [2 ]
Tian, L. [2 ]
机构
[1] Harvard Univ, Dept Biostat, Boston, MA 02115 USA
[2] Northwestern Univ, Dept Prevent Med, Chicago, IL 60611 USA
基金
美国国家卫生研究院;
关键词
AFT model; LASSO regularization; Linear programming; GENE-EXPRESSION SIGNATURE; PARTIAL LEAST-SQUARES; VARIABLE SELECTION; REGRESSION SHRINKAGE; LINEAR-REGRESSION; ADAPTIVE LASSO; SURVIVAL; PATH;
D O I
10.1111/j.1541-0420.2008.01074.x
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In the presence of high-dimensional predictors, it is challenging to develop reliable regression models that can be used to accurately predict future outcomes. Further complications arise when the outcome of interest is an event time, which is often not fully observed due to censoring. In this article, we develop robust prediction models for event time outcomes by regularizing the Gehan's estimator for the accelerated failure time (AFT) model (Tsiatis, 1996, Annals of Statistics 18, 305-328) with least absolute shrinkage and selection operator (LASSO) penalty. Unlike existing methods based on the inverse probability weighting and the Buckley and James estimator (Buckley and James, 1979, Biometrika 66, 429-436), the proposed approach does not require additional assumptions about the censoring and always yields a solution that is convergent. Furthermore, the proposed estimator leads to a stable regression model for prediction even if the AFT model fails to hold. To facilitate the adaptive selection of the tuning parameter, we detail an efficient numerical algorithm for obtaining the entire regularization path. The proposed procedures are applied to a breast cancer dataset to derive a reliable regression model for predicting patient survival based on a set of clinical prognostic factors and gene signatures. Finite sample performances of the procedures are evaluated through a simulation study.
引用
收藏
页码:394 / 404
页数:11
相关论文
共 40 条
[1]  
BUCKLEY J, 1979, BIOMETRIKA, V66, P429
[2]   Robustness, scalability, and integration of a wound-response gene expression signature in predicting breast cancer survival [J].
Chang, HY ;
Nuyten, DSA ;
Sneddon, JB ;
Hastie, T ;
Tibshirani, R ;
Sorlie, T ;
Dai, HY ;
He, YDD ;
van't Veer, LJ ;
Bartelink, H ;
van de Rijn, M ;
Brown, PO ;
van de Vijver, MJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2005, 102 (10) :3738-3743
[3]  
Cherkassky V, 1997, IEEE Trans Neural Netw, V8, P1564, DOI 10.1109/TNN.1997.641482
[4]   A note on Buckley-James estimators for censored data [J].
Currie, ID .
BIOMETRIKA, 1996, 83 (04) :912-915
[5]   Predicting patient survival from microarray data by accelerated failure time modeling using partial least squares and LASSO [J].
Datta, Susmita ;
Le-Rademacher, Jennifer ;
Datta, Somnath .
BIOMETRICS, 2007, 63 (01) :259-271
[6]   Least angle regression - Rejoinder [J].
Efron, B ;
Hastie, T ;
Johnstone, I ;
Tibshirani, R .
ANNALS OF STATISTICS, 2004, 32 (02) :494-499
[7]   Variable selection via nonconcave penalized likelihood and its oracle properties [J].
Fan, JQ ;
Li, RZ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) :1348-1360
[8]  
FLEMING T, 2001, COUNTING PROCESSES S
[9]  
Friedman Jerome, 2004, Gradient directed regularization
[10]   Penalized Cox regression analysis in the high-dimensional and low-sample size settings, with applications to microarray gene expression data [J].
Gui, J ;
Li, HZ .
BIOINFORMATICS, 2005, 21 (13) :3001-3008