Variable selection in the accelerated failure time model via the bridge method

被引:91
作者
Huang, Jian [1 ,2 ]
Ma, Shuangge [3 ]
机构
[1] Univ Iowa, Dept Stat & Actuarial Sci, Iowa City, IA 52242 USA
[2] Univ Iowa, Dept Biostat, Iowa City, IA 52242 USA
[3] Yale Univ, Dept Epidemiol & Publ Hlth, New Haven, CT 06520 USA
基金
美国国家科学基金会;
关键词
Bridge penalization; Censored data; High dimensional data; Selection consistency; Stability; Sparse model; REGRESSION; SURVIVAL; LASSO; SAMPLE;
D O I
10.1007/s10985-009-9144-2
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
In high throughput genomic studies, an important goal is to identify a small number of genomic markers that are associated with development and progression of diseases. A representative example is microarray prognostic studies, where the goal is to identify genes whose expressions are associated with disease free or overall survival. Because of the high dimensionality of gene expression data, standard survival analysis techniques cannot be directly applied. In addition, among the thousands of genes surveyed, only a subset are disease-associated. Gene selection is needed along with estimation. In this article, we model the relationship between gene expressions and survival using the accelerated failure time (AFT) models. We use the bridge penalization for regularized estimation and gene selection. An efficient iterative computational algorithm is proposed. Tuning parameters are selected using V-fold cross validation. We use a resampling method to evaluate the prediction performance of bridge estimator and the relative stability of identified genes. We show that the proposed bridge estimator is selection consistent under appropriate conditions. Analysis of two lymphoma prognostic studies suggests that the bridge estimator can identify a small number of genes and can have better prediction performance than the Lasso.
引用
收藏
页码:176 / 195
页数:20
相关论文
共 25 条
  • [1] Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling
    Alizadeh, AA
    Eisen, MB
    Davis, RE
    Ma, C
    Lossos, IS
    Rosenwald, A
    Boldrick, JG
    Sabet, H
    Tran, T
    Yu, X
    Powell, JI
    Yang, LM
    Marti, GE
    Moore, T
    Hudson, J
    Lu, LS
    Lewis, DB
    Tibshirani, R
    Sherlock, G
    Chan, WC
    Greiner, TC
    Weisenburger, DD
    Armitage, JO
    Warnke, R
    Levy, R
    Wilson, W
    Grever, MR
    Byrd, JC
    Botstein, D
    Brown, PO
    Staudt, LM
    [J]. NATURE, 2000, 403 (6769) : 503 - 511
  • [2] [Anonymous], 2011, The statistical analysis of failure time data
  • [3] BUCKLEY J, 1979, BIOMETRIKA, V66, P429
  • [4] Prediction of survival in follicular lymphoma based on molecular features of tumor-infiltrating immune cells
    Dave, SS
    Wright, G
    Tan, B
    Rosenwald, A
    Gascoyne, RD
    Chan, WC
    Fisher, RI
    Braziel, RM
    Rimsza, LM
    Grogan, TM
    Miller, TP
    LeBlanc, M
    Greiner, TC
    Weisenburger, DD
    Lynch, JC
    Vose, J
    Armitage, JO
    Smeland, EB
    Kvaloy, S
    Holte, H
    Delabie, J
    Connors, JM
    Lansdorp, PM
    Ouyang, Q
    Lister, TA
    Davies, AJ
    Norton, AJ
    Muller-Hermelink, HK
    Ott, G
    Campo, E
    Montserrat, E
    Wilson, WH
    Jaffe, ES
    Simon, R
    Yang, LM
    Powell, J
    Zhao, H
    Goldschmidt, N
    Chiorazzi, M
    Staudt, LM
    [J]. NEW ENGLAND JOURNAL OF MEDICINE, 2004, 351 (21) : 2159 - 2169
  • [5] Least angle regression - Rejoinder
    Efron, B
    Hastie, T
    Johnstone, I
    Tibshirani, R
    [J]. ANNALS OF STATISTICS, 2004, 32 (02) : 494 - 499
  • [6] A STATISTICAL VIEW OF SOME CHEMOMETRICS REGRESSION TOOLS
    FRANK, IE
    FRIEDMAN, JH
    [J]. TECHNOMETRICS, 1993, 35 (02) : 109 - 135
  • [7] Penalized regressions: The bridge versus the lasso
    Fu, WJJ
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 1998, 7 (03) : 397 - 416
  • [8] Penalized Cox regression analysis in the high-dimensional and low-sample size settings, with applications to microarray gene expression data
    Gui, J
    Li, HZ
    [J]. BIOINFORMATICS, 2005, 21 (13) : 3001 - 3008
  • [9] Asymptotic properties of bridge estimators in sparse high-dimensional regression models
    Huang, Jian
    Horowitz, Joel L.
    Ma, Shuangge
    [J]. ANNALS OF STATISTICS, 2008, 36 (02) : 587 - 613
  • [10] Regularized estimation in the accelerated failure time model with high-dimensional covariates
    Huang, Jian
    Ma, Shuangge
    Xie, Huiliang
    [J]. BIOMETRICS, 2006, 62 (03) : 813 - 820