REGULARIZATION FOR COX'S PROPORTIONAL HAZARDS MODEL WITH NP-DIMENSIONALITY

被引:108
作者
Bradic, Jelena [1 ]
Fan, Jianqing [3 ]
Jiang, Jiancheng [2 ]
机构
[1] Univ Calif San Diego, Dept Math, La Jolla, CA 92093 USA
[2] Univ N Carolina, Dept Math & Stat, Charlotte, NC 28223 USA
[3] Princeton Univ, Bendheim Ctr Finance, Dept Operat Res & Financial Engn, Princeton, NJ 08544 USA
关键词
Hazard rate; LASSO; SCAD; large deviation; oracle; NONCONCAVE PENALIZED LIKELIHOOD; VARIABLE SELECTION; EXPONENTIAL INEQUALITIES; DANTZIG SELECTOR; LASSO; SPARSITY; REGRESSION; MARTINGALES; RECOVERY;
D O I
10.1214/11-AOS911
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
High throughput genetic sequencing arrays with thousands of measurements per sample and a great amount of related censored clinical data have increased demanding need for better measurement specific model selection. In this paper we establish strong oracle properties of nonconcave penalized methods for nonpolynomial (NP) dimensional data with censoring in the framework of Cox's proportional hazards model. A class of folded-concave penalties are employed and both LASSO and SCAD are discussed specifically. We unveil the question under which dimensionality and correlation restrictions can an oracle estimator be constructed and grasped. It is demonstrated that nonconcave penalties lead to significant reduction of the "irrepresentable condition" needed for LASSO model selection consistency. The large deviation result for martingales, bearing interests of its own, is developed for characterizing the strong oracle property. Moreover, the nonconcave regularized estimator, is shown to achieve asymptotically the information bound of the oracle estimator. A coordinate-wise algorithm is developed for finding the grid of solution paths for penalized hazard regression problems, and its performance is evaluated on simulated and gene association study examples.
引用
收藏
页码:3092 / 3120
页数:29
相关论文
共 38 条
  • [1] COX REGRESSION-MODEL FOR COUNTING-PROCESSES - A LARGE SAMPLE STUDY
    ANDERSEN, PK
    GILL, RD
    [J]. ANNALS OF STATISTICS, 1982, 10 (04) : 1100 - 1120
  • [2] [Anonymous], 1991, Counting Processes and Survival Analysis
  • [3] [Anonymous], 2003, NONLINEAR PROGRAMMIN
  • [4] Bhatia Rajendra, 1997, Matrix Analysis: Graduate Texts in Mathematics, V169
  • [5] SIMULTANEOUS ANALYSIS OF LASSO AND DANTZIG SELECTOR
    Bickel, Peter J.
    Ritov, Ya'acov
    Tsybakov, Alexandre B.
    [J]. ANNALS OF STATISTICS, 2009, 37 (04) : 1705 - 1732
  • [6] BRADIC J., 2011, REGULARIZATION COX S, DOI [10.1214/11-AOS911SUPP, DOI 10.1214/11-AOS911SUPP]
  • [7] Penalized composite quasi-likelihood for ultrahigh dimensional variable selection
    Bradic, Jelena
    Fan, Jianqing
    Wang, Weiwei
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2011, 73 : 325 - 349
  • [8] Sparsity oracle inequalities for the Lasso
    Bunea, Florentina
    Tsybakov, Alexandre
    Wegkamp, Marten
    [J]. ELECTRONIC JOURNAL OF STATISTICS, 2007, 1 : 169 - 194
  • [9] Vairiable selection for multivariate failure time data
    Cai, JW
    Fan, JQ
    Li, RZ
    Zhou, HB
    [J]. BIOMETRIKA, 2005, 92 (02) : 303 - 316
  • [10] Candes E, 2007, ANN STAT, V35, P2313, DOI 10.1214/009053606000001523