Variable selection for high-dimensional genomic data with censored outcomes using group lasso prior

被引:8
|
作者
Lee, Kyu Ha [1 ,2 ]
Chakraborty, Sounak [3 ]
Sun, Jianguo [3 ]
机构
[1] Forsyth Inst, Epidemiol & Biostat Core, Cambridge, MA USA
[2] Harvard Sch Dent Med, Dept Oral Hlth Policy & Epidemiol, Boston, MA USA
[3] Univ Missouri, Dept Stat, Columbia, MO 65211 USA
基金
美国国家科学基金会;
关键词
Accelerated failure time model; Bayesian lasso; Gibbs sampler; Group lasso; Penalized regression; FAILURE TIME MODEL; MICROARRAY DATA; SURVIVAL ANALYSIS; HAZARD RATIOS; ELASTIC NET; COX MODEL; REGRESSION; PREDICTION; SHRINKAGE;
D O I
10.1016/j.csda.2017.02.014
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The variable selection problem is discussed in the context of high-dimensional failure time data arising from the accelerated failure time model. A data augmentation approach is employed in order to deal with censored survival times and to facilitate prior-posterior conjugacy. To identify a set of grouped relevant covariates, a shrinkage prior distribution is specified for regression coefficients mimicking the effect of group lasso penalty. It is noted that unlike the corresponding frequentist method, a Bayesian penalized regression approach cannot shrink the estimates of coefficients to exact zeros in general. Towards resolving the issue, a two-stage thresholding method that exploits the scaled neighbor-hood criterion and the Bayesian information criterion is devised. Simulation studies are performed to assess the robustness and performance of the proposed method in terms of variable selection accuracy and predictive power. The method is successfully applied to a set of microarray data on the individuals diagnosed with diffuse large B-cell lymphoma. In addition, an R package called psbcGroup, which can be downloaded freely from CRAN, is developed for the implementation of the methods. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:1 / 13
页数:13
相关论文
共 50 条
  • [21] Robust Adaptive Lasso method for parameter's estimation and variable selection in high-dimensional sparse models
    Wahid, Abdul
    Khan, Dost Muhammad
    Hussain, Ijaz
    PLOS ONE, 2017, 12 (08):
  • [22] High-dimensional variable selection for ordinal outcomes with error control
    Fu, Han
    Archer, Kellie J.
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (01) : 334 - 345
  • [23] Variable selection and estimation in high-dimensional models
    Horowitz, Joel L.
    CANADIAN JOURNAL OF ECONOMICS-REVUE CANADIENNE D ECONOMIQUE, 2015, 48 (02): : 389 - 407
  • [24] The EAS approach to variable selection for multivariate response data in high-dimensional settings
    Koner, Salil
    Williams, Jonathan P.
    ELECTRONIC JOURNAL OF STATISTICS, 2023, 17 (02): : 1947 - 1995
  • [25] RANKING-BASED VARIABLE SELECTION FOR HIGH-DIMENSIONAL DATA
    Baranowski, Rafal
    Chen, Yining
    Fryzlewicz, Piotr
    STATISTICA SINICA, 2020, 30 (03) : 1485 - 1516
  • [26] High-dimensional genomic feature selection with the ordered stereotype logit model
    Seffernick, Anna Eames
    Mrozek, Krzysztof
    Nicolet, Deedra
    Stone, Richard M.
    Eisfeld, Ann-Kathrin
    Byrd, John C.
    Archer, Kellie J.
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (06)
  • [27] Controlled variable selection in Weibull mixture cure models for high-dimensional data
    Fu, Han
    Nicolet, Deedra
    Mrozek, Krzysztof
    Stone, Richard M.
    Eisfeld, Ann-Kathrin
    Byrd, John C.
    Archer, Kellie J.
    STATISTICS IN MEDICINE, 2022, 41 (22) : 4340 - 4366
  • [28] Bayesian variable selection in multinomial probit model for classifying high-dimensional data
    Yang, Aijun
    Li, Yunxian
    Tang, Niansheng
    Lin, Jinguan
    COMPUTATIONAL STATISTICS, 2015, 30 (02) : 399 - 418
  • [29] Group Lasso Estimation of High-dimensional Covariance Matrices
    Bigot, Jeremie
    Biscay, Rolando J.
    Loubes, Jean-Michel
    Muniz-Alvarez, Lilian
    JOURNAL OF MACHINE LEARNING RESEARCH, 2011, 12 : 3187 - 3225
  • [30] Concave group methods for variable selection and estimation in high-dimensional varying coefficient models
    Yang GuangRen
    Huang Jian
    Zhou Yong
    SCIENCE CHINA-MATHEMATICS, 2014, 57 (10) : 2073 - 2090