The spike-and-slab lasso Cox model for survival prediction and associated genes detection

被引:47
作者
Tang, Zaixiang [1 ,2 ,3 ]
Shen, Yueping [1 ,2 ,3 ]
Zhang, Xinyan [4 ]
Yi, Nengjun [4 ]
机构
[1] Soochow Univ, Sch Publ Hlth, Dept Biostat, Suzhou 215123, Peoples R China
[2] Soochow Univ, Coll Med, Jiangsu Key Lab Prevent & Translat Med Geriatr Di, Suzhou 215123, Peoples R China
[3] Soochow Univ, Coll Med, Ctr Genet Epidemiol & Genom, Suzhou 215123, Peoples R China
[4] Univ Alabama Birmingham, Dept Biostat, Birmingham, AL 35294 USA
基金
中国国家自然科学基金; 美国国家卫生研究院;
关键词
BAYESIAN VARIABLE SELECTION; GENERALIZED LINEAR-MODELS; REGULARIZATION PATHS; REGRESSION; SHRINKAGE;
D O I
10.1093/bioinformatics/btx300
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Large-scale molecular profiling data have offered extraordinary opportunities to improve survival prediction of cancers and other diseases and to detect disease associated genes. However, there are considerable challenges in analyzing large-scale molecular data. Results: We propose new Bayesian hierarchical Cox proportional hazards models, called the spike-and-slab lasso Cox, for predicting survival outcomes and detecting associated genes. We also develop an efficient algorithm to fit the proposed models by incorporating Expectation-Maximization steps into the extremely fast cyclic coordinate descent algorithm. The performance of the proposed method is assessed via extensive simulations and compared with the lasso Cox regression. We demonstrate the proposed procedure on two cancer datasets with censored survival outcomes and thousands of molecular features. Our analyses suggest that the proposed procedure can generate powerful prognostic models for predicting cancer survival and can detect associated genes. Availability and implementation: The methods have been implemented in a freely available R package BhGLM (http://www.ssg.uab.edu/bhglm/). Contact:nyi@uab.edu Supplementary information: Supplementary data are available at Bioinformatics online.
引用
收藏
页码:2799 / 2807
页数:9
相关论文
共 48 条
  • [1] [Anonymous], 2002, Statistical Applications in Genetics and Molecular Biology, DOI DOI 10.2202/1544-6115.1000
  • [2] [Anonymous], 2019, Statistical learning with sparsity: the lasso and generalizations
  • [3] Barillot E., 2012, Computational systems biology of cancer
  • [4] Bayesian ensemble methods for survival prediction in gene expression data
    Bonato, Vinicius
    Baladandayuthapani, Veerabhadran
    Broom, Bradley M.
    Sulman, Erik P.
    Aldape, Kenneth D.
    Do, Kim-Anh
    [J]. BIOINFORMATICS, 2011, 27 (03) : 359 - 367
  • [5] Predicting survival from microarray data -: a comparative study
    Bovelstad, H. M.
    Nygard, S.
    Storvold, H. L.
    Aldrin, M.
    Borgan, O.
    Frigessi, A.
    Lingjaerde, O. C.
    [J]. BIOINFORMATICS, 2007, 23 (16) : 2080 - 2087
  • [6] Survival prediction from clinico-genomic models - a comparative study
    Bovelstad, Hege M.
    Nygard, Stale
    Borgan, Ornulf
    [J]. BMC BIOINFORMATICS, 2009, 10
  • [7] COVARIANCE ANALYSIS OF CENSORED SURVIVAL DATA
    BRESLOW, N
    [J]. BIOMETRICS, 1974, 30 (01) : 89 - 99
  • [8] Breslow N. E., 1972, Journal of the Royal Statistical Society. Series B (Statistical Methodology), V34, P216
  • [9] Chen M-H., 2001, Bayesian Survival Analysis
  • [10] Cancer genomics: from discovery science to personalized medicine
    Chin, Lynda
    Andersen, Jannik N.
    Futreal, P. Andrew
    [J]. NATURE MEDICINE, 2011, 17 (03) : 297 - 303