The spike-and-slab lasso Cox model for survival prediction and associated genes detection

被引:47
作者
Tang, Zaixiang [1 ,2 ,3 ]
Shen, Yueping [1 ,2 ,3 ]
Zhang, Xinyan [4 ]
Yi, Nengjun [4 ]
机构
[1] Soochow Univ, Sch Publ Hlth, Dept Biostat, Suzhou 215123, Peoples R China
[2] Soochow Univ, Coll Med, Jiangsu Key Lab Prevent & Translat Med Geriatr Di, Suzhou 215123, Peoples R China
[3] Soochow Univ, Coll Med, Ctr Genet Epidemiol & Genom, Suzhou 215123, Peoples R China
[4] Univ Alabama Birmingham, Dept Biostat, Birmingham, AL 35294 USA
基金
中国国家自然科学基金; 美国国家卫生研究院;
关键词
BAYESIAN VARIABLE SELECTION; GENERALIZED LINEAR-MODELS; REGULARIZATION PATHS; REGRESSION; SHRINKAGE;
D O I
10.1093/bioinformatics/btx300
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Large-scale molecular profiling data have offered extraordinary opportunities to improve survival prediction of cancers and other diseases and to detect disease associated genes. However, there are considerable challenges in analyzing large-scale molecular data. Results: We propose new Bayesian hierarchical Cox proportional hazards models, called the spike-and-slab lasso Cox, for predicting survival outcomes and detecting associated genes. We also develop an efficient algorithm to fit the proposed models by incorporating Expectation-Maximization steps into the extremely fast cyclic coordinate descent algorithm. The performance of the proposed method is assessed via extensive simulations and compared with the lasso Cox regression. We demonstrate the proposed procedure on two cancer datasets with censored survival outcomes and thousands of molecular features. Our analyses suggest that the proposed procedure can generate powerful prognostic models for predicting cancer survival and can detect associated genes. Availability and implementation: The methods have been implemented in a freely available R package BhGLM (http://www.ssg.uab.edu/bhglm/). Contact:nyi@uab.edu Supplementary information: Supplementary data are available at Bioinformatics online.
引用
收藏
页码:2799 / 2807
页数:9
相关论文
共 48 条
  • [31] EMVS: The EM Approach to Bayesian Variable Selection
    Rockova, Veronika
    George, Edward I.
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2014, 109 (506) : 828 - 846
  • [32] Simon N, 2011, J STAT SOFTW, V39, P1
  • [33] Predictive Modeling Using a Somatic Mutational Profile in Ovarian High Grade Serous Carcinoma
    Sohn, Insuk
    Sung, Chang Ohk
    [J]. PLOS ONE, 2013, 8 (01):
  • [34] A BAYESIAN GRAPHICAL MODELING APPROACH TO MICRORNA REGULATORY NETWORK INFERENCE
    Stingo, Francesco C.
    Chen, Yian A.
    Vannucci, Marina
    Barrier, Marianne
    Mirkes, Philip E.
    [J]. ANNALS OF APPLIED STATISTICS, 2010, 4 (04) : 2024 - 2048
  • [35] Tai F., 2009, Bayesian Variable Selection in Regression with Networked Predictors
  • [36] The Spike-and-Slab Lasso Generalized Linear Models for Prediction and Associated Genes Detection
    Tang, Zaixiang
    Shen, Yueping
    Zhang, Xinyan
    Yi, Nengjun
    [J]. GENETICS, 2017, 205 (01) : 77 - +
  • [37] Tibshirani R, 1997, STAT MED, V16, P385, DOI 10.1002/(SICI)1097-0258(19970228)16:4<385::AID-SIM380>3.0.CO
  • [38] 2-3
  • [40] A gene-expression signature as a predictor of survival in breast cancer.
    van de Vijver, MJ
    He, YD
    van 't Veer, LJ
    Dai, H
    Hart, AAM
    Voskuil, DW
    Schreiber, GJ
    Peterse, JL
    Roberts, C
    Marton, MJ
    Parrish, M
    Atsma, D
    Witteveen, A
    Glas, A
    Delahaye, L
    van der Velde, T
    Bartelink, H
    Rodenhuis, S
    Rutgers, ET
    Friend, SH
    Bernards, R
    [J]. NEW ENGLAND JOURNAL OF MEDICINE, 2002, 347 (25) : 1999 - 2009