Buckley-James Boosting for Survival Analysis with High-Dimensional Biomarker Data

被引:33
作者
Wang, Zhu [1 ]
Wang, C. Y. [2 ]
机构
[1] Yale Univ, New Haven, CT 06520 USA
[2] Fred Hutchinson Canc Res Ctr, Seattle, WA 98109 USA
基金
美国国家卫生研究院;
关键词
boosting; accelerated failure time model; Buckley-James estimator; censored survival data; LASSO; variable selection; HIGH-ORDER INTERACTIONS; PARTIAL LEAST-SQUARES; REGULARIZED ESTIMATION; MICROARRAY DATA; REGRESSION; FAILURE; MODELS; GENES; REDUCTION; SELECTION;
D O I
10.2202/1544-6115.1550
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
There has been increasing interest in predicting patients' survival after therapy by investigating gene expression microarray data. In the regression and classification models with high-dimensional genomic data, boosting has been successfully applied to build accurate predictive models and conduct variable selection simultaneously. We propose the Buckley-James boosting for the semiparametric accelerated failure time models with right censored survival data, which can be used to predict survival of future patients using the high-dimensional genomic data. In the spirit of adaptive LASSO, twin boosting is also incorporated to fit more sparse models. The proposed methods have a unified approach to fit linear models, non-linear effects models with possible interactions. The methods can perform variable selection and parameter estimation simultaneously. The proposed methods are evaluated by simulations and applied to a recent microarray gene expression data set for patients with diffuse large B-cell lymphoma under the current gold standard therapy.
引用
收藏
页数:33
相关论文
共 71 条
[51]  
R Core Team, 2020, R foundation for statistical computing Computer software
[52]   A CONVERSATION WITH COX,DAVID [J].
REID, N ;
COX, D .
STATISTICAL SCIENCE, 1994, 9 (03) :439-455
[53]   Array-based DNA methylation profiling of primary lymphomas of the central nervous system [J].
Richter, Julia ;
Ammerpohl, Ole ;
Martin-Subero, Jose I. ;
Montesinos-Rongen, Manuel ;
Bibikova, Marina ;
Wickham-Garcia, Eliza ;
Wiestler, Otmar D. ;
Deckert, Martina ;
Siebert, Reiner .
BMC CANCER, 2009, 9
[54]  
Ridgeway Greg., 1999, COMP SCI STAT, V31, P172, DOI DOI 10.1016/J.APENERGY.2017.09.060
[55]   Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer [J].
Ritchie, MD ;
Hahn, LW ;
Roodi, N ;
Bailey, LR ;
Dupont, WD ;
Parl, FF ;
Moore, JH .
AMERICAN JOURNAL OF HUMAN GENETICS, 2001, 69 (01) :138-147
[56]   The use of molecular profiling to predict survival after chemotherapy for diffuse large-B-cell lymphoma [J].
Rosenwald, A ;
Wright, G ;
Chan, WC ;
Connors, JM ;
Campo, E ;
Fisher, RI ;
Gascoyne, RD ;
Muller-Hermelink, HK ;
Smeland, EB ;
Staudt, LM .
NEW ENGLAND JOURNAL OF MEDICINE, 2002, 346 (25) :1937-1947
[57]   Flexible boosting of accelerated failure time models [J].
Schmid, Matthias ;
Hothorn, Torsten .
BMC BIOINFORMATICS, 2008, 9 (1)
[58]   Microarray gene expression data with linked survival phenotypes: diffuse large-B-cell lymphoma revisited [J].
Segal, MR .
BIOSTATISTICS, 2006, 7 (02) :268-285
[59]   Regression approaches for microarray data analysis [J].
Segal, MR ;
Dahlquist, KD ;
Conklin, BR .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2003, 10 (06) :961-980
[60]   Genes, environment, and cardiovascular disease [J].
Sing, CF ;
Stengård, JH ;
Kardia, SLR .
ARTERIOSCLEROSIS THROMBOSIS AND VASCULAR BIOLOGY, 2003, 23 (07) :1190-1196