Identification of breast cancer prognosis markers via integrative analysis

被引:6
作者
Ma, Shuangge [1 ]
Dai, Ying [1 ]
Huang, Jian [2 ]
Xie, Yang [3 ]
机构
[1] Yale Univ, Sch Publ Hlth, New Haven, CT 06520 USA
[2] Univ Iowa, Dept Stat & Actuarial Sci & Biostat, Iowa City, IA 52242 USA
[3] UT SW Med Ctr, Dept Clin Sci, Dallas, TX USA
基金
美国国家科学基金会;
关键词
Breast cancer prognosis; Gene expression; Marker identification; Integrative analysis; 2-norm group bridge; DIMENSIONAL LINEAR-REGRESSION; FAILURE TIME MODEL; MICROARRAY DATA; VARIABLE SELECTION; HISTOLOGIC GRADE; EXPRESSION; SURVIVAL; LASSO; SIGNATURE; PROFILES;
D O I
10.1016/j.csda.2012.02.017
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In breast cancer research, it is of great interest to identify genomic markers associated with prognosis. Multiple gene profiling studies have been conducted for such a purpose. Genomic markers identified from the analysis of single datasets often do not have satisfactory reproducibility. Among the multiple possible reasons, the most important one is the small sample sizes of individual studies. A cost-effective solution is to pool data from multiple comparable studies and conduct integrative analysis. In this study, we collect four breast cancer prognosis studies with gene expression measurements. We describe the relationship between prognosis and gene expressions using the accelerated failure time (AFT) models. We adopt a 2-norm group bridge penalization approach for marker identification. This integrative analysis approach can effectively identify markers with consistent effects across multiple datasets and naturally accommodate the heterogeneity among studies. Statistical and simulation studies demonstrate satisfactory performance of this approach. Breast cancer prognosis markers identified using this approach have sound biological implications and satisfactory prediction performance. (C) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:2718 / 2728
页数:11
相关论文
共 34 条
[1]  
BUCKLEY J, 1979, BIOMETRIKA, V66, P429
[2]   Gene expression profiling of breast cancer [J].
Cheang, Maggie C. U. ;
van de Rijn, Matt ;
Nielsen, Torsten O. .
ANNUAL REVIEW OF PATHOLOGY-MECHANISMS OF DISEASE, 2008, 3 :67-97
[3]   Predicting patient survival from microarray data by accelerated failure time modeling using partial least squares and LASSO [J].
Datta, Susmita ;
Le-Rademacher, Jennifer ;
Datta, Somnath .
BIOMETRICS, 2007, 63 (01) :259-271
[4]  
Friedman J., 2010, NOTE GROUP LASSO SPA, P1, DOI [10.1007/978-1-0716-2237-7_18, DOI 10.1007/978-1-0716-2237-7_18]
[5]  
Guerra Rudy., 2009, Meta‐analysis and combining information in genetics and genomics
[6]   Gene expression predictors of breast cancer outcomes [J].
Huang, E ;
Cheng, SH ;
Dressman, H ;
Pittman, J ;
Tsou, MH ;
Horng, CF ;
Bild, A ;
Iversen, ES ;
Liao, M ;
Chen, CM ;
West, M ;
Nevins, JR ;
Huang, AT .
LANCET, 2003, 361 (9369) :1590-1596
[7]   Asymptotic properties of bridge estimators in sparse high-dimensional regression models [J].
Huang, Jian ;
Horowitz, Joel L. ;
Ma, Shuangge .
ANNALS OF STATISTICS, 2008, 36 (02) :587-613
[8]   Regularized estimation in the accelerated failure time model with high-dimensional covariates [J].
Huang, Jian ;
Ma, Shuangge ;
Xie, Huiliang .
BIOMETRICS, 2006, 62 (03) :813-820
[9]   Variable selection in the accelerated failure time model via the bridge method [J].
Huang, Jian ;
Ma, Shuangge .
LIFETIME DATA ANALYSIS, 2010, 16 (02) :176-195
[10]   A group bridge approach for variable selection [J].
Huang, Jian ;
Ma, Shuange ;
Xie, Huiliang ;
Zhang, Cun-Hui .
BIOMETRIKA, 2009, 96 (02) :339-355