Analysis of Genome-Wide Association Studies with Multiple Outcomes Using Penalization

被引:2
作者
Liu, Jin [1 ]
Huang, Jian [2 ]
Ma, Shuangge [1 ]
机构
[1] Yale Univ, Sch Publ Hlth, Dept Biostat, New Haven, CT 06520 USA
[2] Univ Iowa, Dept Biostat, Dept Stat & Actuarial Sci, Iowa City, IA USA
关键词
MODEL SELECTION; COMPLEX TRAITS; REGRESSION;
D O I
10.1371/journal.pone.0051198
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Genome-wide association studies have been extensively conducted, searching for markers for biologically meaningful outcomes and phenotypes. Penalization methods have been adopted in the analysis of the joint effects of a large number of SNPs (single nucleotide polymorphisms) and marker identification. This study is partly motivated by the analysis of heterogeneous stock mice dataset, in which multiple correlated phenotypes and a large number of SNPs are available. Existing penalization methods designed to analyze a single response variable cannot accommodate the correlation among multiple response variables. With multiple response variables sharing the same set of markers, joint modeling is first employed to accommodate the correlation. The group Lasso approach is adopted to select markers associated with all the outcome variables. An efficient computational algorithm is developed. Simulation study and analysis of the heterogeneous stock mice dataset show that the proposed method can outperform existing penalization methods. Citation: Liu J, Huang J, Ma S (2012) Analysis of Genome-Wide Association Studies with Multiple Outcomes Using Penalization. PLoS ONE 7(12): e51198. doi: 10.1371/journal.pone.0051198
引用
收藏
页数:12
相关论文
共 18 条
[1]  
[Anonymous], 2006, Journal of the Royal Statistical Society, Series B
[2]   COORDINATE DESCENT ALGORITHMS FOR NONCONVEX PENALIZED REGRESSION, WITH APPLICATIONS TO BIOLOGICAL FEATURE SELECTION [J].
Breheny, Patrick ;
Huang, Jian .
ANNALS OF APPLIED STATISTICS, 2011, 5 (01) :232-253
[3]   Extended Bayesian information criteria for model selection with large model spaces [J].
Chen, Jiahua ;
Chen, Zehua .
BIOMETRIKA, 2008, 95 (03) :759-771
[4]   EXTENDED BIC FOR SMALL-n-LARGE-P SPARSE GLM [J].
Chen, Jiahua ;
Chen, Zehua .
STATISTICA SINICA, 2012, 22 (02) :555-574
[5]   Reduced rank stochastic regression with a sparse singular value decomposition [J].
Chen, Kun ;
Chan, Kung-Sik ;
Stenseth, Nils Chr. .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2012, 74 :203-221
[6]   Regularization Paths for Generalized Linear Models via Coordinate Descent [J].
Friedman, Jerome ;
Hastie, Trevor ;
Tibshirani, Rob .
JOURNAL OF STATISTICAL SOFTWARE, 2010, 33 (01) :1-22
[7]  
Huang J, 2011, STAT SINICA IN PRESS
[8]   GENOMIC SELECTION IN PLANT BREEDING: KNOWLEDGE AND PROSPECTS [J].
Lorenz, Aaron J. ;
Chao, Shiaoman ;
Asoro, Franco G. ;
Heffner, Elliot L. ;
Hayashi, Takeshi ;
Iwata, Hiroyoshi ;
Smith, Kevin P. ;
Sorrells, Mark E. ;
Jannink, Jean-Luc .
ADVANCES IN AGRONOMY, VOL 110, 2011, 110 :77-123
[9]   SparseNet: Coordinate Descent With Nonconvex Penalties [J].
Mazumder, Rahul ;
Friedman, Jerome H. ;
Hastie, Trevor .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2011, 106 (495) :1125-1138
[10]   p-Values for High-Dimensional Regression [J].
Meinshausen, Nicolai ;
Meier, Lukas ;
Buehlmann, Peter .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2009, 104 (488) :1671-1681