Use of the gamma method for self-contained gene-set analysis of SNP data

被引:24
作者
Biernacka, Joanna M. [1 ,2 ]
Jenkins, Gregory D. [1 ]
Wang, Liewei [3 ]
Moyer, Ann M. [3 ]
Fridley, Brooke L. [1 ]
机构
[1] Mayo Clin, Dept Hlth Sci Res, Rochester, MN 55905 USA
[2] Mayo Clin & Mayo Fdn, Dept Psychiat & Psychol, Rochester, MN 55905 USA
[3] Mayo Clin, Dept Mol Pharmacol & Expt Therapeut, Rochester, MN 55905 USA
基金
美国国家卫生研究院;
关键词
Fisher's method; gamma method; principal components; gene-level association; pathway; random effects model; GENOME-WIDE ASSOCIATION; TESTING ASSOCIATION; MICROARRAY DATA; PHARMACOGENOMICS;
D O I
10.1038/ejhg.2011.236
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Gene-set analysis (GSA) evaluates the overall evidence of association between a phenotype and all genotyped single nucleotide polymorphisms (SNPs) in a set of genes, as opposed to testing for association between a phenotype and each SNP individually. We propose using the Gamma Method (GM) to combine gene-level P-values for assessing the significance of GS association. We performed simulations to compare the GM with several other self-contained GSA strategies, including both one-step and two-step GSA approaches, in a variety of scenarios. We denote a 'one-step' GSA approach to be one in which all SNPs in a GS are used to derive a test of GS association without consideration of gene-level effects, and a 'two-step' approach to be one in which all genotyped SNPs in a gene are first used to evaluate association of the phenotype with all measured variation in the gene and then the gene-level tests of association are aggregated to assess the GS association with the phenotype. The simulations suggest that, overall, two-step methods provide higher power than one-step approaches and that combining gene-level P-values using the GM with a soft truncation threshold between 0.05 and 0.20 is a powerful approach for conducting GSA, relative to the competing approaches assessed. We also applied all of the considered GSA methods to data from a pharmacogenomic study of cisplatin, and obtained evidence suggesting that the glutathione metabolism GS is associated with cisplatin drug response. European Journal of Human Genetics (2012) 20, 565-571; doi:10.1038/ejhg.2011.236; published online 14 December 2011
引用
收藏
页码:565 / 571
页数:7
相关论文
共 26 条
[1]   Microarray data analysis: from disarray to consolidation and consensus [J].
Allison, DB ;
Cui, XQ ;
Page, GP ;
Sabripour, M .
NATURE REVIEWS GENETICS, 2006, 7 (01) :55-65
[2]  
[Anonymous], 1932, STAT METHODS RES WOR
[3]   Insights into Colon Cancer Etiology via a Regularized Approach to Gene Set Analysis of GWAS Data [J].
Chen, Lin S. ;
Hutter, Carolyn M. ;
Potter, John D. ;
Liu, Yan ;
Prentice, Ross L. ;
Peters, Ulrike ;
Hsu, Li .
AMERICAN JOURNAL OF HUMAN GENETICS, 2010, 86 (06) :860-871
[4]   Gene, Region and Pathway Level Analyses in Whole-Genome Studies [J].
De la Cruz, Omar ;
Wen, Xiaoquan ;
Ke, Baoguan ;
Song, Minsun ;
Nicolae, Dan L. .
GENETIC EPIDEMIOLOGY, 2010, 34 (03) :222-231
[5]   Improving gene set analysis of microarray data by SAM-GS [J].
Dinu, Irina ;
Potter, John D. ;
Mueller, Thomas ;
Liu, Qi ;
Adewale, Adeniyi J. ;
Jhangri, Gian S. ;
Einecke, Gunilla ;
Famulski, Konrad S. ;
Halloran, Philip ;
Yasui, Yutaka .
BMC BIOINFORMATICS, 2007, 8 (1)
[6]   ON TESTING THE SIGNIFICANCE OF SETS OF GENES [J].
Efron, Bradley ;
Tibshirani, Robert .
ANNALS OF APPLIED STATISTICS, 2007, 1 (01) :107-129
[7]   ON FISHER METHOD OF COMBINING P-VALUES [J].
ELSTON, RC .
BIOMETRICAL JOURNAL, 1991, 33 (03) :339-345
[8]   Gene set analysis of SNP data: benefits, challenges, and future directions [J].
Fridley, Brooke L. ;
Biernacka, Joanna M. .
EUROPEAN JOURNAL OF HUMAN GENETICS, 2011, 19 (08) :837-843
[9]   Self-Contained Gene-Set Analysis of Expression Data: An Evaluation of Existing and Novel Methods [J].
Fridley, Brooke L. ;
Jenkins, Gregory D. ;
Biernacka, Joanna M. .
PLOS ONE, 2010, 5 (09) :1-9
[10]  
Gallant AR., 1987, NONLINEAR STAT MODEL