A Fast and Accurate Method for Genome-wide Scale Phenome-wide G x E Analysis and Its Application to UK Biobank

被引:19
作者
Bi, Wenjian [1 ,2 ]
Zhao, Zhangchen [1 ,2 ]
Dey, Rounak [1 ,2 ,3 ]
Fritsche, Lars G. [1 ,2 ]
Mukherjee, Bhramar [1 ]
Lee, Seunggeun [1 ,2 ]
机构
[1] Univ Michigan, Dept Biostat, Ann Arbor, MI 48109 USA
[2] Univ Michigan, Ctr Stat Genet, Ann Arbor, MI 48109 USA
[3] Harvard TH Chan Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA
基金
美国国家卫生研究院;
关键词
GENE-ENVIRONMENT INTERACTION; MIXED-MODEL ANALYSIS; ASSOCIATION; DISEASE; SMOKING; RISK; INFERENCE; VARIANT; TRAITS; GENDER;
D O I
10.1016/j.ajhg.2019.10.008
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
The etiology of most complex diseases involves genetic variants, environmental factors, and gene-environment interaction (G x F.) effects. Compared with marginal genetic association studies, G x F. analysis requires more samples and detailed measure of environmental exposures, and this limits the possible discoveries. Large-scale population-based biobanks with detailed phenotypic and environmental information, such as UK-Biobank, can be ideal resources for identifying G x F. effects. However, due to the large computation cost and the presence of case-control imbalance, existing methods often fail. Here we propose a scalable and accurate method, SPAGE (SaddlePoint Approximation implementation of G x F. analysis), that is applicable for genome-wide scale phenome-wide G x F. studies. SPAGE fits a genotype-independent logistic model only once across the genome-wide analysis in order to reduce computation cost, and SPAGE uses a saddlepoint approximation (SPA) to calibrate the test statistics for analysis of phenotypes with unbalanced case-control ratios. Simulation studies show that SPAGE is 33-79 times faster than the Wald test and 72-439 times faster than the Firth's test, and SPAGE can control type I error rates at the genome-wide significance level even when case-control ratios are extremely unbalanced. Through the analysis of UK-Biobank data of 344,341 white British European-ancestry samples, we show that SPAGE can efficiently analyze large samples while controlling for unbalanced case-control ratios.
引用
收藏
页码:1182 / 1192
页数:11
相关论文
共 53 条
[41]   UK Biobank: An Open Access Resource for Identifying the Causes of a Wide Range of Complex Diseases of Middle and Old Age [J].
Sudlow, Cathie ;
Gallacher, John ;
Allen, Naomi ;
Beral, Valerie ;
Burton, Paul ;
Danesh, John ;
Downey, Paul ;
Elliott, Paul ;
Green, Jane ;
Landray, Martin ;
Liu, Bette ;
Matthews, Paul ;
Ong, Giok ;
Pell, Jill ;
Silman, Alan ;
Young, Alan ;
Sprosen, Tim ;
Peakman, Tim ;
Collins, Rory .
PLOS MEDICINE, 2015, 12 (03)
[42]   Methods for Investigating Gene-Environment Interactions in Candidate Pathway and Genome-Wide Association Studies [J].
Thomas, Duncan .
ANNUAL REVIEW OF PUBLIC HEALTH, VOL 31, 2010, 31 :21-36
[43]   Gene-environment-wide association studies: emerging approaches [J].
Thomas, Duncan .
NATURE REVIEWS GENETICS, 2010, 11 (04) :259-272
[44]   EFFECT MODIFICATION AND THE LIMITS OF BIOLOGICAL INFERENCE FROM EPIDEMIOLOGIC DATA [J].
THOMPSON, WD .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 1991, 44 (03) :221-232
[45]   Genome-wide gene-environment interaction in depression: A systematic evaluation of candidate genes: The childhood trauma working-group of PGC-MDD [J].
Van der Auwera, Sandra ;
Peyrot, Wouter J. ;
Milaneschi, Yuri ;
Hertel, Johannes ;
Baune, Bernhard ;
Breen, Gerome ;
Byrne, Enda ;
Dunn, Erin C. ;
Fisher, Helen ;
Homuth, Georg ;
Levinson, Douglas ;
Lewis, Cathryn ;
Mills, Natalie ;
Mullins, Niamh ;
Nauck, Matthias ;
Pistis, Giorgio ;
Preisig, Martin ;
Rietschel, Marcella ;
Ripke, Stephan ;
Sullivan, Patrick ;
Teumer, Alexander ;
Voelzke, Henry ;
Boomsma, Dorret I. ;
Wray, Naomi R. ;
Penninx, Brenda ;
Grabe, Hans .
AMERICAN JOURNAL OF MEDICAL GENETICS PART B-NEUROPSYCHIATRIC GENETICS, 2018, 177 (01) :40-49
[46]  
Villareal RP, 2001, TEX HEART I J, V28, P265
[47]   Mediating Effects of Smoking and Chronic Obstructive Pulmonary Disease on the Relation Between the CHRNA5-A3 Genetic Locus and Lung Cancer Risk [J].
Wang, Jian ;
Spitz, Margaret R. ;
Amos, Christopher I. ;
Wilkinson, Anna V. ;
Wu, Xifeng ;
Shete, Sanjay .
CANCER, 2010, 116 (14) :3458-3462
[48]   Gene-environment interactions in genome-wide association studies: current approaches and new directions [J].
Winham, Stacey J. ;
Biernacka, Joanna M. .
JOURNAL OF CHILD PSYCHOLOGY AND PSYCHIATRY, 2013, 54 (10) :1120-1134
[49]   Gender differences in arrhythmias [J].
Wolbrette, D ;
Naccarelli, G ;
Curtis, A ;
Lehmann, M ;
Kadish, A .
CLINICAL CARDIOLOGY, 2002, 25 (02) :49-56
[50]   Electronic health records: the next wave of complex disease genetics [J].
Wolford, Brooke N. ;
Willer, Cristen J. ;
Surakka, Ida .
HUMAN MOLECULAR GENETICS, 2018, 27 (R1) :R14-R21