Bayesian Semiparametric Meta-Analysis for Genetic Association Studies

被引:3
作者
De Iorio, Maria [1 ]
Newcombe, Paul J. [2 ]
Tachmazidou, Ioanna [3 ]
Verzilli, Claudio J. [2 ,4 ]
Whittaker, John C. [2 ,4 ]
机构
[1] Univ London Imperial Coll Sci Technol & Med, Dept Epidemiol & Biostat, London W2 1PG, England
[2] GlaxoSmithKline, London, England
[3] MRC, Biostat Unit, Cambridge CB2 2BW, England
[4] London Sch Hyg & Trop Med, London, England
基金
英国医学研究理事会;
关键词
delta method; Dirichlet process; linkage disequilibrium; multivariate multinomial logit distribution; SNP data; GENOME-WIDE ASSOCIATION; 5-LIPOXYGENASE ACTIVATING PROTEIN; ISCHEMIC-STROKE; PHOSPHODIESTERASE; 4D; PDE4D GENE; CONFERS RISK; POPULATION; POLYMORPHISMS; INFARCTION; REGION;
D O I
10.1002/gepi.20581
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
We present a Bayesian semiparametric model for the meta-analysis of candidate gene studies with a binary outcome. Such studies often report results from association tests for different, possibly study-specific and non-overlapping genetic markers in the same genetic region. Meta-analyses of the results at each marker in isolation are seldom appropriate as they ignore the correlation that may exist between markers due to linkage disequilibrium (LD) and cannot assess the relative importance of variants at each marker. Also such marker-wise meta-analyses are restricted to only those studies that have typed the marker in question, with a potential loss of power. A better strategy is one which incorporates information about the LD between markers so that any combined estimate of the effect of each variant is corrected for the effect of other variants, as in multiple regression. Here we develop a Bayesian semiparametric model which models the observed genotype group frequencies conditional to the case/control status and uses pairwise LD measurements between markers as prior information to make posterior inference on adjusted effects. The approach allows borrowing of strength across studies and across markers. The analysis is based on a mixture of Dirichlet processes model as the underlying semiparametric model. Full posterior inference is performed through Markov chain Monte Carlo algorithms. The approach is demonstrated on simulated and real data. Genet. Epidemiol. 35: 333-340, 2011. (C) 2011 Wiley-Liss, Inc.
引用
收藏
页码:333 / 340
页数:8
相关论文
共 47 条
[1]   A haplotype map of the human genome [J].
Altshuler, D ;
Brooks, LD ;
Chakravarti, A ;
Collins, FS ;
Daly, MJ ;
Donnelly, P ;
Gibbs, RA ;
Belmont, JW ;
Boudreau, A ;
Leal, SM ;
Hardenbol, P ;
Pasternak, S ;
Wheeler, DA ;
Willis, TD ;
Yu, FL ;
Yang, HM ;
Zeng, CQ ;
Gao, Y ;
Hu, HR ;
Hu, WT ;
Li, CH ;
Lin, W ;
Liu, SQ ;
Pan, H ;
Tang, XL ;
Wang, J ;
Wang, W ;
Yu, J ;
Zhang, B ;
Zhang, QR ;
Zhao, HB ;
Zhao, H ;
Zhou, J ;
Gabriel, SB ;
Barry, R ;
Blumenstiel, B ;
Camargo, A ;
Defelice, M ;
Faggart, M ;
Goyette, M ;
Gupta, S ;
Moore, J ;
Nguyen, H ;
Onofrio, RC ;
Parkin, M ;
Roy, J ;
Stahl, E ;
Winchester, E ;
Ziaugra, L ;
Shen, Y .
NATURE, 2005, 437 (7063) :1299-1320
[2]   Genetic liability in stroke - A long-term follow-up study of Danish twins [J].
Bak, S ;
Gaist, D ;
Sindrup, SH ;
Skytthe, A ;
Christensen, K .
STROKE, 2002, 33 (03) :769-774
[3]   Variation in the PDE4D gene and ischemic stroke risk - A systematic review and meta-analysis on 5200 cases and 6600 controls [J].
Bevan, Steve ;
Dichgans, Martin ;
Gschwendtner, Andreas ;
Kuhlenbaeumer, Gregor ;
Ringelstein, E. B. ;
Markus, Hugh S. .
STROKE, 2008, 39 (07) :1966-1971
[4]  
Bishop M.M., 1975, DISCRETE MULTIVARIAT
[5]   A STUDY OF TWINS AND STROKE [J].
BRASS, LM ;
ISAACSOHN, JL ;
MERIKANGAS, KR ;
ROBINETTE, CD .
STROKE, 1992, 23 (02) :221-223
[6]   Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls [J].
Burton, Paul R. ;
Clayton, David G. ;
Cardon, Lon R. ;
Craddock, Nick ;
Deloukas, Panos ;
Duncanson, Audrey ;
Kwiatkowski, Dominic P. ;
McCarthy, Mark I. ;
Ouwehand, Willem H. ;
Samani, Nilesh J. ;
Todd, John A. ;
Donnelly, Peter ;
Barrett, Jeffrey C. ;
Davison, Dan ;
Easton, Doug ;
Evans, David ;
Leung, Hin-Tak ;
Marchini, Jonathan L. ;
Morris, Andrew P. ;
Spencer, Chris C. A. ;
Tobin, Martin D. ;
Attwood, Antony P. ;
Boorman, James P. ;
Cant, Barbara ;
Everson, Ursula ;
Hussey, Judith M. ;
Jolley, Jennifer D. ;
Knight, Alexandra S. ;
Koch, Kerstin ;
Meech, Elizabeth ;
Nutland, Sarah ;
Prowse, Christopher V. ;
Stevens, Helen E. ;
Taylor, Niall C. ;
Walters, Graham R. ;
Walker, Neil M. ;
Watkins, Nicholas A. ;
Winzer, Thilo ;
Jones, Richard W. ;
McArdle, Wendy L. ;
Ring, Susan M. ;
Strachan, David P. ;
Pembrey, Marcus ;
Breen, Gerome ;
St Clair, David ;
Caesar, Sian ;
Gordon-Smith, Katherine ;
Jones, Lisa ;
Fraser, Christine ;
Green, Elain K. .
NATURE, 2007, 447 (7145) :661-678
[7]   Finishing the euchromatic sequence of the human genome [J].
Collins, FS ;
Lander, ES ;
Rogers, J ;
Waterston, RH .
NATURE, 2004, 431 (7011) :931-945
[8]   Random effects modeling of multiple binomial responses using the multivariate binomial logit-normal distribution [J].
Coull, BA ;
Agresti, A .
BIOMETRICS, 2000, 56 (01) :73-80
[9]   ESTIMATING NORMAL MEANS WITH A DIRICHLET PROCESS PRIOR [J].
ESCOBAR, MD .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1994, 89 (425) :268-277
[10]   Meta-Analysis in Genome-Wide Association Datasets: Strategies and Application in Parkinson Disease [J].
Evangelou, Evangelos ;
Maraganore, Demetrius M. ;
Ioannidis, John P. A. .
PLOS ONE, 2007, 2 (02)