Association Analysis and Meta-Analysis of Multi-Allelic Variants for Large-Scale Sequence Data

Cited by: 5
Authors
Jiang, Yu [1]
Chen, Sai [2]
Wang, Xingyan [1]
Liu, Mengzhen [3]
Iacono, William G. [4]
Hewitt, John K. [5]
Hokanson, John E. [6]
Krauter, Kenneth [5]
Laakso, Markku [7, 8]
Li, Kevin W. [9]
Lutz, Sharon M. [10]
McGue, Matthew [3]
Pandit, Anita [9]
Zajac, Gregory J. M. [9]
Boehnke, Michael [9]
Abecasis, Goncalo R. [9]
Vrieze, Scott I. [3]
Jiang, Bibo [1]
Zhan, Xiaowei [11]
Liu, Dajiang J. [1]
Affiliations
[1] Penn State Coll Med, Dept Publ Hlth Sci, Hershey, PA 17033 USA
[2] Illumina Inc, 5200 Illumina Way, San Diego, CA 92122 USA
[3] Univ Minnesota, Dept Psychol, Minneapolis, MN 55454 USA
[4] Univ Minnesota, Dept Psychiat, Minneapolis, MN 55454 USA
[5] Univ Colorado Boulder, Inst Behav Genet, Aurora, CO 80045 USA
[6] Univ Colorado Denver, Sch Publ Hlth, Dept Epidemiol, Aurora, CO 80045 USA
[7] Univ Eastern Finland, Dept Med, Kuopio 70211, Finland
[8] Kuopio Univ Hosp, Kuopio 70211, Finland
[9] Univ Michigan, Ctr Stat Genet, Dept Biostat, Ann Arbor, MI 48109 USA
[10] Univ Colorado, Dept Biostat & Informat, Anschutz Med Campus, Aurora, CO 80045 USA
[11] Univ Texas Southwestern Med Ctr Dallas, Quantitat Biomed Res Ctr, Dept Clin Sci, Dallas, TX 75390 USA
Keywords
multi-allelic variants; GWAS; meta-analysis; smoking; RARE VARIANTS; GENOTYPE IMPUTATION; GENERAL FRAMEWORK; PROTEIN; RISK; TOOL;
DOI
10.3390/genes11050586
Chinese Library Classification number
Q3 [Genetics]
Subject classification code
071007; 090102
Abstract
There is great interest in understanding the impact of rare variants in human diseases using large sequence datasets. In deep sequence datasets of >10,000 samples, approximately 10% of the variant sites are observed to be multi-allelic. Many of the multi-allelic variants have been shown to be functional and disease-relevant. Proper analysis of multi-allelic variants is critical to the success of a sequencing study, but existing methods do not properly handle multi-allelic variants and can produce highly misleading association results. We discuss practical issues and methods to encode multi-allelic sites, conduct single-variant and gene-level association analyses, and perform meta-analysis for multi-allelic variants. We evaluated these methods through extensive simulations and the study of a large meta-analysis of approximately 18,000 samples on the cigarettes-per-day phenotype. We showed that our joint modeling approach provided an unbiased estimate of genetic effects, greatly improved the power of single-variant association tests among methods that can properly estimate allele effects, and enhanced gene-level tests over existing approaches. Software packages implementing these methods are available online.
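The abstract mentions encoding multi-allelic sites and jointly modeling allele effects without giving details. As a rough illustration only, the sketch below shows one plausible reading: each alternate allele at a site gets its own count column, and all columns are fit together in a single linear model. The function names `encode_multiallelic` and `joint_allele_effects`, the toy genotypes, and the choice of ordinary least squares are assumptions made for this example; they are not taken from the paper or its software.

```python
import numpy as np


def encode_multiallelic(genotypes, n_alt):
    """Hypothetical helper: per-alternate-allele counts for one site.

    genotypes: list of (a1, a2) allele indices per diploid sample,
               where 0 is the reference and 1..n_alt are alternates.
    Returns an (n_samples, n_alt) array; column k-1 holds the number
    of copies of alternate allele k carried by each sample.
    """
    dosage = np.zeros((len(genotypes), n_alt))
    for i, pair in enumerate(genotypes):
        for allele in pair:
            if allele > 0:
                dosage[i, allele - 1] += 1
    return dosage


def joint_allele_effects(dosage, phenotype):
    """Hypothetical helper: least-squares fit of the phenotype on an
    intercept plus all alternate-allele count columns at once."""
    design = np.column_stack([np.ones(len(phenotype)), dosage])
    coef, *_ = np.linalg.lstsq(design, phenotype, rcond=None)
    return coef[1:]  # drop the intercept, keep one effect per allele


# Toy tri-allelic site (reference plus two alternates), six samples.
genotypes = [(0, 0), (0, 1), (1, 2), (0, 2), (2, 2), (1, 1)]
dosage = encode_multiallelic(genotypes, n_alt=2)

# Noise-free phenotype built from assumed effects of +1 and -2.
phenotype = 10.0 + 1.0 * dosage[:, 0] - 2.0 * dosage[:, 1]
effects = joint_allele_effects(dosage, phenotype)
```

Because the toy phenotype is constructed without noise, the joint fit recovers the assumed per-allele effects. Real analyses would also need covariates, relatedness adjustment, and standard errors, none of which this sketch attempts.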
Pages: 16