Performance of Genotype Imputation for Rare Variants Identified in Exons and Flanking Regions of Genes

被引:39
作者
Li, Li [1 ,2 ,3 ]
Li, Yun [4 ]
Browning, Sharon R. [5 ]
Browning, Brian L. [6 ]
Slater, Andrew J. [1 ,2 ,3 ]
Kong, Xiangyang [1 ,2 ,3 ]
Aponte, Jennifer L. [1 ,2 ,3 ]
Mooser, Vincent E. [1 ,2 ,3 ]
Chissoe, Stephanie L. [1 ,2 ,3 ]
Whittaker, John C. [1 ,2 ,3 ]
Nelson, Matthew R. [1 ,2 ,3 ]
Ehm, Margaret Gelder [1 ,2 ,3 ]
机构
[1] GlaxoSmithKline, Genet, Res Triangle Pk, NC 27709 USA
[2] GlaxoSmithKline, Genet, King Of Prussia, PA USA
[3] GlaxoSmithKline, Genet, Stevenage, Herts, England
[4] Univ N Carolina, Dept Biostat, Dept Genet, Chapel Hill, NC USA
[5] Univ Washington, Dept Biostat, Seattle, WA 98195 USA
[6] Univ Washington, Dept Med, Div Med Genet, Seattle, WA 98195 USA
来源
PLOS ONE | 2011年 / 6卷 / 09期
基金
美国国家卫生研究院;
关键词
GENOME-WIDE ASSOCIATION; SUSCEPTIBILITY; DISEASE;
D O I
10.1371/journal.pone.0024945
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Genotype imputation has the potential to assess human genetic variation at a lower cost than assaying the variants using laboratory techniques. The performance of imputation for rare variants has not been comprehensively studied. We utilized 8865 human samples with high depth resequencing data for the exons and flanking regions of 202 genes and Genome-Wide Association Study (GWAS) data to characterize the performance of genotype imputation for rare variants. We evaluated reference sets ranging from 100 to 3713 subjects for imputing into samples typed for the Affymetrix (500K and 6.0) and Illumina 550K GWAS panels. The proportion of variants that could be well imputed (true r(2)>0.7) with a reference panel of 3713 individuals was: 31% (Illumina 550K) or 25% (Affymetrix 500K) with MAF (Minor Allele Frequency) less than or equal 0.001, 48% or 35% with 0.001<MAF<= 0.005, 54% or 38% with 0.005<MAF<= 0.01, 78% or 57% with 0.01<MAF<= 0.05, and 97% or 86% with MAF>0.05. The performance for common SNPs (MAF>0.05) within exons and flanking regions is comparable to imputation of more uniformly distributed SNPs. The performance for rare SNPs (0.01<MAF<= 0.05) was much more dependent on the GWAS panel and the number of reference samples. These results suggest routine use of genotype imputation for extending the assessment of common variants identified in humans via targeted exon resequencing into additional samples with GWAS data, but imputation of very rare variants (MAF<= 0.005) will require reference panels with thousands of subjects.
引用
收藏
页数:7
相关论文
共 21 条
  • [1] A map of human genome variation from population-scale sequencing
    Altshuler, David
    Durbin, Richard M.
    Abecasis, Goncalo R.
    Bentley, David R.
    Chakravarti, Aravinda
    Clark, Andrew G.
    Collins, Francis S.
    De la Vega, Francisco M.
    Donnelly, Peter
    Egholm, Michael
    Flicek, Paul
    Gabriel, Stacey B.
    Gibbs, Richard A.
    Knoppers, Bartha M.
    Lander, Eric S.
    Lehrach, Hans
    Mardis, Elaine R.
    McVean, Gil A.
    Nickerson, DebbieA.
    Peltonen, Leena
    Schafer, Alan J.
    Sherry, Stephen T.
    Wang, Jun
    Wilson, Richard K.
    Gibbs, Richard A.
    Deiros, David
    Metzker, Mike
    Muzny, Donna
    Reid, Jeff
    Wheeler, David
    Wang, Jun
    Li, Jingxiang
    Jian, Min
    Li, Guoqing
    Li, Ruiqiang
    Liang, Huiqing
    Tian, Geng
    Wang, Bo
    Wang, Jian
    Wang, Wei
    Yang, Huanming
    Zhang, Xiuqing
    Zheng, Huisong
    Lander, Eric S.
    Altshuler, David L.
    Ambrogio, Lauren
    Bloom, Toby
    Cibulskis, Kristian
    Fennell, Tim J.
    Gabriel, Stacey B.
    [J]. NATURE, 2010, 467 (7319) : 1061 - 1073
  • [2] Integrating common and rare genetic variation in diverse human populations
    Altshuler, David M.
    Gibbs, Richard A.
    Peltonen, Leena
    Dermitzakis, Emmanouil
    Schaffner, Stephen F.
    Yu, Fuli
    Bonnen, Penelope E.
    de Bakker, Paul I. W.
    Deloukas, Panos
    Gabriel, Stacey B.
    Gwilliam, Rhian
    Hunt, Sarah
    Inouye, Michael
    Jia, Xiaoming
    Palotie, Aarno
    Parkin, Melissa
    Whittaker, Pamela
    Chang, Kyle
    Hawes, Alicia
    Lewis, Lora R.
    Ren, Yanru
    Wheeler, David
    Muzny, Donna Marie
    Barnes, Chris
    Darvishi, Katayoon
    Hurles, Matthew
    Korn, Joshua M.
    Kristiansson, Kati
    Lee, Charles
    McCarroll, Steven A.
    Nemesh, James
    Keinan, Alon
    Montgomery, Stephen B.
    Pollack, Samuela
    Price, Alkes L.
    Soranzo, Nicole
    Gonzaga-Jauregui, Claudia
    Anttila, Verneri
    Brodeur, Wendy
    Daly, Mark J.
    Leslie, Stephen
    McVean, Gil
    Moutsianas, Loukas
    Nguyen, Huy
    Zhang, Qingrun
    Ghori, Mohammed J. R.
    McGinnis, Ralph
    McLaren, William
    Takeuchi, Fumihiko
    Grossman, Sharon R.
    [J]. NATURE, 2010, 467 (7311) : 52 - 58
  • [3] Lack of Association Between the Trp719Arg Polymorphism in Kinesin-Like Protein-6 and Coronary Artery Disease in 19 Case-Control Studies
    Assimes, Themistocles L.
    Holm, Hilma
    Kathiresan, Sekar
    Reilly, Muredach P.
    Thorleifsson, Gudmar
    Voight, Benjamin F.
    Erdmann, Jeanette
    Willenborg, Christina
    Vaidya, Dhananjay
    Xie, Changchun
    Patterson, Chris C.
    Morgan, Thomas M.
    Burnett, Mary Susan
    Li, Mingyao
    Hlatky, Mark A.
    Knowles, Joshua W.
    Thompson, John R.
    Absher, Devin
    Iribarren, Carlos
    Go, Alan
    Fortmann, Stephen P.
    Sidney, Stephen
    Risch, Neil
    Tang, Hua
    Myers, Richard M.
    Berger, Klaus
    Stoll, Monika
    Shah, Svati H.
    Thorgeirsson, Gudmundur
    Andersen, Karl
    Havulinna, Aki S.
    Herrera, J. Enrique
    Faraday, Nauder
    Kim, Yoonhee
    Kral, Brian G.
    Mathias, Rasika A.
    Ruczinski, Ingo
    Suktitipat, Bhoom
    Wilson, Alexander F.
    Yanek, Lisa R.
    Becker, Lewis C.
    Linsel-Nitschke, Patrick
    Lieb, Wolfgang
    Koenig, Inke R.
    Hengstenberg, Christian
    Fischer, Marcus
    Stark, Klaus
    Reinhard, Wibke
    Winogradow, Janina
    Grassl, Martina
    [J]. JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2010, 56 (19) : 1552 - 1563
  • [4] Genome-wide association analysis of susceptibility and clinical phenotype in multiple sclerosis
    Baranzini, Sergio E.
    Wang, Joanne
    Gibson, Rachel A.
    Galwey, Nicholas
    Naegelin, Yvonne
    Barkhof, Frederik
    Radue, Ernst-Wilhelm
    Lindberg, Raija L. P.
    Uitdehaag, Bernard M. G.
    Johnson, Michael R.
    Angelakopoulou, Aspasia
    Hall, Leslie
    Richardson, Jill C.
    Prinjha, Rab K.
    Gass, Achim
    Geurts, Jeroen J. G.
    Kragt, Jolijn
    Sombekke, Madeleine
    Vrenken, Hugo
    Qualley, Pamela
    Lincoln, Robin R.
    Gomez, Refujia
    Caillier, Stacy J.
    George, Michaela F.
    Mousavi, Hourieh
    Guerrero, Rosa
    Okuda, Darin T.
    Cree, Bruce A. C.
    Green, Ari J.
    Waubant, Emmanuelle
    Goodin, Douglas S.
    Pelletier, Daniel
    Matthews, Paul M.
    Hauser, Stephen L.
    Kappos, Ludwig
    Polman, Chris H.
    Oksenberg, Jorge R.
    [J]. HUMAN MOLECULAR GENETICS, 2009, 18 (04) : 767 - 778
  • [5] A Unified Approach to Genotype Imputation and Haplotype-Phase Inference for Large Data Sets of Trios and Unrelated Individuals
    Browning, Brian L.
    Browning, Sharon R.
    [J]. AMERICAN JOURNAL OF HUMAN GENETICS, 2009, 84 (02) : 210 - 223
  • [6] Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering
    Browning, Sharon R.
    Browning, Brian L.
    [J]. AMERICAN JOURNAL OF HUMAN GENETICS, 2007, 81 (05) : 1084 - 1097
  • [7] Deep resequencing reveals excess rare recent variants consistent with explosive population growth
    Coventry, Alex
    Bull-Otterson, Lara M.
    Liu, Xiaoming
    Clark, Andrew G.
    Maxwell, Taylor J.
    Crosby, Jacy
    Hixson, James E.
    Rea, Thomas J.
    Muzny, Donna M.
    Lewis, Lora R.
    Wheeler, David A.
    Sabo, Aniko
    Lusk, Christine
    Weiss, Kenneth G.
    Akbar, Humeira
    Cree, Andrew
    Hawes, Alicia C.
    Newsham, Irene
    Varghese, Robin T.
    Villasana, Donna
    Gross, Shannon
    Joshi, Vandita
    Santibanez, Jireh
    Morgan, Margaret
    Chang, Kyle
    Hale, Walker
    Templeton, Alan R.
    Boerwinkle, Eric
    Gibbs, Richard
    Sing, Charles F.
    [J]. NATURE COMMUNICATIONS, 2010, 1
  • [8] The CoLaus study: a population-based study to investigate the epidemiology and genetic determinants of cardiovascular risk factors and metabolic syndrome
    Firmann, Mathieu
    Mayor, Vladimir
    Vidal, Pedro Marques
    Bochud, Murielle
    Pecoud, Alain
    Hayoz, Daniel
    Paccaud, Fred
    Preisig, Martin
    Song, Kijoung S.
    Yuan, Xin
    Danoff, Theodore M.
    Stirnadel, Heide A.
    Waterworth, Dawn
    Mooser, Vincent
    Waeber, Gerard
    Vollenweider, Peter
    [J]. BMC CARDIOVASCULAR DISORDERS, 2008, 8 (1)
  • [9] Population-based linkage analysis of schizophrenia and bipolar case-control cohorts identifies a potential susceptibility locus on 19q13
    Francks, C.
    Tozzi, F.
    Farmer, A.
    Vincent, J. B.
    Rujescu, D.
    St Clair, D.
    Muglia, P.
    [J]. MOLECULAR PSYCHIATRY, 2010, 15 (03) : 319 - 325
  • [10] A second generation human haplotype map of over 3.1 million SNPs
    Frazer, Kelly A.
    Ballinger, Dennis G.
    Cox, David R.
    Hinds, David A.
    Stuve, Laura L.
    Gibbs, Richard A.
    Belmont, John W.
    Boudreau, Andrew
    Hardenbol, Paul
    Leal, Suzanne M.
    Pasternak, Shiran
    Wheeler, David A.
    Willis, Thomas D.
    Yu, Fuli
    Yang, Huanming
    Zeng, Changqing
    Gao, Yang
    Hu, Haoran
    Hu, Weitao
    Li, Chaohua
    Lin, Wei
    Liu, Siqi
    Pan, Hao
    Tang, Xiaoli
    Wang, Jian
    Wang, Wei
    Yu, Jun
    Zhang, Bo
    Zhang, Qingrun
    Zhao, Hongbin
    Zhao, Hui
    Zhou, Jun
    Gabriel, Stacey B.
    Barry, Rachel
    Blumenstiel, Brendan
    Camargo, Amy
    Defelice, Matthew
    Faggart, Maura
    Goyette, Mary
    Gupta, Supriya
    Moore, Jamie
    Nguyen, Huy
    Onofrio, Robert C.
    Parkin, Melissa
    Roy, Jessica
    Stahl, Erich
    Winchester, Ellen
    Ziaugra, Liuda
    Altshuler, David
    Shen, Yan
    [J]. NATURE, 2007, 449 (7164) : 851 - U3