Practical consideration of genotype imputation: Sample size, window size, reference choice, and untyped rate

被引:0
作者
Zhang, Boshao [1 ]
Zhi, Degui [1 ]
Zhang, Kui [1 ]
Gao, Guimin [2 ]
Limdi, Nita A. [3 ,4 ]
Liu, Nianjun [1 ]
机构
[1] Univ Alabama, Dept Biostat, Sch Publ Hlth, Birmingham, AL 35294 USA
[2] Virginia Commonwealth Univ, Dept Biostat, Sch Med, Richmond, VA 23298 USA
[3] Univ Alabama, Dept Neurol, Birmingham, AL 35294 USA
[4] Univ Alabama, Dept Epidemiol, Birmingham, AL 35294 USA
基金
美国国家卫生研究院;
关键词
Genotype imputation; Genetic study; Admixed population; Untyped rate; Window size; Reference; GENOME-WIDE ASSOCIATION; WARFARIN DOSE REQUIREMENTS; AFRICAN-AMERICANS; INTERINDIVIDUAL VARIABILITY; EUROPEAN-AMERICANS; MISSING GENOTYPES; HUMAN-POPULATIONS; VKORC1; MAINTENANCE; CYP2C9;
D O I
暂无
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Imputation offers a promising way to infer the missing and/or untyped genotypes in genetic studies. In practice, however, many factors may affect the quality of imputation. In this study, we evaluated the influence of untyped rate, sizes of the study sample and the reference sample, window size, and reference choice (for admixed population), as the factors affecting the quality of imputation. The results show that in order to obtain good imputation quality, it is necessary to have an untyped rate less than 50%, a reference sample size greater than 50, and a window size of greater than 500 SNPs (roughly 1 MB in base pairs). Compared with the whole-region imputation, piecewise imputation with large-enough window sizes provides improved efficacy. For an admixed study sample, if only an external reference panel is used, it should include samples from the ancestral populations that represent the admixed population under investigation. Internal references are strongly recommended. When internal references are limited, however, augmentation by external references should be used carefully. More specifically, augmentation with samples from the major source populations of the admixture can lower the quality of imputation; augmentation with seemingly genetically unrelated cohorts may improve the quality of imputation.
引用
收藏
页码:339 / 351
页数:13
相关论文
共 57 条
[1]   Fast model-based estimation of ancestry in unrelated individuals [J].
Alexander, David H. ;
Novembre, John ;
Lange, Kenneth .
GENOME RESEARCH, 2009, 19 (09) :1655-1664
[2]   Integrating common and rare genetic variation in diverse human populations [J].
Altshuler, David M. ;
Gibbs, Richard A. ;
Peltonen, Leena ;
Dermitzakis, Emmanouil ;
Schaffner, Stephen F. ;
Yu, Fuli ;
Bonnen, Penelope E. ;
de Bakker, Paul I. W. ;
Deloukas, Panos ;
Gabriel, Stacey B. ;
Gwilliam, Rhian ;
Hunt, Sarah ;
Inouye, Michael ;
Jia, Xiaoming ;
Palotie, Aarno ;
Parkin, Melissa ;
Whittaker, Pamela ;
Chang, Kyle ;
Hawes, Alicia ;
Lewis, Lora R. ;
Ren, Yanru ;
Wheeler, David ;
Muzny, Donna Marie ;
Barnes, Chris ;
Darvishi, Katayoon ;
Hurles, Matthew ;
Korn, Joshua M. ;
Kristiansson, Kati ;
Lee, Charles ;
McCarroll, Steven A. ;
Nemesh, James ;
Keinan, Alon ;
Montgomery, Stephen B. ;
Pollack, Samuela ;
Price, Alkes L. ;
Soranzo, Nicole ;
Gonzaga-Jauregui, Claudia ;
Anttila, Verneri ;
Brodeur, Wendy ;
Daly, Mark J. ;
Leslie, Stephen ;
McVean, Gil ;
Moutsianas, Loukas ;
Nguyen, Huy ;
Zhang, Qingrun ;
Ghori, Mohammed J. R. ;
McGinnis, Ralph ;
McLaren, William ;
Takeuchi, Fumihiko ;
Grossman, Sharon R. .
NATURE, 2010, 467 (7311) :52-58
[3]   Evaluating the effects of imputation on the power, coverage, and cost efficiency of genome-wide SNP platforms [J].
Anderson, Carl A. ;
Pettersson, Fredrik H. ;
Barrett, Jeffrey C. ;
Zhuang, Joanna J. ;
Ragoussis, Jiannis ;
Cardon, Lon R. ;
Morris, Andrew P. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2008, 83 (01) :112-119
[4]   Influence of coagulation factor, vitamin K epoxide reductase complex subunit 1, and cytochrome P4502C9 gene polymorphisms on warfarin dose requirements [J].
Aquilante, CL ;
Langaee, TY ;
Lopez, LM ;
Yarandi, HN ;
Tromberg, JS ;
Mohuczy, D ;
Gaston, KL ;
Waddell, CD ;
Chirico, MJ ;
Johnson, JA .
CLINICAL PHARMACOLOGY & THERAPEUTICS, 2006, 79 (04) :291-302
[5]   ProbABEL package for genome-wide association analysis of imputed data [J].
Aulchenko, Yurii S. ;
Struchalin, Maksim V. ;
van Duijn, Cornelia M. .
BMC BIOINFORMATICS, 2010, 11
[6]   Allelic variants in the CYP2C9 and VKORC1 loci and interindividual variability in the anticoagulant dose effect of warfarin in Italians [J].
Borgiani, Paold ;
Ciccacci, Cinzia ;
Forte, Vittorio ;
Romano, Silvia ;
Federici, Giorgio ;
Novelli, Giuseppe .
PHARMACOGENOMICS, 2007, 8 (11) :1545-1550
[7]   Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering [J].
Browning, Sharon R. ;
Browning, Brian L. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2007, 81 (05) :1084-1097
[8]   Missing data imputation and haplotype phase inference for genome-wide association studies [J].
Browning, Sharon R. .
HUMAN GENETICS, 2008, 124 (05) :439-450
[9]   Medication use leading to emergency department visits for adverse drug events in older adults [J].
Budnitz, Daniel S. ;
Shehab, Nadine ;
Kegler, Scott R. ;
Richards, Chesley L. .
ANNALS OF INTERNAL MEDICINE, 2007, 147 (11) :755-U26
[10]   Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls [J].
Burton, Paul R. ;
Clayton, David G. ;
Cardon, Lon R. ;
Craddock, Nick ;
Deloukas, Panos ;
Duncanson, Audrey ;
Kwiatkowski, Dominic P. ;
McCarthy, Mark I. ;
Ouwehand, Willem H. ;
Samani, Nilesh J. ;
Todd, John A. ;
Donnelly, Peter ;
Barrett, Jeffrey C. ;
Davison, Dan ;
Easton, Doug ;
Evans, David ;
Leung, Hin-Tak ;
Marchini, Jonathan L. ;
Morris, Andrew P. ;
Spencer, Chris C. A. ;
Tobin, Martin D. ;
Attwood, Antony P. ;
Boorman, James P. ;
Cant, Barbara ;
Everson, Ursula ;
Hussey, Judith M. ;
Jolley, Jennifer D. ;
Knight, Alexandra S. ;
Koch, Kerstin ;
Meech, Elizabeth ;
Nutland, Sarah ;
Prowse, Christopher V. ;
Stevens, Helen E. ;
Taylor, Niall C. ;
Walters, Graham R. ;
Walker, Neil M. ;
Watkins, Nicholas A. ;
Winzer, Thilo ;
Jones, Richard W. ;
McArdle, Wendy L. ;
Ring, Susan M. ;
Strachan, David P. ;
Pembrey, Marcus ;
Breen, Gerome ;
St Clair, David ;
Caesar, Sian ;
Gordon-Smith, Katherine ;
Jones, Lisa ;
Fraser, Christine ;
Green, Elain K. .
NATURE, 2007, 447 (7145) :661-678