A genotype calling algorithm for the Illumina BeadArray platform

被引:172
作者
Teo, Yik Y.
Inouye, Michael
Small, Kerrin S.
Gwilliam, Rhian
Deloukas, Panagiotis
Kwiatkowski, Dominic P.
Clark, Taane G.
机构
[1] Univ Oxford, Wellcome Trust Ctr Human Genet, Oxford OX3 7BN, England
[2] Wellcome Trust Sanger Inst, Cambridge CB10 1SA, England
基金
英国医学研究理事会; 英国惠康基金;
关键词
D O I
10.1093/bioinformatics/btm443
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Large-scale genotyping relies on the use of unsupervised automated calling algorithms to assign genotypes to hybridization data. A number of such calling algorithms have been recently established for the Affymetrix GeneChip genotyping technology. Here, we present a fast and accurate genotype calling algorithm for the Illumina BeadArray genotyping platforms. As the technology moves towards assaying millions of genetic polymorphisms simultaneously, there is a need for an integrated and easy-to-use software for calling genotypes. Results: We have introduced a model-based genotype calling algorithm which does not rely on having prior training data or require computationally intensive procedures. The algorithm can assign genotypes to hybridization data from thousands of individuals simultaneously and pools information across multiple individuals to improve the calling. The method can accommodate variations in hybridization intensities which result in dramatic shifts of the position of the genotype clouds by identifying the optimal coordinates to initialize the algorithm. By incorporating the process of perturbation analysis, we can obtain a quality metric measuring the stability of the assigned genotype calls. We show that this quality metric can be used to identify SNPs with low call rates and accuracy.
引用
收藏
页码:2741 / 2746
页数:6
相关论文
共 17 条
[1]  
*AFFYM INC, 2006, BRLMM IMPR GEN CALL
[2]   A comparison of normalization methods for high density oligonucleotide array data based on variance and bias [J].
Bolstad, BM ;
Irizarry, RA ;
Åstrand, M ;
Speed, TP .
BIOINFORMATICS, 2003, 19 (02) :185-193
[3]   Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls [J].
Burton, Paul R. ;
Clayton, David G. ;
Cardon, Lon R. ;
Craddock, Nick ;
Deloukas, Panos ;
Duncanson, Audrey ;
Kwiatkowski, Dominic P. ;
McCarthy, Mark I. ;
Ouwehand, Willem H. ;
Samani, Nilesh J. ;
Todd, John A. ;
Donnelly, Peter ;
Barrett, Jeffrey C. ;
Davison, Dan ;
Easton, Doug ;
Evans, David ;
Leung, Hin-Tak ;
Marchini, Jonathan L. ;
Morris, Andrew P. ;
Spencer, Chris C. A. ;
Tobin, Martin D. ;
Attwood, Antony P. ;
Boorman, James P. ;
Cant, Barbara ;
Everson, Ursula ;
Hussey, Judith M. ;
Jolley, Jennifer D. ;
Knight, Alexandra S. ;
Koch, Kerstin ;
Meech, Elizabeth ;
Nutland, Sarah ;
Prowse, Christopher V. ;
Stevens, Helen E. ;
Taylor, Niall C. ;
Walters, Graham R. ;
Walker, Neil M. ;
Watkins, Nicholas A. ;
Winzer, Thilo ;
Jones, Richard W. ;
McArdle, Wendy L. ;
Ring, Susan M. ;
Strachan, David P. ;
Pembrey, Marcus ;
Breen, Gerome ;
St Clair, David ;
Caesar, Sian ;
Gordon-Smith, Katherine ;
Jones, Lisa ;
Fraser, Christine ;
Green, Elain K. .
NATURE, 2007, 447 (7145) :661-678
[4]   Exploration, normalization, and genotype calls of high-density oligonucleotide SNP array data [J].
Carvalho, Benilton ;
Bengtsson, Henrik ;
Speed, Terence P. ;
Irizarry, Rafael A. .
BIOSTATISTICS, 2007, 8 (02) :485-499
[5]   Dynamic model based algorithms for screening and genotyping over 100K SNPs on oligonucleotide microarrays [J].
Di, XJ ;
Matsuzaki, H ;
Webster, TA ;
Hubbell, E ;
Liu, GY ;
Dong, SL ;
Bartell, D ;
Huang, J ;
Chiles, R ;
Yang, G ;
Shen, MM ;
Kulp, D ;
Kennedy, GC ;
Mei, R ;
Jones, KW ;
Cawley, S .
BIOINFORMATICS, 2005, 21 (09) :1958-1963
[6]   Genome-wide association study identifies a second prostate cancer susceptibility variant at 8q24 [J].
Gudmundsson, Julius ;
Sulem, Patrick ;
Manolescu, Andrei ;
Amundadottir, Laufey T. ;
Gudbjartsson, Daniel ;
Helgason, Agnar ;
Rafnar, Thorunn ;
Bergthorsson, Jon T. ;
Agnarsson, Bjarni A. ;
Baker, Adam ;
Sigurdsson, Asgeir ;
Benediktsdottir, Kristrun R. ;
Jakobsdottir, Margret ;
Xu, Jianfeng ;
Blondal, Thorarinn ;
Kostic, Jelena ;
Sun, Jielin ;
Ghosh, Shyamali ;
Stacey, Simon N. ;
Mouy, Magali ;
Saemundsdottir, Jona ;
Backman, Valgerdur M. ;
Kristjansson, Kristleifur ;
Tres, Alejandro ;
Partin, Alan W. ;
Albers-Akkers, Marjo T. ;
Marcos, Javier Godino-Ivan ;
Walsh, Patrick C. ;
Swinkels, Dorine W. ;
Navarrete, Sebastian ;
Isaacs, Sarah D. ;
Aben, Katja K. ;
Graif, Theresa ;
Cashy, John ;
Ruiz-Echarri, Manuel ;
Wiley, Kathleen E. ;
Suarez, Brian K. ;
Witjes, J. Alfred ;
Frigge, Mike ;
Ober, Carole ;
Jonsson, Eirikur ;
Einarsson, Gudmundur V. ;
Mayordomo, Jose I. ;
Kiemeney, Lambertus A. ;
Isaacs, William B. ;
Catalona, William J. ;
Barkardottir, Rosa B. ;
Gulcher, Jeffrey R. ;
Thorsteinsdottir, Unnur ;
Kong, Augustine .
NATURE GENETICS, 2007, 39 (05) :631-637
[7]   Whole-genome genotyping of haplotype tag single nucleotide polymorphisms [J].
Gunderson, KL ;
Kuhn, KM ;
Steemers, FJ ;
Ng, P ;
Murray, SS ;
Shen, R .
PHARMACOGENOMICS, 2006, 7 (04) :641-648
[8]  
KERMANI BG, 2005, Patent No. 20060224529
[9]   Optimal genotype determination in highly multiplexed SNP data [J].
Moorhead, M ;
Hardenbol, P ;
Siddiqui, F ;
Falkowski, M ;
Bruckner, C ;
Ireland, J ;
Jones, HB ;
Jain, M ;
Willis, TD ;
Faham, M .
EUROPEAN JOURNAL OF HUMAN GENETICS, 2006, 14 (02) :207-215
[10]   A method to address differential bias in genotyping in large-scale association studies [J].
Plagnol, Vincent ;
Cooper, Jason. D. ;
Todd, John A. ;
Clayton, David G. .
PLOS GENETICS, 2007, 3 (05) :759-767