Interpretation of custom designed Illumina genotype cluster plots for targeted association studies and next-generation sequence validation

被引:21
作者
Tindall E.A. [1 ,2 ]
Petersen D.C. [1 ]
Nikolaysen S. [1 ,3 ]
Miller W. [4 ]
Schuster S.C. [4 ]
Hayes V.M. [1 ,2 ,4 ]
机构
[1] Cancer Genetics Group, Children's Cancer Institute Australia for Medical Research, Sydney Children's Hospital, Randwick, NSW 2031, High Street
[2] Faculty of Medicine, University of New South Wales, Randwick
[3] Faculty of Science, University of New South Wales, Randwick
[4] Pennsylvania State University, Center for Comparative Genomics and Bioinformatics, University Park
关键词
Minimal Coverage; Single Nucleotide Polymorphism Array; Tasmanian Devil; Allele Specific Oligo; Single Nucleotide Polymorphism Assay;
D O I
10.1186/1756-0500-3-39
中图分类号
学科分类号
摘要
Background. High-throughput custom designed genotyping arrays are a valuable resource for biologically focused research studies and increasingly for validation of variation predicted by next-generation sequencing (NGS) technologies. We investigate the Illumina GoldenGate chemistry using custom designed VeraCode and sentrix array matrix (SAM) assays for each of these applications, respectively. We highlight applications for interpretation of Illumina generated genotype cluster plots to maximise data inclusion and reduce genotyping errors. Findings. We illustrate the dramatic effect of outliers in genotype calling and data interpretation, as well as suggest simple means to avoid genotyping errors. Furthermore we present this platform as a successful method for two-cluster rare or non-autosomal variant calling. The success of high-throughput technologies to accurately call rare variants will become an essential feature for future association studies. Finally, we highlight additional advantages of the Illumina GoldenGate chemistry in generating unusually segregated cluster plots that identify potential NGS generated sequencing error resulting from minimal coverage. Conclusions. We demonstrate the importance of visually inspecting genotype cluster plots generated by the Illumina software and issue warnings regarding commonly accepted quality control parameters. In addition to suggesting applications to minimise data exclusion, we propose that the Illumina cluster plots may be helpful in identifying potential in-put sequence errors, particularly important for studies to validate NGS generated variation. © 2010 Hayes et al; licensee BioMed Central Ltd.
引用
收藏
相关论文
共 17 条
[1]  
Pavy N., Pelgas B., Beauseigle S., Blais S., Gagnon F., Gosselin I., Lamothe M., Isabel N., Bousquet J., Enhancing genetic mapping of complex genomes through the design of highly-multiplexed SNP arrays: Application to the large and unsequenced genomes of white spruce and black spruce, BMC Genomics, 9, (2008)
[2]  
Van Tassell C.P., Smith T.P., Matukumalli L.K., Taylor J.F., Schnabel R.D., Lawley C.T., Haudenschild C.D., Moore S.S., Warren W.C., Sonstegard T.S., SNP discovery and allele frequency estimation by deep sequencing of reduced representation libraries, Nat Methods, 5, 3, pp. 247-252, (2008)
[3]  
Agalliu I., Schweitzer P.A., Leanza S.M., Burk R.D., Rohan T.E., Illumina DNA test panel-based genotyping of whole genome amplified-DNA extracted from hair samples: Performance and agreement with genotyping results from genomic DNA from buccal cells, Clin Chem Lab Med, 47, 5, pp. 516-522, (2009)
[4]  
Paynter R.A., Skibola D.R., Skibola C.F., Buffler P.A., Wiemels J.L., Smith M.T., Accuracy of multiplexed Illumina platform-based single-nucleotide polymorphism genotyping compared between genomic and whole genome amplified DNA collected from multiple sources, Cancer Epidemiol Biomarkers Prev, 15, 12, pp. 2533-2536, (2006)
[5]  
Hollegaard M.V., Grove J., Thorsen P., Norgaard-Pedersen B., Hougaard D.M., High-throughput genotyping on archived dried blood spot samples, Genet Test Mol Biomarkers, 13, 2, pp. 173-179, (2009)
[6]  
McClure M.C., McKay S.D., Schnabel R.D., Taylor J.F., Assessment of DNA extracted from FTA(R) cards for use on the Illumina iSelect BeadChip, BMC Res Notes, 2, 1, (2009)
[7]  
Berthier-Schaad Y., Kao W.H., Coresh J., Zhang L., Ingersoll R.G., Stephens R., Smith M.W., Reliability of high-throughput genotyping of whole genome amplified DNA in SNP genotyping studies, Electrophoresis, 28, 16, pp. 2812-2817, (2007)
[8]  
Cunningham J.M., Sellers T.A., Schildkraut J.M., Fredericksen Z.S., Vierkant R.A., Kelemen L.E., Gadre M., Phelan C.M., Huang Y., Meyer J.G., Et al., Performance of amplified DNA in an Illumina GoldenGate BeadArray assay, Cancer Epidemiol Biomarkers Prev, 17, 7, pp. 1781-1789, (2008)
[9]  
Mead S., Poulter M., Beck J., Uphill J., Jones C., Ang C.E., Mein C.A., Collinge J., Successful amplification of degraded DNA for use with high-throughput SNP genotyping platforms, Hum Mutat, 29, 12, pp. 1452-1458, (2008)
[10]  
Brown W.M., Pierce J., Hilner J.E., Perdue L.H., Lohman K., Li L., Venkatesh R.B., Hunt S., Mychaleckyj J.C., Deloukas P., Overview of the MHC fine mapping data, Diabetes Obes Metab, 11, SUPPL 1, pp. 2-7, (2009)