Identification and validation of copy number variants using SNP genotyping arrays from a large clinical cohort

被引:14
|
作者
Valsesia, Armand [1 ,2 ,4 ]
Stevenson, Brian J. [2 ,4 ]
Waterworth, Dawn [3 ]
Mooser, Vincent [3 ,5 ]
Vollenweider, Peter [5 ]
Waeber, Gerard [5 ]
Jongeneel, C. Victor [2 ,4 ,6 ,7 ]
Beckmann, Jacques S. [1 ,8 ]
Kutalik, Zoltan [1 ,2 ]
Bergmann, Sven [1 ,2 ]
机构
[1] Univ Lausanne, Dept Med Genet, Lausanne, Switzerland
[2] Swiss Inst Bioinformat, Lausanne, Switzerland
[3] GlaxoSmithKline, Med Genet Clin Pharmacol & Discovery Med, Philadelphia, PA USA
[4] Ludwig Inst Canc Res, Lausanne, Switzerland
[5] CHU Vaudois, Dept Med, CH-1011 Lausanne, Switzerland
[6] Univ Illinois, Inst Genom Biol, Chicago, IL 60680 USA
[7] Univ Illinois, Natl Ctr Supercomp Applicat, Chicago, IL 60680 USA
[8] CHU Vaudois, Serv Med Genet, CH-1011 Lausanne, Switzerland
来源
BMC GENOMICS | 2012年 / 13卷
关键词
GENOME-WIDE ASSOCIATION; CIRCULAR BINARY SEGMENTATION; HIDDEN-MARKOV MODEL; STRUCTURAL VARIATION; SUSCEPTIBILITY LOCI; CGH MICROARRAYS; RESOLUTION; POPULATION; ALGORITHMS; DELETIONS;
D O I
10.1186/1471-2164-13-241
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Genotypes obtained with commercial SNP arrays have been extensively used in many large case-control or population-based cohorts for SNP-based genome-wide association studies for a multitude of traits. Yet, these genotypes capture only a small fraction of the variance of the studied traits. Genomic structural variants (GSV) such as Copy Number Variation (CNV) may account for part of the missing heritability, but their comprehensive detection requires either next-generation arrays or sequencing. Sophisticated algorithms that infer CNVs by combining the intensities from SNP-probes for the two alleles can already be used to extract a partial view of such GSV from existing data sets. Results: Here we present several advances to facilitate the latter approach. First, we introduce a novel CNV detection method based on a Gaussian Mixture Model. Second, we propose a new algorithm, PCA merge, for combining copy-number profiles from many individuals into consensus regions. We applied both our new methods as well as existing ones to data from 5612 individuals from the CoLaus study who were genotyped on Affymetrix 500K arrays. We developed a number of procedures in order to evaluate the performance of the different methods. This includes comparison with previously published CNVs as well as using a replication sample of 239 individuals, genotyped with Illumina 550K arrays. We also established a new evaluation procedure that employs the fact that related individuals are expected to share their CNVs more frequently than randomly selected individuals. The ability to detect both rare and common CNVs provides a valuable resource that will facilitate association studies exploring potential phenotypic associations with CNVs. Conclusion: Our new methodologies for CNV detection and their evaluation will help in extracting additional information from the large amount of SNP-genotyping data on various cohorts and use this to explore structural variants and their impact on complex traits.
引用
收藏
页数:15
相关论文
共 44 条
  • [31] CNV-ClinViewer: enhancing the clinical interpretation of large copy-number variants online
    Macnee, Marie
    Perez-Palma, Eduardo
    Bruenger, Tobias
    Kloeckner, Chiara
    Platzer, Konrad
    Stefanski, Arthur
    Montanucci, Ludovica
    Bayat, Allan
    Radtke, Maximilian
    Collins, Ryan L.
    Talkowski, Michael
    Blankenberg, Daniel
    Moller, Rikke S.
    Lemke, Johannes R.
    Nothnagel, Michael
    May, Patrick
    Lal, Dennis
    BIOINFORMATICS, 2023, 39 (05)
  • [32] Copy number variation identification and analysis of the chicken genome using a 60K SNP BeadChip
    Rao, Y. S.
    Li, J.
    Zhang, R.
    Lin, X. R.
    Xu, J. G.
    Xie, L.
    Xu, Z. Q.
    Wang, L.
    Gan, J. K.
    Xie, X. J.
    He, J.
    Zhang, X. Q.
    POULTRY SCIENCE, 2016, 95 (08) : 1750 - 1756
  • [33] Genome-wide characteristics of copy number variation in Polish Holstein and Polish Red cattle using SNP genotyping assay
    Gurgul, A.
    Jasielczuk, I.
    Szmatola, T.
    Pawlina, K.
    Zabek, T.
    Zukowski, K.
    Bugno-Poniewierska, M.
    GENETICA, 2015, 143 (02) : 145 - 155
  • [34] Genome-wide analysis of copy number variants and normal facial variation in a large cohort of Bantu Africans
    Null, Megan
    Yilmaz, Feyza
    Astling, David
    Yu, Hung-Chun
    Cole, Joanne B.
    Hallgrimsson, Benedikt
    Santorico, Stephanie A.
    Spritz, Richard A.
    Shaikh, Tamim H.
    Hendricks, Audrey E.
    HUMAN GENETICS AND GENOMICS ADVANCES, 2022, 3 (01):
  • [35] MixHMM: Inferring Copy Number Variation and Allelic Imbalance Using SNP Arrays and Tumor Samples Mixed with Stromal Cells
    Liu, Zongzhi
    Li, Ao
    Schulz, Vincent
    Chen, Min
    Tuck, David
    PLOS ONE, 2010, 5 (06):
  • [36] Identification of copy number variants in children and adolescents with autism spectrum disorder: a study from Turkey
    Ozaslan, Ahmet
    Kayhan, Gulsum
    Iseri, Elvan
    Ergun, Mehmet Ali
    Guney, Esra
    Percin, Ferda Emriye
    MOLECULAR BIOLOGY REPORTS, 2021, 48 (11) : 7371 - 7378
  • [37] Detection of pathogenic copy number variants in children with idiopathic intellectual disability using 500 K SNP array genomic hybridization
    Friedman, J. M.
    Adam, Shelin
    Arbour, Laura
    Armstrong, Linlea
    Baross, Agnes
    Birch, Patricia
    Boerkoel, Cornelius
    Chan, Susanna
    Chai, David
    Delaney, Allen D.
    Flibotte, Stephane
    Gibson, William T.
    Langlois, Sylvie
    Lemyre, Emmanuelle
    Li, H. Irene
    MacLeod, Patrick
    Mathers, Joan
    Michaud, Jacques L.
    McGillivray, Barbara C.
    Patel, Millan S.
    Qian, Hong
    Rouleau, Guy A.
    Van Allen, Margot I.
    Yong, Siu-Li
    Zahir, Farah R.
    Eydoux, Patrice
    Marra, Marco A.
    BMC GENOMICS, 2009, 10
  • [38] XCAVATOR: accurate detection and genotyping of copy number variants from second and third generation whole-genome sequencing experiments
    Magi, Alberto
    Pippucci, Tommaso
    Sidore, Carlo
    BMC GENOMICS, 2017, 18
  • [39] Comparative analyses of seven algorithms for copy number variant identification from single nucleotide polymorphism arrays
    Dellinger, Andrew E.
    Saw, Seang-Mei
    Goh, Liang K.
    Seielstad, Mark
    Young, Terri L.
    Li, Yi-Ju
    NUCLEIC ACIDS RESEARCH, 2010, 38 (09) : e105
  • [40] Concordance of copy number abnormality detection using SNP arrays and Multiplex Ligation-dependent Probe Amplification (MLPA) in acute lymphoblastic leukaemia
    Bashton, Matthew
    Hollis, Robin
    Ryan, Sarra
    Schwab, Claire J.
    Moppett, John
    Harrison, Christine J.
    Moorman, Anthony V.
    Enshaei, Amir
    SCIENTIFIC REPORTS, 2020, 10 (01)