Identification and validation of copy number variants using SNP genotyping arrays from a large clinical cohort

Cited by: 14
Authors
Valsesia, Armand [1 ,2 ,4 ]
Stevenson, Brian J. [2 ,4 ]
Waterworth, Dawn [3 ]
Mooser, Vincent [3 ,5 ]
Vollenweider, Peter [5 ]
Waeber, Gerard [5 ]
Jongeneel, C. Victor [2 ,4 ,6 ,7 ]
Beckmann, Jacques S. [1 ,8 ]
Kutalik, Zoltan [1 ,2 ]
Bergmann, Sven [1 ,2 ]
Affiliations
[1] Univ Lausanne, Dept Med Genet, Lausanne, Switzerland
[2] Swiss Inst Bioinformat, Lausanne, Switzerland
[3] GlaxoSmithKline, Med Genet Clin Pharmacol & Discovery Med, Philadelphia, PA USA
[4] Ludwig Inst Canc Res, Lausanne, Switzerland
[5] CHU Vaudois, Dept Med, CH-1011 Lausanne, Switzerland
[6] Univ Illinois, Inst Genom Biol, Chicago, IL 60680 USA
[7] Univ Illinois, Natl Ctr Supercomp Applicat, Chicago, IL 60680 USA
[8] CHU Vaudois, Serv Med Genet, CH-1011 Lausanne, Switzerland
Source
BMC GENOMICS | 2012 / Vol. 13
Keywords
GENOME-WIDE ASSOCIATION; CIRCULAR BINARY SEGMENTATION; HIDDEN-MARKOV MODEL; STRUCTURAL VARIATION; SUSCEPTIBILITY LOCI; CGH MICROARRAYS; RESOLUTION; POPULATION; ALGORITHMS; DELETIONS;
DOI
10.1186/1471-2164-13-241
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];
Subject classification codes
071005 ; 0836 ; 090102 ; 100705 ;
Abstract
Background: Genotypes obtained with commercial SNP arrays have been extensively used in many large case-control or population-based cohorts for SNP-based genome-wide association studies for a multitude of traits. Yet, these genotypes capture only a small fraction of the variance of the studied traits. Genomic structural variants (GSV) such as Copy Number Variation (CNV) may account for part of the missing heritability, but their comprehensive detection requires either next-generation arrays or sequencing. Sophisticated algorithms that infer CNVs by combining the intensities from SNP-probes for the two alleles can already be used to extract a partial view of such GSV from existing data sets.
Results: Here we present several advances to facilitate the latter approach. First, we introduce a novel CNV detection method based on a Gaussian Mixture Model. Second, we propose a new algorithm, PCA merge, for combining copy-number profiles from many individuals into consensus regions. We applied both our new methods as well as existing ones to data from 5612 individuals from the CoLaus study who were genotyped on Affymetrix 500K arrays. We developed a number of procedures in order to evaluate the performance of the different methods. This includes comparison with previously published CNVs as well as using a replication sample of 239 individuals, genotyped with Illumina 550K arrays. We also established a new evaluation procedure that employs the fact that related individuals are expected to share their CNVs more frequently than randomly selected individuals. The ability to detect both rare and common CNVs provides a valuable resource that will facilitate association studies exploring potential phenotypic associations with CNVs.
Conclusion: Our new methodologies for CNV detection and their evaluation will help in extracting additional information from the large amount of SNP-genotyping data on various cohorts and use this to explore structural variants and their impact on complex traits.
Pages: 15