Analysis of copy number variants and segmental duplications in the human genome: Evidence for a change in the process of formation in recent evolutionary history

Cited by: 102
Authors
Kim, Philip M. [1]
Lam, Hugo Y. K. [2]
Urban, Alexander E. [3]
Korbel, Jan O. [1,6]
Affourtit, Jason
Grubert, Fabian [4]
Chen, Xueying [1]
Weissman, Sherman [4]
Snyder, Michael [3]
Gerstein, Mark B. [1,2,5]
Affiliations
[1] Yale Univ, Dept Mol Biophys & Biochem, New Haven, CT 06520 USA
[2] Yale Univ, Program Computat Biol & Bioinformat, New Haven, CT 06520 USA
[3] Yale Univ, Dept Mol Cellular & Dev Biol, New Haven, CT 06520 USA
[4] Yale Univ, Dept Genet, New Haven, CT 06520 USA
[5] Yale Univ, Dept Comp Sci, New Haven, CT 06520 USA
[6] European Mol Biol Lab, D-69177 Heidelberg, Germany
DOI
10.1101/gr.081422.108
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
Segmental duplications (SDs) are operationally defined as >1 kb stretches of duplicated DNA with high sequence identity. They arise from copy number variants (CNVs) fixed in the population. To investigate the formation of SDs and CNVs, we examine their large-scale patterns of co-occurrence with different repeats. Alu elements, a major class of genomic repeats, had previously been identified as prime drivers of SD formation. We also observe this association; however, we find that it sharply decreases for younger SDs. Continuing this trend, we find only weak associations of CNVs with Alus. Similarly, we find an association of SDs with processed pseudogenes, which decreases for younger SDs and is absent entirely for CNVs. Next, we find that SDs are significantly co-localized with each other, resulting in a highly skewed "power-law" distribution and chromosomal hotspots. We also observe a significant association of CNVs with SDs, but find that an SD-mediated mechanism only accounts for some CNVs (<28%). Overall, our results imply that a shift in predominant formation mechanism occurred in recent history: ~40 million years ago, during the "Alu burst" in retrotransposition activity, non-allelic homologous recombination, first mediated by Alus and then by the newly formed CNVs themselves, was the main driver of genome rearrangements; however, its relative importance has decreased markedly since then, with proportionally more events now stemming from other repeats and from non-homologous end-joining. In addition to a coarse-grained analysis, we performed targeted sequencing of 67 CNVs and then analyzed a combined set of 270 CNVs (540 breakpoints) to verify our conclusions.
Pages: 1865-1874
Page count: 10