Identification of Small Exonic CNV from Whole-Exome Sequence Data and Application to Autism Spectrum Disorder

被引:122
作者
Poultney, Christopher S. [1 ,2 ]
Goldberg, Arthur P. [1 ,2 ,3 ]
Drapeau, Elodie [1 ,2 ]
Kou, Yan [1 ,4 ,5 ]
Harony-Nicolas, Hala [1 ,2 ]
Kajiwara, Yuji [1 ,2 ]
De Rubeis, Silvia [1 ,2 ]
Durand, Simon [1 ,2 ]
Stevens, Christine [6 ]
Rehnstroem, Karola [7 ,8 ]
Palotie, Aarno [6 ,7 ]
Daly, Mark J. [6 ,9 ,10 ]
Ma'ayan, Avi [4 ,5 ]
Fromer, Menachem [2 ,11 ,12 ]
Buxbaum, Joseph D. [1 ,2 ,3 ,11 ,12 ,13 ]
机构
[1] Icahn Sch Med Mt Sinai, Seaver Autism Ctr Res & Treatment, New York, NY 10029 USA
[2] Icahn Sch Med Mt Sinai, Dept Psychiat, New York, NY 10029 USA
[3] Icahn Sch Med Mt Sinai, Icahn Inst Genom & Multiscale Biol, New York, NY 10029 USA
[4] Icahn Sch Med Mt Sinai, Dept Pharmacol & Syst Therapeut, New York, NY 10029 USA
[5] Icahn Sch Med Mt Sinai, Syst Biol Ctr New York, New York, NY 10029 USA
[6] Broad Inst MIT & Harvard, Program Med & Populat Genet, Cambridge, MA 02142 USA
[7] Univ Helsinki, Inst Mol Med Finland FIMM, FIN-00290 Helsinki, Finland
[8] Wellcome Trust Sanger Inst, Cambridge CB10 1SA, England
[9] Massachusetts Gen Hosp, Dept Med, Analyt & Translat Genet Unit, Boston, MA 02114 USA
[10] Harvard Univ, Sch Med, Boston, MA 02114 USA
[11] Icahn Sch Med Mt Sinai, Dept Neurosci, Dept Genet & Genom Sci, New York, NY 10029 USA
[12] Icahn Sch Med Mt Sinai, Friedman Brain Inst, New York, NY 10029 USA
[13] Icahn Sch Med Mt Sinai, Mindich Child Hlth & Dev Inst, New York, NY 10029 USA
基金
美国国家卫生研究院;
关键词
COPY-NUMBER VARIATION; DE-NOVO MUTATIONS; GENE; REVEALS; HAPLOINSUFFICIENCY; DISCOVERY; NETWORK; SHANK3; TOOL;
D O I
10.1016/j.ajhg.2013.09.001
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Copy number variation (CNV) is an important determinant of human diversity and plays important roles in susceptibility to disease. Most studies of CNV carried out to date have made use of chromosome microarray and have had a lower size limit for detection of about 30 kilobases (kb). With the emergence of whole-exome sequencing studies, we asked whether such data could be used to reliably call rare exonic CNV in the size range of 1-30 kilobases (kb), making use of the eXome Hidden Markov Model (XHMM) program. By using both transmission information and validation by molecular methods, we confirmed that small CNV encompassing as few as three exons can be reliably called from whole-exome data. We applied this approach to an autism case-control sample (n = 811, mean per-target read depth = 161) and observed a significant increase in the burden of rare (MAF <= 1%) 1-30 kb CNV, 1-30 kb deletions, and 1-10 kb deletions in ASD. CNV in the 1-30 kb range frequently hit just a single gene, and we were therefore able to carry out enrichment and pathway analyses, where we observed enrichment for disruption of genes in cytoskeletal and autophagy pathways in ASD. In summary, our results showed that XHMM provided an effective means to assess small exonic CNV from whole-exome data, indicated that rare 1-30 kb exonic deletions could contribute to risk in up to 7% of individuals with ASD, and implicated a candidate pathway in developmental delay syndromes.
引用
收藏
页码:607 / 619
页数:13
相关论文
共 47 条
[1]   A map of human genome variation from population-scale sequencing [J].
Altshuler, David ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Collins, Francis S. ;
De la Vega, Francisco M. ;
Donnelly, Peter ;
Egholm, Michael ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Knoppers, Bartha M. ;
Lander, Eric S. ;
Lehrach, Hans ;
Mardis, Elaine R. ;
McVean, Gil A. ;
Nickerson, DebbieA. ;
Peltonen, Leena ;
Schafer, Alan J. ;
Sherry, Stephen T. ;
Wang, Jun ;
Wilson, Richard K. ;
Gibbs, Richard A. ;
Deiros, David ;
Metzker, Mike ;
Muzny, Donna ;
Reid, Jeff ;
Wheeler, David ;
Wang, Jun ;
Li, Jingxiang ;
Jian, Min ;
Li, Guoqing ;
Li, Ruiqiang ;
Liang, Huiqing ;
Tian, Geng ;
Wang, Bo ;
Wang, Jian ;
Wang, Wei ;
Yang, Huanming ;
Zhang, Xiuqing ;
Zheng, Huisong ;
Lander, Eric S. ;
Altshuler, David L. ;
Ambrogio, Lauren ;
Bloom, Toby ;
Cibulskis, Kristian ;
Fennell, Tim J. ;
Gabriel, Stacey B. .
NATURE, 2010, 467 (7319) :1061-1073
[2]  
Aoki K.F., 2005, USING KEGG DATABASE
[3]   The IntAct molecular interaction database in 2010 [J].
Aranda, B. ;
Achuthan, P. ;
Alam-Faruque, Y. ;
Armean, I. ;
Bridge, A. ;
Derow, C. ;
Feuermann, M. ;
Ghanbarian, A. T. ;
Kerrien, S. ;
Khadake, J. ;
Kerssemakers, J. ;
Leroy, C. ;
Menden, M. ;
Michaut, M. ;
Montecchi-Palazzi, L. ;
Neuhauser, S. N. ;
Orchard, S. ;
Perreau, V. ;
Roechert, B. ;
van Eijk, K. ;
Hermjakob, H. .
NUCLEIC ACIDS RESEARCH, 2010, 38 :D525-D531
[4]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[5]   Genes2Networks: connecting lists of gene symbols using mammalian protein interactions databases [J].
Berger, Seth I. ;
Posner, Jeremy M. ;
Ma'ayan, Avi .
BMC BIOINFORMATICS, 2007, 8 (1)
[6]   SHANK3 haploinsufficiency: a "common" but underdiagnosed highly penetrant monogenic cause of autism spectrum disorders [J].
Betancur, Catalina ;
Buxbaum, Joseph D. .
MOLECULAR AUTISM, 2013, 4
[7]   Etiological heterogeneity in autism spectrum disorders: More than 100 genetic and genomic disorders and still counting [J].
Betancur, Catalina .
BRAIN RESEARCH, 2011, 1380 :42-77
[8]   MINT, the molecular interaction database: 2009 update [J].
Ceol, Arnaud ;
Aryamontri, Andrew Chatr ;
Licata, Luana ;
Peluso, Daniele ;
Briganti, Leonardo ;
Perfetto, Livia ;
Castagnoli, Luisa ;
Cesareni, Gianni .
NUCLEIC ACIDS RESEARCH, 2010, 38 :D532-D539
[9]   Enrichr: interactive and collaborative HTML']HTML5 gene list enrichment analysis tool [J].
Chen, Edward Y. ;
Tan, Christopher M. ;
Kou, Yan ;
Duan, Qiaonan ;
Wang, Zichen ;
Meirelles, Gabriela Vaz ;
Clark, Neil R. ;
Ma'ayan, Avi .
BMC BIOINFORMATICS, 2013, 14
[10]  
Choi AMK, 2013, NEW ENGL J MED, V368, P1845, DOI [10.1056/NEJMra1205406, 10.1056/NEJMc1303158]