Automatic characterization of copy number polymorphism using high throughput sequencing

Times cited: 1
Authors
Alkan, Can [1]
Affiliations
[1] Bilkent Univ, Fac Engn, Dept Comp Engn, Ankara, Turkey
Keywords
genomics; copy number polymorphism; whole genome sequencing; containers; structural variation; segmental duplications; combinatorial algorithms; human genome; diversity; insertions; strategies; evolution; discovery; genotype
DOI
10.3906/elk-1903-135
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Genome structural variations, broadly defined as alterations longer than 50 bp, are an important source of genetic variation among humans, including variants that cause complex diseases such as autism, developmental delay, and schizophrenia. Although there has been considerable progress in characterizing structural variation since the beginning of the 1000 Genomes Project, one form of structural variation, segmental duplications (SDs), has remained largely understudied in large cohorts. This is mostly because SDs cannot be accurately discovered using the alignment files generated with standard read mapping tools. Instead, they can only be found when multiple map locations are considered. Only a single algorithm is currently available for SD discovery, and it comprises various tools and scripts that are not portable and are difficult to use. Additionally, this algorithm relies on a priori information about regions where no structural variation has been discovered in a large number of genomes. Therefore, there is a need for fully automated, portable, and user-friendly tools to make SD characterization a part of genome analyses. Here we introduce such an algorithm and an efficient implementation, called mrCaNaVaR, that aims to fill this gap in the genome analysis toolbox.
Pages: 253-261
Page count: 9