Automatic characterization of copy number polymorphism using high throughput sequencing

被引:1
|
作者
Alkan, Can [1 ]
机构
[1] Bilkent Univ, Fac Engn, Dept Comp Engn, Ankara, Turkey
关键词
Genomics; copy number polymorphism; whole genome sequencing; containers; STRUCTURAL VARIATION; SEGMENTAL DUPLICATIONS; COMBINATORIAL ALGORITHMS; HUMAN-GENOME; DIVERSITY; INSERTIONS; STRATEGIES; EVOLUTION; DISCOVERY; GENOTYPE;
D O I
10.3906/elk-1903-135
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Genome structural variation, broadly defined as alterations longer than 50 bp, are important sources for genetic variation among humans, including those that cause complex diseases such as autism, developmental delay, and schizophrenia. Although there has been considerable progress in characterizing structural variation since the beginnings of the 1000 Genomes Project, one form of structural variation called segmental duplications (SDs) remained largely understudied in large cohorts. This is mostly because SDs cannot be accurately discovered using the alignment files generated with standard read mapping tools. Instead, they can only be found when multiple map locations are considered. There is still a single algorithm available for SD discovery, which includes various tools and scripts that are not portable and are difficult to use. Additionally, this algorithm relies on a priori information for regions where no structural variations are discovered in large number of genomes. Therefore, there is a need for fully automated, portable, and user-friendly tools to make SD characterization a part of genome analyses. Here we introduce such an algorithm and efficient implementation, called mrCaNaVaR, that aims to fill this gap in genome analysis toolbox.
引用
收藏
页码:253 / 261
页数:9
相关论文
共 50 条
  • [1] Inferring Variation in Copy Number Using High Throughput Sequencing Data in R
    Knaus, Brian J.
    Gruenwald, Niklaus J.
    FRONTIERS IN GENETICS, 2018, 9
  • [2] Estimating relative mitochondrial DNA copy number using high throughput sequencing data
    Zhang, Pan
    Lehmann, Brian D.
    Samuels, David C.
    Zhao, Shilin
    Zhao, Ying-Yong
    Shyr, Yu
    Guo, Yan
    GENOMICS, 2017, 109 (5-6) : 457 - 462
  • [3] High-Throughput Multiplex Sequencing to Discover Copy Number Variants in Drosophila
    Daines, Bryce
    Wang, Hui
    Li, Yumei
    Han, Yi
    Gibbs, Richard
    Chen, Rui
    GENETICS, 2009, 182 (04) : 935 - 941
  • [4] Detecting common copy number variants in high-throughput sequencing data by using JointSLM algorithm
    Magi, Alberto
    Benelli, Matteo
    Yoon, Seungtai
    Roviello, Franco
    Torricelli, Francesca
    NUCLEIC ACIDS RESEARCH, 2011, 39 (10) : e65
  • [5] GtTR: Bayesian estimation of absolute tandem repeat copy number using sequence capture and high throughput sequencing
    Ganesamoorthy, Devika
    Minh Duc Cao
    Duarte, Tania
    Chen, Wenhan
    Coin, Lachlan
    BMC BIOINFORMATICS, 2018, 19
  • [6] CNV-seq, a new method to detect copy number variation using high-throughput sequencing
    Xie, Chao
    Tammi, Martti T.
    BMC BIOINFORMATICS, 2009, 10
  • [7] Detection of Copy Number Alterations Using Single Cell Sequencing
    Knouse, Kristin A.
    Wu, Jie
    Hendricks, Austin
    JOVE-JOURNAL OF VISUALIZED EXPERIMENTS, 2017, (120):
  • [8] Quadruplex MAPH: improvement of throughput in high-resolution copy number screening
    Tyson, Jess
    Majerus, Tamsin M. O.
    Walker, Susan
    Armour, John A. L.
    BMC GENOMICS, 2009, 10 : 453
  • [9] Combinatorial approach to estimate copy number genotype using whole-exome sequencing data
    Hwang, Mi Yeong
    Moon, Sanghoon
    Heo, Lyong
    Kim, Young Jin
    Oh, Ji Hee
    Kim, Yeon-Jung
    Kim, Yun Kyoung
    Lee, Juyoung
    Han, Bok-Ghee
    Kim, Bong-Jo
    GENOMICS, 2015, 105 (03) : 145 - 149
  • [10] An evaluation of copy number variation detection tools for cancer using whole exome sequencing data
    Zare, Fatima
    Dow, Michelle
    Monteleone, Nicholas
    Hosny, Abdelrahman
    Nabavi, Sheida
    BMC BIOINFORMATICS, 2017, 18