Multi-genome Scaffold Co-assembly Based on the Analysis of Gene Orders and Genomic Repeats

被引:7
作者
Aganezov, Sergey [1 ,2 ]
Alekseyev, Max A. [1 ]
机构
[1] George Washington Univ, Washington, DC 20052 USA
[2] ITMO Univ, St Petersburg, Russia
来源
BIOINFORMATICS RESEARCH AND APPLICATIONS, ISBRA 2016 | 2016年 / 9683卷
基金
美国国家科学基金会;
关键词
Genome assembly; Scaffolding; Gene order;
D O I
10.1007/978-3-319-38782-6_20
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Advances in the DNA sequencing technology over the past decades have increased the volume of raw sequenced genomic data available for further assembly and analysis. While there exist many software tools for assembly of sequenced genomic material, they often experience difficulties with reconstructing complete chromosomes. Major obstacles include uneven read coverage and long similar subsequences (repeats) in genomes. Assemblers therefore often are able to reliably reconstruct only long subsequences, called scaffolds. We present a method for simultaneous co-assembly of all fragmented genomes (represented as collections of scaffolds rather than chromosomes) in a given set of annotated genomes. The method is based on the analysis of gene orders and relies on the evolutionary model, which includes genome rearrangements as well as gene insertions and deletions. It can also utilize information about genomic repeats and the phylogenetic tree of the given genomes, further improving their assembly quality.
引用
收藏
页码:237 / 249
页数:13
相关论文
共 14 条
  • [1] Scaffold assembly based on genome rearrangement analysis
    Aganezov, Sergey
    Sitdykova, Nadia
    Alekseyev, Max A.
    [J]. COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2015, 57 : 46 - 53
  • [2] Breakpoint graphs and ancestral genome reconstructions
    Alekseyev, Max A.
    Pevzner, Pavel A.
    [J]. GENOME RESEARCH, 2009, 19 (05) : 943 - 957
  • [3] Ancestral gene synteny reconstruction improves extant species scaffolding
    Anselmetti, Yoann
    Berry, Vincent
    Chauve, Cedric
    Chateau, Annie
    Tannier, Eric
    Berard, Severine
    [J]. BMC GENOMICS, 2015, 16 : 1 - 13
  • [4] Assour L., 2015, Proceedings of 7th International Conference on Bioinformatics and Computational Biology, P193
  • [5] Reconstruction of Ancestral Genomes in Presence of Gene Gain and Loss
    Avdeyev, Pavel
    Jiang, Shuai
    Aganezov, Sergey
    Hu, Fei
    Alekseyev, Max A.
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2016, 23 (03) : 150 - 164
  • [6] SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing
    Bankevich, Anton
    Nurk, Sergey
    Antipov, Dmitry
    Gurevich, Alexey A.
    Dvorkin, Mikhail
    Kulikov, Alexander S.
    Lesin, Valery M.
    Nikolenko, Sergey I.
    Son Pham
    Prjibelski, Andrey D.
    Pyshkin, Alexey V.
    Sirotkin, Alexander V.
    Vyahhi, Nikolay
    Tesler, Glenn
    Alekseyev, Max A.
    Pevzner, Pavel A.
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2012, 19 (05) : 455 - 477
  • [7] Structural variation in the human genome
    Feuk, L
    Carson, AR
    Scherer, SW
    [J]. NATURE REVIEWS GENETICS, 2006, 7 (02) : 85 - 97
  • [8] High-quality draft assemblies of mammalian genomes from massively parallel sequence data
    Gnerre, Sante
    MacCallum, Iain
    Przybylski, Dariusz
    Ribeiro, Filipe J.
    Burton, Joshua N.
    Walker, Bruce J.
    Sharpe, Ted
    Hall, Giles
    Shea, Terrance P.
    Sykes, Sean
    Berlin, Aaron M.
    Aird, Daniel
    Costello, Maura
    Daza, Riza
    Williams, Louise
    Nicol, Robert
    Gnirke, Andreas
    Nusbaum, Chad
    Lander, Eric S.
    Jaffe, David B.
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2011, 108 (04) : 1513 - 1518
  • [9] A comprehensive evaluation of assembly scaffolding tools
    Hunt, Martin
    Newbold, Chris
    Berriman, Matthew
    Otto, Thomas D.
    [J]. GENOME BIOLOGY, 2014, 15 (03):
  • [10] BioMart: driving a paradigm change in biological data management
    Kasprzyk, Arek
    [J]. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2011,