Scaffold assembly based on genome rearrangement analysis

Cited by: 9
Authors
Aganezov, Sergey [2]
Sitdykova, Nadia [1]
Alekseyev, Max A. [2]
Affiliations
[1] Acad Univ, St Petersburg, Russia
[2] George Washington Univ, Washington, DC 20052 USA
Funding
National Science Foundation (USA)
Keywords
Scaffolding; Genome rearrangements; Breakpoint graph; MGRA; Genome assembly
DOI
10.1016/j.compbiolchem.2015.02.005
Chinese Library Classification number
Q [Biological sciences]
Discipline classification codes
07; 0710; 09
Abstract
Advances in DNA sequencing technology over the past decade have increased the volume of raw sequenced genomic data available for further assembly and analysis. While many algorithms exist for assembling sequenced genomic material, they often have difficulty constructing complete genomic sequences. Instead, they produce long genomic subsequences (scaffolds), which then become the subject of scaffold assembly, aimed at reconstructing their order along the genome's chromosomes. A satisfactory balance between reliability and cost in scaffold assembly has not yet been reached, which motivates the search for new approaches to this problem. We present a new method for scaffold assembly based on the analysis of gene orders and genome rearrangements in multiple related genomes (some or even all of which may be fragmented). Evaluation of the proposed method on artificially fragmented mammalian genomes demonstrates its high reliability. We also apply our method to incomplete Anophelinae genomes, which exhibit high fragmentation, and further validate the assembly results with reference-based scaffolding. While the two methods demonstrate consistent results, the proposed method is able to identify more assembly points than reference-based scaffolding. (C) 2015 Elsevier Ltd. All rights reserved.
Pages: 46-53
Number of pages: 8