Gossamer - a resource-efficient de novo assembler

被引:12
作者
Conway, Thomas [1 ]
Wazny, Jeremy [1 ]
Bromage, Andrew [1 ]
Zobel, Justin [1 ]
Beresford-Smith, Bryan [1 ]
机构
[1] Univ Melbourne, Dept Comp & Informat Syst, NICTA Victoria Res Lab, Parkville, Vic 3010, Australia
基金
澳大利亚研究理事会;
关键词
VELVET; READS;
D O I
10.1093/bioinformatics/bts297
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The de novo assembly of short read high-throughput sequencing data poses significant computational challenges. The volume of data is huge; the reads are tiny compared to the underlying sequence, and there are significant numbers of sequencing errors. There are numerous software packages that allow users to assemble short reads, but most are either limited to relatively small genomes (e. g. bacteria) or require large computing infrastructure or employ greedy algorithms and thus often do not yield high-quality results. Results: We have developed Gossamer, an implementation of the de Bruijn approach to assembly that requires close to the theoretical minimum of memory, but still allows efficient processing. Our results show that it is space efficient and produces high-quality assemblies.
引用
收藏
页码:1937 / 1938
页数:2
相关论文
共 10 条
  • [1] De novo fragment assembly with short mate-paired reads: Does the read length matter?
    Chaisson, Mark J.
    Brinza, Dumitru
    Pevzner, Pavel A.
    [J]. GENOME RESEARCH, 2009, 19 (02) : 336 - 346
  • [2] Succinct data structures for assembling large genomes
    Conway, Thomas C.
    Bromage, Andrew J.
    [J]. BIOINFORMATICS, 2011, 27 (04) : 479 - 486
  • [3] De novo bacterial genome sequencing: Millions of very short reads assembled on a desktop computer
    Hernandez, David
    Francois, Patrice
    Farinelli, Laurent
    Osteras, Magne
    Schrenzel, Jacques
    [J]. GENOME RESEARCH, 2008, 18 (05) : 802 - 809
  • [4] De novo assembly of human genomes with massively parallel short read sequencing
    Li, Ruiqiang
    Zhu, Hongmei
    Ruan, Jue
    Qian, Wubin
    Fang, Xiaodong
    Shi, Zhongbin
    Li, Yingrui
    Li, Shengting
    Shan, Gao
    Kristiansen, Karsten
    Li, Songgang
    Yang, Huanming
    Wang, Jian
    Wang, Jun
    [J]. GENOME RESEARCH, 2010, 20 (02) : 265 - 272
  • [5] GAGE: A critical evaluation of genome assemblies and assembly algorithms
    Salzberg, Steven L.
    Phillippy, Adam M.
    Zimin, Aleksey
    Puiu, Daniela
    Magoc, Tanja
    Koren, Sergey
    Treangen, Todd J.
    Schatz, Michael C.
    Delcher, Arthur L.
    Roberts, Michael
    Marcais, Guillaume
    Pop, Mihai
    Yorke, James A.
    [J]. GENOME RESEARCH, 2012, 22 (03) : 557 - 567
  • [6] Efficient de novo assembly of large genomes using compressed data structures
    Simpson, Jared T.
    Durbin, Richard
    [J]. GENOME RESEARCH, 2012, 22 (03) : 549 - 556
  • [7] ABySS: A parallel assembler for short read sequence data
    Simpson, Jared T.
    Wong, Kim
    Jackman, Shaun D.
    Schein, Jacqueline E.
    Jones, Steven J. M.
    Birol, Inanc
    [J]. GENOME RESEARCH, 2009, 19 (06) : 1117 - 1123
  • [8] Assembling millions of short DNA sequences using SSAKE
    Warren, Rene L.
    Sutton, Granger G.
    Jones, Steven J. M.
    Holt, Robert A.
    [J]. BIOINFORMATICS, 2007, 23 (04) : 500 - 501
  • [9] Velvet: Algorithms for de novo short read assembly using de Bruijn graphs
    Zerbino, Daniel R.
    Birney, Ewan
    [J]. GENOME RESEARCH, 2008, 18 (05) : 821 - 829
  • [10] Pebble and Rock Band: Heuristic Resolution of Repeats and Scaffolding in the Velvet Short-Read de Novo Assembler
    Zerbino, Daniel R.
    McEwen, Gayle K.
    Margulies, Elliott H.
    Birney, Ewan
    [J]. PLOS ONE, 2009, 4 (12):