Gossamer - a resource-efficient de novo assembler

被引:13
作者
Conway, Thomas [1 ]
Wazny, Jeremy [1 ]
Bromage, Andrew [1 ]
Zobel, Justin [1 ]
Beresford-Smith, Bryan [1 ]
机构
[1] Univ Melbourne, Dept Comp & Informat Syst, NICTA Victoria Res Lab, Parkville, Vic 3010, Australia
基金
澳大利亚研究理事会;
关键词
VELVET; READS;
D O I
10.1093/bioinformatics/bts297
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The de novo assembly of short read high-throughput sequencing data poses significant computational challenges. The volume of data is huge; the reads are tiny compared to the underlying sequence, and there are significant numbers of sequencing errors. There are numerous software packages that allow users to assemble short reads, but most are either limited to relatively small genomes (e. g. bacteria) or require large computing infrastructure or employ greedy algorithms and thus often do not yield high-quality results. Results: We have developed Gossamer, an implementation of the de Bruijn approach to assembly that requires close to the theoretical minimum of memory, but still allows efficient processing. Our results show that it is space efficient and produces high-quality assemblies.
引用
收藏
页码:1937 / 1938
页数:2
相关论文
共 10 条
[1]   De novo fragment assembly with short mate-paired reads: Does the read length matter? [J].
Chaisson, Mark J. ;
Brinza, Dumitru ;
Pevzner, Pavel A. .
GENOME RESEARCH, 2009, 19 (02) :336-346
[2]   Succinct data structures for assembling large genomes [J].
Conway, Thomas C. ;
Bromage, Andrew J. .
BIOINFORMATICS, 2011, 27 (04) :479-486
[3]   De novo bacterial genome sequencing: Millions of very short reads assembled on a desktop computer [J].
Hernandez, David ;
Francois, Patrice ;
Farinelli, Laurent ;
Osteras, Magne ;
Schrenzel, Jacques .
GENOME RESEARCH, 2008, 18 (05) :802-809
[4]   De novo assembly of human genomes with massively parallel short read sequencing [J].
Li, Ruiqiang ;
Zhu, Hongmei ;
Ruan, Jue ;
Qian, Wubin ;
Fang, Xiaodong ;
Shi, Zhongbin ;
Li, Yingrui ;
Li, Shengting ;
Shan, Gao ;
Kristiansen, Karsten ;
Li, Songgang ;
Yang, Huanming ;
Wang, Jian ;
Wang, Jun .
GENOME RESEARCH, 2010, 20 (02) :265-272
[5]   GAGE: A critical evaluation of genome assemblies and assembly algorithms [J].
Salzberg, Steven L. ;
Phillippy, Adam M. ;
Zimin, Aleksey ;
Puiu, Daniela ;
Magoc, Tanja ;
Koren, Sergey ;
Treangen, Todd J. ;
Schatz, Michael C. ;
Delcher, Arthur L. ;
Roberts, Michael ;
Marcais, Guillaume ;
Pop, Mihai ;
Yorke, James A. .
GENOME RESEARCH, 2012, 22 (03) :557-567
[6]   Efficient de novo assembly of large genomes using compressed data structures [J].
Simpson, Jared T. ;
Durbin, Richard .
GENOME RESEARCH, 2012, 22 (03) :549-556
[7]   ABySS: A parallel assembler for short read sequence data [J].
Simpson, Jared T. ;
Wong, Kim ;
Jackman, Shaun D. ;
Schein, Jacqueline E. ;
Jones, Steven J. M. ;
Birol, Inanc .
GENOME RESEARCH, 2009, 19 (06) :1117-1123
[8]   Assembling millions of short DNA sequences using SSAKE [J].
Warren, Rene L. ;
Sutton, Granger G. ;
Jones, Steven J. M. ;
Holt, Robert A. .
BIOINFORMATICS, 2007, 23 (04) :500-501
[9]   Velvet: Algorithms for de novo short read assembly using de Bruijn graphs [J].
Zerbino, Daniel R. ;
Birney, Ewan .
GENOME RESEARCH, 2008, 18 (05) :821-829
[10]   Pebble and Rock Band: Heuristic Resolution of Repeats and Scaffolding in the Velvet Short-Read de Novo Assembler [J].
Zerbino, Daniel R. ;
McEwen, Gayle K. ;
Margulies, Elliott H. ;
Birney, Ewan .
PLOS ONE, 2009, 4 (12)