SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing

被引:18721
作者
Bankevich, Anton [2 ]
Nurk, Sergey [2 ]
Antipov, Dmitry [2 ]
Gurevich, Alexey A. [2 ]
Dvorkin, Mikhail [2 ]
Kulikov, Alexander S. [2 ,3 ]
Lesin, Valery M. [2 ]
Nikolenko, Sergey I. [2 ,3 ]
Son Pham [4 ]
Prjibelski, Andrey D. [2 ]
Pyshkin, Alexey V. [2 ]
Sirotkin, Alexander V. [2 ]
Vyahhi, Nikolay [2 ]
Tesler, Glenn [5 ]
Alekseyev, Max A. [1 ,2 ]
Pevzner, Pavel A. [2 ,4 ]
机构
[1] Univ S Carolina, Dept Comp Sci & Engn, Columbia, SC 29208 USA
[2] St Petersburg Acad Univ, Russian Acad Sci, Algorithm Biol Lab, St Petersburg, Russia
[3] VA Steklov Math Inst, St Petersburg 191011, Russia
[4] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
[5] Univ Calif San Diego, Dept Math, La Jolla, CA 92093 USA
基金
美国国家卫生研究院;
关键词
assembly; de Bruijn graph; single cell; sequencing; bacteria; DE-BRUIJN GRAPHS; BACTERIAL GENOMES; AMPLIFICATION; MATTER;
D O I
10.1089/cmb.2012.0021
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The lion's share of bacteria in various environments cannot be cloned in the laboratory and thus cannot be sequenced using existing technologies. A major goal of single-cell genomics is to complement gene-centric metagenomic data with whole-genome assemblies of uncultivated organisms. Assembly of single-cell data is challenging because of highly non-uniform read coverage as well as elevated levels of sequencing errors and chimeric reads. We describe SPAdes, a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V-SC assembler (specialized for single-cell data) and on popular assemblers Velvet and SoapDeNovo (for multicell data). SPAdes generates single-cell assemblies, providing information about genomes of un-cultivatable bacteria that vastly exceeds what may be obtained via traditional metagenomics studies. SPAdes is available online (http://bioinf.spbau.ru/spades). It is distributed as open source software.
引用
收藏
页码:455 / 477
页数:23
相关论文
共 40 条
[1]   A new approach to sequence comparison:: normalired sequence alignment [J].
Arslan, AN ;
Egecioglu, Ö ;
Pevzner, PA .
BIOINFORMATICS, 2001, 17 (04) :327-337
[2]   Shotgun protein sequencing - Assembly of peptide tandem mass spectra from mixtures of modified proteins [J].
Bandeira, Nuno ;
Clauser, Karl R. ;
Pevzner, Pavel A. .
MOLECULAR & CELLULAR PROTEOMICS, 2007, 6 (07) :1123-1134
[3]   Automated de novo protein sequencing of monoclonal antibodies [J].
Bandeira, Nuno ;
Pham, Victoria ;
Pevzner, Pavel ;
Arnott, David ;
Lill, Jennie R. .
NATURE BIOTECHNOLOGY, 2008, 26 (12) :1336-1338
[4]   Genome of a Low-Salinity Ammonia-Oxidizing Archaeon Determined by Single-Cell and Metagenomic Analysis [J].
Blainey, Paul C. ;
Mosier, Annika C. ;
Potanina, Anastasia ;
Francis, Christopher A. ;
Quake, Stephen R. .
PLOS ONE, 2011, 6 (02)
[5]   ALLPATHS: De novo assembly of whole-genome shotgun microreads [J].
Butler, Jonathan ;
MacCallum, Iain ;
Kleber, Michael ;
Shlyakhter, Ilya A. ;
Belmonte, Matthew K. ;
Lander, Eric S. ;
Nusbaum, Chad ;
Jaffe, David B. .
GENOME RESEARCH, 2008, 18 (05) :810-820
[6]   Short read fragment assembly of bacterial genomes [J].
Chaisson, Mark J. ;
Pevzner, Pavel A. .
GENOME RESEARCH, 2008, 18 (02) :324-330
[7]   De novo fragment assembly with short mate-paired reads: Does the read length matter? [J].
Chaisson, Mark J. ;
Brinza, Dumitru ;
Pevzner, Pavel A. .
GENOME RESEARCH, 2009, 19 (02) :336-346
[8]  
Chikhi Rayan, 2011, Algorithms in Bioinformatics. Proceedings of the 11th International Workshop, WABI 2011, P39, DOI 10.1007/978-3-642-23038-7_4
[9]   Efficient de novo assembly of single-cell bacterial genomes from short-read data sets [J].
Chitsaz, Hamidreza ;
Yee-Greenbaum, Joyclyn L. ;
Tesler, Glenn ;
Lombardo, Mary-Jane ;
Dupont, Christopher L. ;
Badger, Jonathan H. ;
Novotny, Mark ;
Rusch, Douglas B. ;
Fraser, Louise J. ;
Gormley, Niall A. ;
Schulz-Trieglaff, Ole ;
Smith, Geoffrey P. ;
Evers, Dirk J. ;
Pevzner, Pavel A. ;
Lasken, Roger S. .
NATURE BIOTECHNOLOGY, 2011, 29 (10) :915-U214
[10]   Single-cell dissection of transcriptional heterogeneity in human colon tumors [J].
Dalerba, Piero ;
Kalisky, Tomer ;
Sahoo, Debashis ;
Rajendran, Pradeep S. ;
Rothenberg, Michael E. ;
Leyrat, Anne A. ;
Sim, Sopheak ;
Okamoto, Jennifer ;
Johnston, Darius M. ;
Qian, Dalong ;
Zabala, Maider ;
Bueno, Janet ;
Neff, Norma F. ;
Wang, Jianbin ;
Shelton, Andrew A. ;
Visser, Brendan ;
Hisamori, Shigeo ;
Shimono, Yohei ;
van de Wetering, Marc ;
Clevers, Hans ;
Clarke, Michael F. ;
Quake, Stephen R. .
NATURE BIOTECHNOLOGY, 2011, 29 (12) :1120-U11