Assessment of de novo assemblers for draft genomes: a case study with fungal genomes

被引:19
作者
Abbas, Mostafa M. [1 ]
Malluhi, Qutaibah M. [1 ]
Balakrishnan, Ponnuraman [1 ]
机构
[1] Qatar Univ, Coll Engn, KINDI Ctr Comp Res, Doha, Qatar
关键词
SINGLE-CELL; SHORT READS; EFFICIENT; SPACE; GAUGE;
D O I
10.1186/1471-2164-15-S9-S10
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Recently, large bio-projects dealing with the release of different genomes have transpired. Most of these projects use next-generation sequencing platforms. As a consequence, many de novo assembly tools have evolved to assemble the reads generated by these platforms. Each tool has its own inherent advantages and disadvantages, which make the selection of an appropriate tool a challenging task. Results: We have evaluated the performance of frequently used de novo assemblers namely ABySS, IDBA-UD, Minia, SOAP, SPAdes, Sparse, and Velvet. These assemblers are assessed based on their output quality during the assembly process conducted over fungal data. We compared the performance of these assemblers by considering both computational as well as quality metrics. By analyzing these performance metrics, the assemblers are ranked and a procedure for choosing the candidate assembler is illustrated. Conclusions: In this study, we propose an assessment method for the selection of de novo assemblers by considering their computational as well as quality metrics at the draft genome level. We divide the quality metrics into three groups: g1 measures the goodness of the assemblies, g2 measures the problems of the assemblies, and g3 measures the conservation elements in the assemblies. Our results demonstrate that the assemblers ABySS and IDBA-UD exhibit a good performance for the studied data from fungal genomes in terms of running time, memory, and quality. The results suggest that whole genome shotgun sequencing projects should make use of different assemblers by considering their merits.
引用
收藏
页数:12
相关论文
共 45 条
[1]   SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing [J].
Bankevich, Anton ;
Nurk, Sergey ;
Antipov, Dmitry ;
Gurevich, Alexey A. ;
Dvorkin, Mikhail ;
Kulikov, Alexander S. ;
Lesin, Valery M. ;
Nikolenko, Sergey I. ;
Son Pham ;
Prjibelski, Andrey D. ;
Pyshkin, Alexey V. ;
Sirotkin, Alexander V. ;
Vyahhi, Nikolay ;
Tesler, Glenn ;
Alekseyev, Max A. ;
Pevzner, Pavel A. .
JOURNAL OF COMPUTATIONAL BIOLOGY, 2012, 19 (05) :455-477
[2]   Comparative genomic structure of prokaryotes [J].
Bentley, SD ;
Parkhill, J .
ANNUAL REVIEW OF GENETICS, 2004, 38 :771-792
[3]  
Birren B., 2003, Fungal Genome Initiative: A White Paper for Fungal Comparative Genomics
[4]   Draft Genome Sequence of Botrytis cinerea BcDW1, Inoculum for Noble Rot of Grape Berries [J].
Blanco-Ulate, Barbara ;
Allen, Greg ;
Powell, Ann L. T. ;
Cantu, Dario .
GENOME ANNOUNCEMENTS, 2013, 1 (03)
[5]   Draft Genome Sequence of the Ascomycete Phaeoacremonium aleophilum Strain UCR-PA7, a Causal Agent of the Esca Disease Complex in Grapevines [J].
Blanco-Ulate, Barbara ;
Rolshausen, Philippe ;
Cantu, Dario .
GENOME ANNOUNCEMENTS, 2013, 1 (03)
[6]   Draft Genome Sequence of the Grapevine Dieback Fungus Eutypa lata UCR-EL1 [J].
Blanco-Ulate, Barbara ;
Rolshausen, Philippe E. ;
Cantu, Dario .
GENOME ANNOUNCEMENTS, 2013, 1 (03)
[7]   ALLPATHS: De novo assembly of whole-genome shotgun microreads [J].
Butler, Jonathan ;
MacCallum, Iain ;
Kleber, Michael ;
Shlyakhter, Ilya A. ;
Belmonte, Matthew K. ;
Lander, Eric S. ;
Nusbaum, Chad ;
Jaffe, David B. .
GENOME RESEARCH, 2008, 18 (05) :810-820
[8]   Genome analyses of the wheat yellow (stripe) rust pathogen Puccinia striiformis f. sp tritici reveal polymorphic and haustorial expressed secreted proteins as candidate effectors [J].
Cantu, Dario ;
Segovia, Vanesa ;
MacLean, Daniel ;
Bayles, Rosemary ;
Chen, Xianming ;
Kamoun, Sophien ;
Dubcovsky, Jorge ;
Saunders, Diane G. O. ;
Uauy, Cristobal .
BMC GENOMICS, 2013, 14
[9]   Short read fragment assembly of bacterial genomes [J].
Chaisson, Mark J. ;
Pevzner, Pavel A. .
GENOME RESEARCH, 2008, 18 (02) :324-330
[10]   Space-efficient and exact de Bruijn graph representation based on a Bloom filter [J].
Chikhi, Rayan ;
Rizk, Guillaume .
ALGORITHMS FOR MOLECULAR BIOLOGY, 2013, 8