De novo genome sequence assembly of a filamentous fungus using Sanger, 454 and Illumina sequence data

被引:0
作者
Scott DiGuistini
Nancy Y Liao
Darren Platt
Gordon Robertson
Michael Seidel
Simon K Chan
T Roderick Docking
Inanc Birol
Robert A Holt
Martin Hirst
Elaine Mardis
Marco A Marra
Richard C Hamelin
Jörg Bohlmann
Colette Breuil
Steven JM Jones
机构
[1] University of British Columbia,Department of Wood Science
[2] BC Cancer Agency Genome Sciences Centre,Michael Smith Laboratories
[3] Amyris Biotechnologies,undefined
[4] Inc.,undefined
[5] Washington University School of Medicine,undefined
[6] Natural Resources Canada,undefined
[7] University of British Columbia,undefined
来源
Genome Biology | / 10卷
关键词
Additional Data File; Draft Genome Sequence; Read Data; Draft Assembly; Velvet Assembly;
D O I
暂无
中图分类号
学科分类号
摘要
Sequencing-by-synthesis technologies can reduce the cost of generating de novo genome assemblies. We report a method for assembling draft genome sequences of eukaryotic organisms that integrates sequence information from different sources, and demonstrate its effectiveness by assembling an approximately 32.5 Mb draft genome sequence for the forest pathogen Grosmannia clavigera, an ascomycete fungus. We also developed a method for assessing draft assemblies using Illumina paired end read data and demonstrate how we are using it to guide future sequence finishing. Our results demonstrate that eukaryotic genome sequences can be accurately assembled by combining Illumina, 454 and Sanger sequence data.
引用
收藏
相关论文
共 166 条
  • [1] Huse SM(2007)Accuracy and quality of massively-parallel DNA pyrosequencing. Genome Biol 8 R143-820
  • [2] Huber JA(2008)ALLPATHS: Genome Res 18 810-501
  • [3] Morrison HG(2007)assembly of whole-genome shotgun microreads. Bioinformatics 23 500-829
  • [4] Sogin ML(2008)Assembling millions of short DNA sequences using SSAKE. Genome Res 18 821-1123
  • [5] Welch DM(2009)Velvet: Algorithms for Genome Res 19 1117-1067
  • [6] Butler J(2007)short read assembly using de Bruijn graphs. Bioinformatics 23 1061-868
  • [7] MacCallum I(2003)ABySS: A parallel assembler for short read sequence data. Nature 422 859-986
  • [8] Kleber M(2005)CEGMA: a pipeline to accurately annotate core genes in eukaryotic genomes. Nature 434 980-770
  • [9] Shlyakhter IA(2008)The genome sequence of the filamentous fungus Genome Res 18 763-2872
  • [10] Belmonte MK(2006). Can J Forest Res 36 2864-6116