Integrating genome assemblies with MAIA

被引:30
作者
Nijkamp, Jurgen [1 ,2 ,3 ]
Winterbach, Wynand [1 ,4 ]
van den Broek, Marcel [2 ,3 ]
Daran, Jean-Marc [2 ,3 ]
Reinders, Marcel [1 ,3 ,5 ]
de Ridder, Dick [1 ,3 ,5 ]
机构
[1] Delft Univ Technol, Delft Bioinformat Lab, Dept Mediamat, NL-2628 CD Delft, Netherlands
[2] Delft Univ Technol, Ind Microbiol Grp, Dept Biotechnol, NL-2628 BC Delft, Netherlands
[3] Kluyver Ctr Genom Ind Fermentat, NL-2600 GA Delft, Netherlands
[4] Delft Univ Technol, Network Architectures & Serv, Dept Telecommun, NL-2628 CD Delft, Netherlands
[5] Netherlands Bioinformat Ctr, NL-6500 HB Nijmegen, Netherlands
关键词
GENERATION; ALGORITHMS; ALIGNMENT;
D O I
10.1093/bioinformatics/btq366
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: De novo assembly of a eukaryotic genome with next-generation sequencing data is still a challenging task. Over the past few years several assemblers have been developed, often suitable for one specific type of sequencing data. The number of known genomes is expanding rapidly, therefore it becomes possible to use multiple reference genomes for assembly projects. We introduce an assembly integrator that makes use of all available data, i.e. multiple de novo assemblies and mappings against multiple related genomes, by optimizing a weighted combination of criteria. Results: The developed algorithm was applied on the de novo sequencing of the Saccharomyces cerevisiae CEN. PK 113-7D strain. Using Solexa and 454 read data, two de novo and three comparative assemblies were constructed and subsequently integrated, yielding 29 contigs, covering more than 12 Mbp; a drastic improvement compared with the single assemblies.
引用
收藏
页码:i433 / i439
页数:7
相关论文
共 25 条
[1]  
[Anonymous], 2000, Methods in yeast genetics
[2]   Genome structure of a Saccharomyces cerevisiae strain widely used in bioethanol production [J].
Argueso, Juan Lucas ;
Carazzolle, Marcelo F. ;
Mieczkowski, Piotr A. ;
Duarte, Fabiana M. ;
Netto, Osmar V. C. ;
Missawa, Silvia K. ;
Galzerani, Felipe ;
Costa, Gustavo G. L. ;
Vidal, Ramon O. ;
Noronha, Melline F. ;
Dominska, Margaret ;
Andrietta, Maria G. S. ;
Andrietta, Silvio R. ;
Cunha, Anderson F. ;
Gomes, Luiz H. ;
Tavares, Flavio C. A. ;
Alcarde, Andre R. ;
Dietrich, Fred S. ;
McCusker, John H. ;
Petes, Thomas D. ;
Pereira, Goncalo A. G. .
GENOME RESEARCH, 2009, 19 (12) :2258-2270
[3]   Fast algorithms for large-scale genome alignment and comparison [J].
Delcher, AL ;
Phillippy, A ;
Carlton, J ;
Salzberg, SL .
NUCLEIC ACIDS RESEARCH, 2002, 30 (11) :2478-2483
[4]   De novo genome sequence assembly of a filamentous fungus using Sanger, 454 and Illumina sequence data [J].
DiGuistini, Scott ;
Liao, Nancy Y. ;
Platt, Darren ;
Robertson, Gordon ;
Seidel, Michael ;
Chan, Simon K. ;
Docking, T. Roderick ;
Birol, Inanc ;
Holt, Robert A. ;
Hirst, Martin ;
Mardis, Elaine ;
Marra, Marco A. ;
Hamelin, Richard C. ;
Bohlmann, Joerg ;
Breuil, Colette ;
Jones, Steven J. M. .
GENOME BIOLOGY, 2009, 10 (09)
[5]   FUTURE PATHS FOR INTEGER PROGRAMMING AND LINKS TO ARTIFICIAL-INTELLIGENCE [J].
GLOVER, F .
COMPUTERS & OPERATIONS RESEARCH, 1986, 13 (05) :533-549
[6]   A Sanger/pyrosequencing hybrid approach tor the generation of high-quality draft assemblies of marine microbial genomes [J].
Goldberg, Susanne M. D. ;
Johnson, Justin ;
Busam, Dana ;
Feldblyum, Tamara ;
Ferriera, Steve ;
Friedman, Robert ;
Halpern, Aaron ;
Khouri, Hoda ;
Kravitz, Saul A. ;
Lauro, Federico M. ;
Li, Kelvin ;
Rogers, Yu-Hui ;
Strausberg, Robert ;
Sutton, Granger ;
Tallon, Luke ;
Thomas, Torsten ;
Venter, Eli ;
Frazier, Marvin ;
Venter, J. Craig .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2006, 103 (30) :11240-11245
[7]  
Hwang D, 2005, P NATL ACAD SCI USA, V102, P17296, DOI 10.1073/pnas.0508647102
[8]   Extending assembly of short DNA sequences to handle error [J].
Jeck, William R. ;
Reinhardt, Josephine A. ;
Baltrus, David A. ;
Hickenbotham, Matthew T. ;
Magrini, Vincent ;
Mardis, Elaine R. ;
Dangl, Jeffery L. ;
Jones, Corbin D. .
BIOINFORMATICS, 2007, 23 (21) :2942-2944
[9]   Combinatorial influence of environmental parameters on transcription factor activity [J].
Knijnenburg, T. A. ;
Wessels, L. F. A. ;
Reinders, M. J. T. .
BIOINFORMATICS, 2008, 24 (13) :I172-I181
[10]  
LEE WW, 2007, USING CELERA ASSEMBL