Genome assembly reborn: recent computational challenges

被引:202
作者
Pop, Mihai [1 ]
机构
[1] Univ Maryland, Dept Comp Sci, Ctr Bioinformat & Computat Biol, College Pk, MD 20742 USA
基金
美国国家科学基金会;
关键词
genome assembly; genome sequencing; next generation sequencing technologies; SEQUENCING TECHNOLOGY; MICROBIAL GENOMES; REPEAT REGIONS; DNA-SEQUENCES; SHOTGUN; TOOL; VALIDATION; ALGORITHMS; GENERATION; ACCURACY;
D O I
10.1093/bib/bbp026
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Research into genome assembly algorithms has experienced a resurgence due to new challenges created by the development of next generation sequencing technologies. Several genome assemblers have been published in recent years specifically targeted at the new sequence data; however, the ever-changing technological landscape leads to the need for continued research. In addition, the low cost of next generation sequencing data has led to an increased use of sequencing in new settings. For example, the new field of metagenomics relies on large-scale sequencing of entire microbial communities instead of isolate genomes, leading to new computational challenges. In this article, we outline the major algorithmic approaches for genome assembly and describe recent developments in this domain.
引用
收藏
页码:354 / 366
页数:13
相关论文
共 73 条
[51]   De novo assembly using low-coverage short read sequence data from the rice pathogen Pseudomonas syringae pv. oryzae [J].
Reinhardt, Josephine A. ;
Baltrus, David A. ;
Nishimura, Marc T. ;
Jeck, William R. ;
Jones, Corbin D. ;
Dangl, Jeffery L. .
GENOME RESEARCH, 2009, 19 (02) :294-305
[52]   The Sorcerer II Global Ocean Sampling expedition:: Northwest Atlantic through Eastern Tropical Pacific [J].
Rusch, Douglas B. ;
Halpern, Aaron L. ;
Sutton, Granger ;
Heidelberg, Karla B. ;
Williamson, Shannon ;
Yooseph, Shibu ;
Wu, Dongying ;
Eisen, Jonathan A. ;
Hoffman, Jeff M. ;
Remington, Karin ;
Beeson, Karen ;
Tran, Bao ;
Smith, Hamilton ;
Baden-Tillson, Holly ;
Stewart, Clare ;
Thorpe, Joyce ;
Freeman, Jason ;
Andrews-Pfannkoch, Cynthia ;
Venter, Joseph E. ;
Li, Kelvin ;
Kravitz, Saul ;
Heidelberg, John F. ;
Utterback, Terry ;
Rogers, Yu-Hui ;
Falcon, Luisa I. ;
Souza, Valeria ;
Bonilla-Rosso, German ;
Eguiarte, Luis E. ;
Karl, David M. ;
Sathyendranath, Shubha ;
Platt, Trevor ;
Bermingham, Eldredge ;
Gallardo, Victor ;
Tamayo-Castillo, Giselle ;
Ferrari, Michael R. ;
Strausberg, Robert L. ;
Nealson, Kenneth ;
Friedman, Robert ;
Frazier, Marvin ;
Venter, J. Craig .
PLOS BIOLOGY, 2007, 5 (03) :398-431
[53]   Gene-Boosted Assembly of a Novel Bacterial Genome from Very Short Reads [J].
Salzberg, Steven L. ;
Sommer, Daniel D. ;
Puiu, Daniela ;
Lee, Vincent T. .
PLOS COMPUTATIONAL BIOLOGY, 2008, 4 (09)
[54]   OPTICAL MAPPING - A NOVEL, SINGLE-MOLECULE APPROACH TO GENOMIC ANALYSIS [J].
SAMAD, A ;
HUFF, EJ ;
CAI, WW ;
SCHWARTZ, DC .
GENOME RESEARCH, 1995, 5 (01) :1-4
[55]   DNA SEQUENCING WITH CHAIN-TERMINATING INHIBITORS [J].
SANGER, F ;
NICKLEN, S ;
COULSON, AR .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1977, 74 (12) :5463-5467
[56]   Hawkeye: an interactive visual analytics tool for genome assemblies [J].
Schatz, Michael C. ;
Phillippy, Adam M. ;
Shneiderman, Ben ;
Salzberg, Steven L. .
GENOME BIOLOGY, 2007, 8 (03)
[57]   Population genomic analysis of strain variation in Leptospirillum group II bacteria involved in acid mine drainage formation [J].
Simmons, Sheri L. ;
DiBartolo, Genevieve ;
Denef, Vincent J. ;
Goltsman, Daniela S. Aliaga ;
Thelen, Michael P. ;
Banfield, Jillian F. .
PLOS BIOLOGY, 2008, 6 (07) :1427-1442
[58]   ABySS: A parallel assembler for short read sequence data [J].
Simpson, Jared T. ;
Wong, Kim ;
Jackman, Shaun D. ;
Schein, Jacqueline E. ;
Jones, Steven J. M. ;
Birol, Inanc .
GENOME RESEARCH, 2009, 19 (06) :1117-1123
[59]   1000 Genomes project [J].
Nayanah Siva .
Nature Biotechnology, 2008, 26 (3) :256-256
[60]   Minimus: a fast, lightweight genome assembler [J].
Sommer, Daniel D. ;
Delcher, Arthur L. ;
Salzberg, Steven L. ;
Pop, Mihai .
BMC BIOINFORMATICS, 2007, 8