Short read fragment assembly of bacterial genomes

Cited by: 274
Authors
Chaisson, Mark J. [2]
Pevzner, Pavel A. [1]
Affiliations
[1] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
[2] Univ Calif San Diego, Bioinformat Program, La Jolla, CA 92093 USA
DOI
10.1101/gr.7088808
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
In the last year, high-throughput sequencing technologies have progressed from proof-of-concept to production quality. While these methods produce high-quality reads, they have yet to produce reads comparable in length to Sanger-based sequencing. Current fragment assembly algorithms have been implemented and optimized for mate-paired Sanger-based reads, and thus do not perform well on short reads produced by short read technologies. We present a new Eulerian assembler that generates nearly optimal short read assemblies of bacterial genomes and describe an approach to assemble reads in the case of the popular hybrid protocol when short and long Sanger-based reads are combined.
Pages: 324-330
Page count: 7
Related papers
35 in total
  • [11] Structural variation in the human genome
    Feuk, L
    Carson, AR
    Scherer, SW
    [J]. NATURE REVIEWS GENETICS, 2006, 7 (02) : 85 - 97
  • [12] Arborescence optimization problems solvable by Edmonds' algorithm
    Georgiadis, L
    [J]. THEORETICAL COMPUTER SCIENCE, 2003, 301 (1-3) : 427 - 437
  • [13] A Sanger/pyrosequencing hybrid approach for the generation of high-quality draft assemblies of marine microbial genomes
    Goldberg, Susanne M. D.
    Johnson, Justin
    Busam, Dana
    Feldblyum, Tamara
    Ferriera, Steve
    Friedman, Robert
    Halpern, Aaron
    Khouri, Hoda
    Kravitz, Saul A.
    Lauro, Federico M.
    Li, Kelvin
    Rogers, Yu-Hui
    Strausberg, Robert
    Sutton, Granger
    Tallon, Luke
    Thomas, Torsten
    Venter, Eli
    Frazier, Marvin
    Venter, J. Craig
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2006, 103 (30) : 11240 - 11245
  • [14] PCAP: A whole-genome assembly program
    Huang, XQ
    Wang, JM
    Aluru, S
    Yang, SP
    Hillier, L
    [J]. GENOME RESEARCH, 2003, 13 (09) : 2164 - 2170
  • [15] Idury R M, 1995, J Comput Biol, V2, P291, DOI 10.1089/cmb.1995.2.291
  • [16] Whole-genome sequence assembly for mammalian genomes: Arachne 2
    Jaffe, DB
    Butler, J
    Gnerre, S
    Mauceli, E
    Lindblad-Toh, K
    Mesirov, JP
    Zody, MC
    Lander, ES
    [J]. GENOME RESEARCH, 2003, 13 (01) : 91 - 96
  • [17] Ancestral reconstruction of segmental duplications reveals punctuated cores of human genome evolution
    Jiang, Zhaoshi
    Tang, Haixu
    Ventura, Mario
    Cardone, Maria Francesca
    Marques-Bonet, Tomas
    She, Xinwei
    Pevzner, Pavel A.
    Eichler, Evan E.
    [J]. NATURE GENETICS, 2007, 39 (11) : 1361 - 1368
  • [18] Polony multiplex analysis of gene expression (PMAGE) in mouse hypertrophic cardiomyopathy
    Kim, Jae Bum
    Porreca, Gregory J.
    Song, Lei
    Greenway, Steven C.
    Gorham, Joshua M.
    Church, George M.
    Seidman, Christine E.
    Seidman, J. G.
    [J]. SCIENCE, 2007, 316 (5830) : 1481 - 1484
  • [19] A space-efficient construction of the Burrows-Wheeler transform for genomic data
    Lippert, RA
    Mobarry, CM
    Walenz, BP
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2005, 12 (07) : 943 - 951
  • [20] Genome sequencing in microfabricated high-density picolitre reactors
    Margulies, M
    Egholm, M
    Altman, WE
    Attiya, S
    Bader, JS
    Bemben, LA
    Berka, J
    Braverman, MS
    Chen, YJ
    Chen, ZT
    Dewell, SB
    Du, L
    Fierro, JM
    Gomes, XV
    Godwin, BC
    He, W
    Helgesen, S
    Ho, CH
    Irzyk, GP
    Jando, SC
    Alenquer, MLI
    Jarvie, TP
    Jirage, KB
    Kim, JB
    Knight, JR
    Lanza, JR
    Leamon, JH
    Lefkowitz, SM
    Lei, M
    Li, J
    Lohman, KL
    Lu, H
    Makhijani, VB
    McDade, KE
    McKenna, MP
    Myers, EW
    Nickerson, E
    Nobile, JR
    Plant, R
    Puc, BP
    Ronan, MT
    Roth, GT
    Sarkis, GJ
    Simons, JF
    Simpson, JW
    Srinivasan, M
    Tartaro, KR
    Tomasz, A
    Vogt, KA
    Volkmer, GA
    [J]. NATURE, 2005, 437 (7057) : 376 - 380