Oxford Nanopore sequencing, hybrid error correction, and de novo assembly of a eukaryotic genome

被引:244
作者
Goodwin, Sara [1 ]
Gurtowski, James [1 ]
Ethe-Sayers, Scott [1 ]
Deshpande, Panchajanya [1 ]
Schatz, Michael C. [1 ]
McCombie, W. Richard [1 ]
机构
[1] Cold Spring Harbor Lab, Cold Spring Harbor, NY 11724 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
DNA; DISCRIMINATION;
D O I
10.1101/gr.191395.115
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Monitoring the progress of DNA molecules through a membrane pore has been postulated as a method for sequencing DNA for several decades. Recently, a nanopore-based sequencing instrument, the Oxford Nanopore MinION, has become available, and we used this for sequencing the Saccharomyces cereyisiae genome. To make use of these data, we developed a novel open-source hybrid error correction algorithm Nanocorr specifically for Oxford Nanopore reads, because existing packages were incapable of assembling the long read lengths (5-50 kbp) at such high error rates (between similar to 5% and 40% error). With this new method, we were able to perform a hybrid error correction of the nanopore reads using complementary MiSeq data and produce a de novo assembly that is highly contiguous and accurate: The contig N50 length is more than ten times greater than an Illumina-only assembly (678 kb versus 59.9 kbp) and has >99.88% consensus identity when compared to the reference. Furthermore, the assembly with the long nanopore reads presents a much more complete representation of the features of the genome and correctly assembles gene cassettes, rRNAs, transposable elements, and other genomic features that were almost entirely absent ill the II lumina-only assembly.
引用
收藏
页码:1750 / 1756
页数:7
相关论文
共 20 条
[1]   Resolving the complexity of the human genome using single-molecule sequencing [J].
Chaisson, Mark J. P. ;
Huddleston, John ;
Dennis, Megan Y. ;
Sudmant, Peter H. ;
Malig, Maika ;
Hormozdiari, Fereydoun ;
Antonacci, Francesca ;
Surti, Urvashi ;
Sandstrom, Richard ;
Boitano, Matthew ;
Landolin, Jane M. ;
Stamatoyannopoulos, John A. ;
Hunkapiller, Michael W. ;
Korlach, Jonas ;
Eichler, Evan E. .
NATURE, 2015, 517 (7536) :608-U163
[2]  
Chin CS, 2013, NAT METHODS, V10, P563, DOI [10.1038/NMETH.2474, 10.1038/nmeth.2474]
[3]  
Clarke J, 2009, NAT NANOTECHNOL, V4, P265, DOI [10.1038/nnano.2009.12, 10.1038/NNANO.2009.12]
[4]   Oxford Nanopore announcement sets sequencing sector abuzz [J].
Eisenstein, Michael .
NATURE BIOTECHNOLOGY, 2012, 30 (04) :295-296
[5]   Characterization of individual polynucleotide molecules using a membrane channel [J].
Kasianowicz, JJ ;
Brandin, E ;
Branton, D ;
Deamer, DW .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1996, 93 (24) :13770-13773
[6]   Hybrid error correction and de novo assembly of single-molecule sequencing reads [J].
Koren, Sergey ;
Schatz, Michael C. ;
Walenz, Brian P. ;
Martin, Jeffrey ;
Howard, Jason T. ;
Ganapathy, Ganeshkumar ;
Wang, Zhong ;
Rasko, David A. ;
McCombie, W. Richard ;
Jarvis, Erich D. ;
Phillippy, Adam M. .
NATURE BIOTECHNOLOGY, 2012, 30 (07) :692-+
[7]   Versatile and open software for comparing large genomes [J].
Kurtz, S ;
Phillippy, A ;
Delcher, AL ;
Smoot, M ;
Shumway, M ;
Antonescu, C ;
Salzberg, SL .
GENOME BIOLOGY, 2004, 5 (02)
[8]  
Lee H., 2014, BIORXIV
[9]   Poretools: a toolkit for analyzing nanopore sequence data [J].
Loman, Nicholas J. ;
Quinlan, Aaron R. .
BIOINFORMATICS, 2014, 30 (23) :3399-3401
[10]   Next-generation DNA sequencing methods [J].
Mardis, Elaine R. .
ANNUAL REVIEW OF GENOMICS AND HUMAN GENETICS, 2008, 9 :387-402