The power of single molecule real-time sequencing technology in the de novo assembly of a eukaryotic genome

被引:57
作者
Sakai, Hiroaki [1 ]
Naito, Ken [2 ]
Ogiso-Tanaka, Eri [2 ]
Takahashi, Yu [2 ]
Iseki, Kohtaro [2 ]
Muto, Chiaki [2 ]
Satou, Kazuhito [3 ]
Teruya, Kuniko [3 ]
Shiroma, Akino [3 ]
Shimoji, Makiko [3 ]
Hirano, Takashi [3 ]
Itoh, Takeshi [1 ]
Kaga, Akito [2 ]
Tomooka, Norihiko [2 ]
机构
[1] Natl Inst Agrobiol Sci, Agrogen Res Ctr, Tsukuba, Ibaraki 3058602, Japan
[2] Natl Inst Agrobiol Sci, Genet Resources Ctr, Tsukuba, Ibaraki 3058602, Japan
[3] Okinawa Inst Adv Sci, Uruma 9042234, Japan
关键词
DRAFT GENOME; GENERATION; ALIGNMENT; GENE; ANNOTATION; FRAMEWORK;
D O I
10.1038/srep16780
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Second-generation sequencers (SGS) have been game-changing, achieving cost-effective whole genome sequencing in many non-model organisms. However, a large portion of the genomes still remains unassembled. We reconstructed azuki bean (Vigna angularis) genome using single molecule real-time (SMRT) sequencing technology and achieved the best contiguity and coverage among currently assembled legume crops. The SMRT-based assembly produced 100 times longer contigs with 100 times smaller amount of gaps compared to the SGS-based assemblies. A detailed comparison between the assemblies revealed that the SMRT-based assembly enabled a more comprehensive gene annotation than the SGS-based assemblies where thousands of genes were missing or fragmented. A chromosome-scale assembly was generated based on the high-density genetic map, covering 86% of the azuki bean genome. We demonstrated that SMRT technology, though still needed support of SGS data, achieved a near-complete assembly of a eukaryotic genome.
引用
收藏
页数:13
相关论文
共 60 条
[1]   Infinium Assay for Large-scale SNP Genotyping Applications [J].
Adler, Adam J. ;
Wiley, Graham B. ;
Gaffney, Patrick M. .
JOVE-JOURNAL OF VISUALIZED EXPERIMENTS, 2013, (81)
[2]   Limitations of next-generation genome sequence assembly [J].
Alkan, Can ;
Sajjadian, Saba ;
Eichler, Evan E. .
NATURE METHODS, 2011, 8 (01) :61-65
[3]   NUCLEAR-DNA AMOUNTS IN ANGIOSPERMS [J].
BENNETT, MD ;
LEITCH, IJ .
ANNALS OF BOTANY, 1995, 76 (02) :113-176
[4]   Accurate whole human genome sequencing using reversible terminator chemistry [J].
Bentley, David R. ;
Balasubramanian, Shankar ;
Swerdlow, Harold P. ;
Smith, Geoffrey P. ;
Milton, John ;
Brown, Clive G. ;
Hall, Kevin P. ;
Evers, Dirk J. ;
Barnes, Colin L. ;
Bignell, Helen R. ;
Boutell, Jonathan M. ;
Bryant, Jason ;
Carter, Richard J. ;
Cheetham, R. Keira ;
Cox, Anthony J. ;
Ellis, Darren J. ;
Flatbush, Michael R. ;
Gormley, Niall A. ;
Humphray, Sean J. ;
Irving, Leslie J. ;
Karbelashvili, Mirian S. ;
Kirk, Scott M. ;
Li, Heng ;
Liu, Xiaohai ;
Maisinger, Klaus S. ;
Murray, Lisa J. ;
Obradovic, Bojan ;
Ost, Tobias ;
Parkinson, Michael L. ;
Pratt, Mark R. ;
Rasolonjatovo, Isabelle M. J. ;
Reed, Mark T. ;
Rigatti, Roberto ;
Rodighiero, Chiara ;
Ross, Mark T. ;
Sabot, Andrea ;
Sankar, Subramanian V. ;
Scally, Aylwyn ;
Schroth, Gary P. ;
Smith, Mark E. ;
Smith, Vincent P. ;
Spiridou, Anastassia ;
Torrance, Peta E. ;
Tzonev, Svilen S. ;
Vermaas, Eric H. ;
Walter, Klaudia ;
Wu, Xiaolin ;
Zhang, Lu ;
Alam, Mohammed D. ;
Anastasi, Carole .
NATURE, 2008, 456 (7218) :53-59
[5]   Assembling large genomes with single-molecule sequencing and locality-sensitive hashing [J].
Berlin, Konstantin ;
Koren, Sergey ;
Chin, Chen-Shan ;
Drake, James P. ;
Landolin, Jane M. ;
Phillippy, Adam M. .
NATURE BIOTECHNOLOGY, 2015, 33 (06) :623-+
[6]   Scaffolding pre-assembled contigs using SSPACE [J].
Boetzer, Marten ;
Henkel, Christiaan V. ;
Jansen, Hans J. ;
Butler, Derek ;
Pirovano, Walter .
BIOINFORMATICS, 2011, 27 (04) :578-579
[7]   Trimmomatic: a flexible trimmer for Illumina sequence data [J].
Bolger, Anthony M. ;
Lohse, Marc ;
Usadel, Bjoern .
BIOINFORMATICS, 2014, 30 (15) :2114-2120
[8]   Assemblathon 2: evaluating de novo methods of genome assembly in three vertebrate species [J].
Bradnam, Keith R. ;
Fass, Joseph N. ;
Alexandrov, Anton ;
Baranay, Paul ;
Bechner, Michael ;
Birol, Inanc ;
Boisvert, Sebastien ;
Chapman, Jarrod A. ;
Chapuis, Guillaume ;
Chikhi, Rayan ;
Chitsaz, Hamidreza ;
Chou, Wen-Chi ;
Corbeil, Jacques ;
Del Fabbro, Cristian ;
Docking, T. Roderick ;
Durbin, Richard ;
Earl, Dent ;
Emrich, Scott ;
Fedotov, Pavel ;
Fonseca, Nuno A. ;
Ganapathy, Ganeshkumar ;
Gibbs, Richard A. ;
Gnerre, Sante ;
Godzaridis, Elenie ;
Goldstein, Steve ;
Haimel, Matthias ;
Hall, Giles ;
Haussler, David ;
Hiatt, Joseph B. ;
Ho, Isaac Y. ;
Howard, Jason ;
Hunt, Martin ;
Jackman, Shaun D. ;
Jaffe, David B. ;
Jarvis, Erich D. ;
Jiang, Huaiyang ;
Kazakov, Sergey ;
Kersey, Paul J. ;
Kitzman, Jacob O. ;
Knight, James R. ;
Koren, Sergey ;
Lam, Tak-Wah ;
Lavenier, Dominique ;
Laviolette, Francois ;
Li, Yingrui ;
Li, Zhenyu ;
Liu, Binghang ;
Liu, Yue ;
Luo, Ruibang ;
MacCallum, Iain .
GIGASCIENCE, 2013, 2
[9]   R/qtl: QTL mapping in experimental crosses [J].
Broman, KW ;
Wu, H ;
Sen, S ;
Churchill, GA .
BIOINFORMATICS, 2003, 19 (07) :889-890
[10]   Ensembl 2015 [J].
Cunningham, Fiona ;
Amode, M. Ridwan ;
Barrell, Daniel ;
Beal, Kathryn ;
Billis, Konstantinos ;
Brent, Simon ;
Carvalho-Silva, Denise ;
Clapham, Peter ;
Coates, Guy ;
Fitzgerald, Stephen ;
Gil, Laurent ;
Giron, Carlos Garcia ;
Gordon, Leo ;
Hourlier, Thibaut ;
Hunt, Sarah E. ;
Janacek, Sophie H. ;
Johnson, Nathan ;
Juettemann, Thomas ;
Kaehaeri, Andreas K. ;
Keenan, Stephen ;
Martin, Fergal J. ;
Maurel, Thomas ;
McLaren, William ;
Murphy, Daniel N. ;
Nag, Rishi ;
Overduin, Bert ;
Parker, Anne ;
Patricio, Mateus ;
Perry, Emily ;
Pignatelli, Miguel ;
Riat, Harpreet Singh ;
Sheppard, Daniel ;
Taylor, Kieron ;
Thormann, Anja ;
Vullo, Alessandro ;
Wilder, Steven P. ;
Zadissa, Amonida ;
Aken, Bronwen L. ;
Birney, Ewan ;
Harrow, Jennifer ;
Kinsella, Rhoda ;
Muffato, Matthieu ;
Ruffier, Magali ;
Searle, Stephen M. J. ;
Spudich, Giulietta ;
Trevanion, Stephen J. ;
Yates, Andy ;
Zerbino, Daniel R. ;
Flicek, Paul .
NUCLEIC ACIDS RESEARCH, 2015, 43 (D1) :D662-D669