Full-length transcriptome assembly from RNA-Seq data without a reference genome

被引:16300
作者
Grabherr, Manfred G. [2 ]
Haas, Brian J. [2 ]
Yassour, Moran [1 ,2 ]
Levin, Joshua Z. [2 ]
Thompson, Dawn A. [2 ]
Amit, Ido [2 ]
Adiconis, Xian [2 ]
Fan, Lin [2 ]
Raychowdhury, Raktima [2 ]
Zeng, Qiandong [2 ]
Chen, Zehua [2 ]
Mauceli, Evan [2 ]
Hacohen, Nir [2 ]
Gnirke, Andreas [2 ]
Rhind, Nicholas [3 ]
di Palma, Federica [2 ]
Birren, Bruce W. [2 ]
Nusbaum, Chad [2 ]
Lindblad-Toh, Kerstin [2 ,4 ]
Friedman, Nir [1 ,5 ]
Regev, Aviv [2 ,6 ]
机构
[1] Hebrew Univ Jerusalem, Sch Comp Sci, Jerusalem, Israel
[2] Broad Inst Massachusetts Inst Technol & Harvard, Cambridge, MA USA
[3] Univ Massachusetts, Sch Med, Dept Mol Pharmacol & Biochem, Worcester, MA USA
[4] Uppsala Univ, Dept Med Biochem & Microbiol, Sci Life Lab, Uppsala, Sweden
[5] Hebrew Univ Jerusalem, Alexander Silberman Inst Life Sci, Jerusalem, Israel
[6] MIT, Howard Hughes Med Inst, Dept Biol, Cambridge, MA USA
基金
美国国家卫生研究院;
关键词
SCHIZOSACCHAROMYCES-POMBE; EUKARYOTIC TRANSCRIPTOME; MESSENGER-RNA; ALIGNMENT; GENE; RESOURCE; REVEALS; PROTEIN; YEAST; TOOL;
D O I
10.1038/nbt.1883
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Massively parallel sequencing of cDNA has enabled deep and efficient probing of transcriptomes. Current approaches for transcript reconstruction from such data often rely on aligning reads to a reference genome, and are thus unsuitable for samples with a partial or missing reference genome. Here we present the Trinity method for de novo assembly of full-length transcripts and evaluate it on samples from fission yeast, mouse and whitefly, whose reference genome is not yet available. By efficiently constructing and analyzing sets of de Bruijn graphs, Trinity fully reconstructs a large fraction of transcripts, including alternatively spliced isoforms and transcripts from recently duplicated genes. Compared with other de novo transcriptome assemblers, Trinity recovers more full-length transcripts across a broad range of expression levels, with a sensitivity similar to methods that rely on genome alignments. Our approach provides a unified solution for transcriptome reconstruction in any sample, especially in the absence of a reference genome.
引用
收藏
页码:644 / U130
页数:11
相关论文
共 35 条
[1]   De novo transcriptome assembly with ABySS [J].
Birol, Inanc ;
Jackman, Shaun D. ;
Nielsen, Cydney B. ;
Qian, Jenny Q. ;
Varhol, Richard ;
Stazyk, Greg ;
Morin, Ryan D. ;
Zhao, Yongjun ;
Hirst, Martin ;
Schein, Jacqueline E. ;
Horsman, Doug E. ;
Connors, Joseph M. ;
Gascoyne, Randy D. ;
Marra, Marco A. ;
Jones, Steven J. M. .
BIOINFORMATICS, 2009, 25 (21) :2872-2877
[2]   ALLPATHS: De novo assembly of whole-genome shotgun microreads [J].
Butler, Jonathan ;
MacCallum, Iain ;
Kleber, Michael ;
Shlyakhter, Ilya A. ;
Belmonte, Matthew K. ;
Lander, Eric S. ;
Nusbaum, Chad ;
Jaffe, David B. .
GENOME RESEARCH, 2008, 18 (05) :810-820
[3]  
de Bruijn N. G., 1946, P SECT SCI KONINKLIJ, V49, P758
[4]  
Good I.J., 1946, J. London Math. Soc, V21, P167
[5]   Genome-wide synteny through highly sensitive sequence alignment: Satsuma [J].
Grabherr, Manfred G. ;
Russell, Pamela ;
Meyer, Miriah ;
Mauceli, Evan ;
Alfoldi, Jessica ;
Di Palma, Federica ;
Lindblad-Toh, Kerstin .
BIOINFORMATICS, 2010, 26 (09) :1145-1151
[6]   Alternative splicing: increasing diversity in the proteomic world [J].
Graveley, BR .
TRENDS IN GENETICS, 2001, 17 (02) :100-107
[7]   Ab initio reconstruction of cell type-specific transcriptomes in mouse reveals the conserved multi-exonic structure of lincRNAs [J].
Guttman, Mitchell ;
Garber, Manuel ;
Levin, Joshua Z. ;
Donaghey, Julie ;
Robinson, James ;
Adiconis, Xian ;
Fan, Lin ;
Koziol, Magdalena J. ;
Gnirke, Andreas ;
Nusbaum, Chad ;
Rinn, John L. ;
Lander, Eric S. ;
Regev, Aviv .
NATURE BIOTECHNOLOGY, 2010, 28 (05) :503-U166
[8]   Advancing RNA-Seq analysis [J].
Haas, Brian J. ;
Zody, Michael C. .
NATURE BIOTECHNOLOGY, 2010, 28 (05) :421-423
[9]   GeneDB: a resource for prokaryotic and eukaryotic organisms [J].
Hertz-Fowler, C ;
Peacock, CS ;
Wood, V ;
Aslett, M ;
Kerhornou, A ;
Mooney, P ;
Tivey, A ;
Berriman, M ;
Hall, N ;
Rutherford, K ;
Parkhill, J ;
Ivens, AC ;
Rajandream, MA ;
Barrell, B .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D339-D343
[10]  
Kent WJ, 2002, GENOME RES, V12, P656, DOI [10.1101/gr.229202, 10.1101/gr.229202. Article published online before March 2002]