Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks

Cited by: 9382
Authors
Trapnell, Cole [1, 2]
Roberts, Adam [3]
Goff, Loyal [1, 2, 4]
Pertea, Geo [5, 6]
Kim, Daehwan [5, 7]
Kelley, David R. [1, 2]
Pimentel, Harold [3]
Salzberg, Steven L. [5, 6]
Rinn, John L. [1, 2]
Pachter, Lior [3, 8, 9]
Affiliations
[1] Broad Inst MIT & Harvard, Cambridge, MA USA
[2] Harvard Univ, Dept Stem Cell & Regenerat Biol, Cambridge, MA 02138 USA
[3] Univ Calif Berkeley, Dept Comp Sci, Berkeley, CA 94720 USA
[4] MIT, Dept Elect Engn & Comp Sci, Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA
[5] Johns Hopkins Univ, Sch Med, Dept Med, McKusick Nathans Inst Genet Med, Baltimore, MD 21205 USA
[6] Johns Hopkins Univ, Dept Biostat, Baltimore, MD 21205 USA
[7] Univ Maryland, Ctr Bioinformat & Computat Biol, College Pk, MD USA
[8] Univ Calif Berkeley, Dept Math, Berkeley, CA 94720 USA
[9] Univ Calif Berkeley, Dept Mol & Cell Biol, Berkeley, CA 94720 USA
Funding
US National Institutes of Health (NIH);
Keywords
SPLICE JUNCTIONS; MESSENGER-RNA; IN-VIVO; IDENTIFICATION; REVEALS; QUANTIFICATION; ANNOTATION;
DOI
10.1038/nprot.2012.016
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification codes
071010; 081704;
Abstract
Recent advances in high-throughput cDNA sequencing (RNA-seq) can reveal new genes and splice variants and quantify expression genome-wide in a single assay. The volume and complexity of data from RNA-seq experiments necessitate scalable, fast and mathematically principled analysis software. TopHat and Cufflinks are free, open-source software tools for gene discovery and comprehensive expression analysis of high-throughput mRNA sequencing (RNA-seq) data. Together, they allow biologists to identify new genes and new splice variants of known ones, as well as compare gene and transcript expression under two or more conditions. This protocol describes in detail how to use TopHat and Cufflinks to perform such analyses. It also covers several accessory tools and utilities that aid in managing data, including CummeRbund, a tool for visualizing RNA-seq analysis results. Although the procedure assumes basic informatics skills, these tools assume little to no background with RNA-seq analysis and are meant for novices and experts alike. The protocol begins with raw sequencing reads and produces a transcriptome assembly, lists of differentially expressed and regulated genes and transcripts, and publication-quality visualizations of analysis results. The protocol's execution time depends on the volume of transcriptome sequencing data and available computing resources but takes less than 1 d of computer time for typical experiments and ~1 h of hands-on time.
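The workflow summarized in the abstract (raw reads, then alignment, assembly, merging and differential expression testing) can be sketched as a short Python helper that only builds the command lines and does not run them. This is a rough illustration, not the protocol's own script: the function name `build_pipeline`, the sample labels, and every file name (`genome`, `genes.gtf`, `genome.fa`, `assemblies.txt`, the `*_thout` and `*_clout` directories) are assumptions made for the example. It also assumes single-end reads and default output locations, so the options should be checked against the installed versions of the tools before use.

```python
from typing import Dict, List


def build_pipeline(samples: Dict[str, List[str]],
                   index: str = "genome",      # hypothetical Bowtie index prefix
                   gtf: str = "genes.gtf",     # hypothetical reference annotation
                   fasta: str = "genome.fa",   # hypothetical genome sequence file
                   threads: int = 8) -> List[List[str]]:
    """Build (but do not run) command lines for a TopHat/Cufflinks analysis.

    `samples` maps a condition label to its replicate FASTQ files
    (single-end reads are assumed here for brevity).
    """
    commands: List[List[str]] = []
    bams_by_condition: Dict[str, List[str]] = {}

    for condition, fastqs in samples.items():
        bams = []
        for i, fastq in enumerate(fastqs, start=1):
            name = f"{condition}_R{i}"
            # Step 1: align the reads of each replicate to the genome.
            commands.append(["tophat", "-p", str(threads), "-G", gtf,
                             "-o", f"{name}_thout", index, fastq])
            bam = f"{name}_thout/accepted_hits.bam"
            # Step 2: assemble transcripts from each replicate's alignments.
            commands.append(["cufflinks", "-p", str(threads),
                             "-o", f"{name}_clout", bam])
            bams.append(bam)
        bams_by_condition[condition] = bams

    # Step 3: merge the per-replicate assemblies. The caller is assumed to
    # have written assemblies.txt, listing one transcripts.gtf path per line.
    commands.append(["cuffmerge", "-g", gtf, "-s", fasta,
                     "-p", str(threads), "assemblies.txt"])

    # Step 4: test for differential expression between conditions;
    # replicates of one condition are passed as a comma-separated list.
    commands.append(["cuffdiff", "-o", "diff_out", "-b", fasta,
                     "-p", str(threads),
                     "-L", ",".join(bams_by_condition),
                     "-u", "merged_asm/merged.gtf"]
                    + [",".join(b) for b in bams_by_condition.values()])
    return commands


if __name__ == "__main__":
    demo = {"C1": ["C1_R1.fq", "C1_R2.fq"], "C2": ["C2_R1.fq", "C2_R2.fq"]}
    for cmd in build_pipeline(demo):
        print(" ".join(cmd))
```

Each returned command is a list of arguments, so it could be passed to `subprocess.run(cmd, check=True)` once the tools are installed. The final visualization step with CummeRbund runs in R and is left out of this sketch.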
Pages: 562-578
Page count: 17