Differential Expression Analysis for RNA-Seq: An Overview of Statistical Methods and Computational Software

被引:18
作者
Huang, Huei-Chung [1 ]
Niu, Yi [1 ]
Qin, Li-Xuan [1 ]
机构
[1] Mem Sloan Kettering Canc Ctr, Dept Epidemiol & Biostat, New York, NY 10021 USA
关键词
RNA sequencing; differential expression analysis; overview; statistical methods; software;
D O I
10.4137/CIN.S21631
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Deep sequencing has recently emerged as a powerful alternative to microarrays for the high-throughput profiling of gene expression. In order to account for the discrete nature of RNA sequencing data, new statistical methods and computational tools have been developed for the analysis of differential expression to identify genes that are relevant to a disease such as cancer. In this paper, it is thus timely to provide an overview of these analysis methods and tools. For readers with statistical background, we also review the parameter estimation algorithms and hypothesis testing strategies used in these methods.
引用
收藏
页码:57 / 67
页数:11
相关论文
共 46 条
[1]   Differential expression analysis for sequence count data [J].
Anders, Simon ;
Huber, Wolfgang .
GENOME BIOLOGY, 2010, 11 (10)
[2]   Count-based differential expression analysis of RNA sequencing data using R and Bioconductor [J].
Anders, Simon ;
McCarthy, Davis J. ;
Chen, Yunshun ;
Okoniewski, Michal ;
Smyth, Gordon K. ;
Huber, Wolfgang ;
Robinson, Mark D. .
NATURE PROTOCOLS, 2013, 8 (09) :1765-1786
[3]   A Two-Stage Poisson Model for Testing RNA-Seq Data [J].
Auer, Paul L. ;
Doerge, Rebecca W. .
STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2011, 10 (01)
[4]   Differential expression in SAGE: accounting for normal between-library variation [J].
Baggerly, KA ;
Deng, L ;
Morris, JS ;
Aldaz, CM .
BIOINFORMATICS, 2003, 19 (12) :1477-1483
[5]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[6]   Summarizing and correcting the GC content bias in high-throughput sequencing [J].
Benjamini, Yuval ;
Speed, Terence P. .
NUCLEIC ACIDS RESEARCH, 2012, 40 (10) :e72
[7]   A comparison of normalization methods for high density oligonucleotide array data based on variance and bias [J].
Bolstad, BM ;
Irizarry, RA ;
Åstrand, M ;
Speed, TP .
BIOINFORMATICS, 2003, 19 (02) :185-193
[8]   Evaluation of statistical methods for normalization and differential expression in mRNA-Seq experiments [J].
Bullard, James H. ;
Purdom, Elizabeth ;
Hansen, Kasper D. ;
Dudoit, Sandrine .
BMC BIOINFORMATICS, 2010, 11
[9]  
Chen Y, EDGER DIFFERENTIAL E
[10]  
COX DR, 1987, J ROY STAT SOC B MET, V49, P1