A survey of best practices for RNA-seq data analysis

被引:1595
作者
Conesa, Ana [1 ,2 ]
Madrigal, Pedro [3 ,4 ]
Tarazona, Sonia [2 ,5 ]
Gomez-Cabrero, David [6 ,7 ,8 ,9 ]
Cervera, Alejandra [10 ,11 ]
McPherson, Andrew [12 ]
Szczesniak, Michal Wojciech [13 ]
Gaffney, Daniel J. [3 ]
Elo, Laura L. [14 ,15 ]
Zhang, Xuegong [16 ,17 ,18 ]
Mortazavi, Ali [19 ,20 ]
机构
[1] Univ Florida, Inst Food & Agr Sci, Dept Microbiol & Cell Sci, Gainesville, FL 32603 USA
[2] Ctr Invest Principe Felipe, Genom Gene Express Lab, Valencia 46012, Spain
[3] Wellcome Trust Sanger Inst, Wellcome Trust Genome Campus, Cambridge CB10 1SA, England
[4] Univ Cambridge, Dept Surg, Wellcome Trust Med Res Council Cambridge Stem Cel, Anne McLaren Lab Regenerat Med, Cambridge CB2 0SZ, England
[5] Univ Politecn Valencia, Dept Appl Stat Operat Res & Qual, Valencia 46020, Spain
[6] Karolinska Inst, Karolinska Univ Hosp, Dept Med, Unit Computat Med, S-17177 Stockholm, Sweden
[7] Karolinska Inst, Ctr Mol Med, S-17177 Stockholm, Sweden
[8] Karolinska Univ Hosp, Dept Med, Clin Epidemiol Unit, L8, S-17176 Stockholm, Sweden
[9] Sci Life Lab, S-17121 Solna, Sweden
[10] Univ Helsinki, Syst Biol Lab, Inst Biomed, FIN-00014 Helsinki, Finland
[11] Univ Helsinki, Genome Scale Biol Res Program, FIN-00014 Helsinki, Finland
[12] Simon Fraser Univ, Sch Comp Sci, Burnaby, BC V5A 1S6, Canada
[13] Adam Mickiewicz Univ, Inst Mol Biol & Biotechnol, Dept Bioinformat, PL-61614 Poznan, Poland
[14] Univ Turku, Turku Ctr Biotechnol, FI-20520 Turku, Finland
[15] Abo Akad Univ, FI-20520 Turku, Finland
[16] Tsinghua Univ, Key Lab Bioinformat, Bioinformat Div, TNLIST, Beijing 100084, Peoples R China
[17] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
[18] Tsinghua Univ, Sch Life Sci, Beijing 100084, Peoples R China
[19] Univ Calif Irvine, Dept Dev & Cell Biol, Irvine, CA 92697 USA
[20] Univ Calif Irvine, Ctr Complex Biol Syst, Irvine, CA 92697 USA
来源
GENOME BIOLOGY | 2016年 / 17卷
基金
芬兰科学院;
关键词
DIFFERENTIAL EXPRESSION ANALYSIS; SIMULTANEOUS ISOFORM DISCOVERY; INTEGRATED ANALYSIS MMIA; WEB-BASED TOOL; SINGLE-CELL; GENE-EXPRESSION; DNA-METHYLATION; COMPREHENSIVE EVALUATION; CHROMATIN ACCESSIBILITY; TRANSCRIPTOME ANALYSIS;
D O I
10.1186/s13059-016-0881-8
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
RNA-sequencing (RNA-seq) has a wide variety of applications, but no single analysis pipeline can be used in all cases. We review all of the major steps in RNA-seq data analysis, including experimental design, quality control, read alignment, quantification of gene and transcript levels, visualization, differential gene expression, alternative splicing, functional analysis, gene fusion detection and eQTL mapping. We highlight the challenges associated with each step. We discuss the analysis of small RNAs and the integration of RNA-seq with other functional genomics techniques. Finally, we discuss the outlook for novel technologies that are changing the state of the art in transcriptomics.
引用
收藏
页数:19
相关论文
共 211 条
[1]   Global signatures of protein and mRNA expression levels [J].
Abreu, Raquel de Sousa ;
Penalva, Luiz O. ;
Marcotte, Edward M. ;
Vogel, Christine .
MOLECULAR BIOSYSTEMS, 2009, 5 (12) :1512-1526
[2]   miRDeep*: an integrated application tool for miRNA identification from RNA sequencing data [J].
An, Jiyuan ;
Lai, John ;
Lehman, Melanie L. ;
Nelson, Colleen C. .
NUCLEIC ACIDS RESEARCH, 2013, 41 (02) :727-737
[3]  
Anders S., 2010, GENOME BIOL, V11, pR106, DOI [10.1186/gb-2010-11-10-r106, DOI 10.1186/gb-2010-11-10-r106]
[4]   HTSeq-a Python']Python framework to work with high-throughput sequencing data [J].
Anders, Simon ;
Pyl, Paul Theodor ;
Huber, Wolfgang .
BIOINFORMATICS, 2015, 31 (02) :166-169
[5]   Detecting differential usage of exons from RNA-seq data [J].
Anders, Simon ;
Reyes, Alejandro ;
Huber, Wolfgang .
GENOME RESEARCH, 2012, 22 (10) :2008-2017
[6]   Understanding gene regulatory mechanisms by integrating ChIP-seq and RNA-seq data: statistical solutions to biological problems [J].
Angelini, Claudia ;
Costa, Valerio .
FRONTIERS IN CELL AND DEVELOPMENTAL BIOLOGY, 2014, 2
[7]  
[Anonymous], CURR PROTOC BIOINFOR
[8]  
[Anonymous], SEPIA RNA SMALLRNA S
[9]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[10]   Characterization of the human ESC transcriptome by hybrid sequencing [J].
Au, Kin Fai ;
Sebastiano, Vittorio ;
Afshar, Pegah Tootoonchi ;
Durruthy, Jens Durruthy ;
Lee, Lawrence ;
Williams, Brian A. ;
van Bakel, Harm ;
Schadt, Eric E. ;
Reijo-Pera, Renee A. ;
Underwood, Jason G. ;
Wong, Wing Hung .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2013, 110 (50) :E4821-E4830