A benchmark for RNA-seq quantification pipelines

Cited by: 114
Authors
Teng, Mingxiang [1, 2, 9]
Love, Michael I. [1, 2]
Davis, Carrie A. [3]
Djebali, Sarah [4, 5]
Dobin, Alexander [3]
Graveley, Brenton R. [6]
Li, Sheng [7]
Mason, Christopher E. [7]
Olson, Sara [6]
Pervouchine, Dmitri [4, 5]
Sloan, Cricket A. [8]
Wei, Xintao [6]
Zhan, Lijun [6]
Irizarry, Rafael A. [1, 2]
Affiliations
[1] Dana Farber Canc Inst, Dept Biostat & Computat Biol, 450 Brookline Ave, Boston, MA 02215 USA
[2] Harvard Univ, TH Chan Sch Publ Hlth, Dept Biostat, 677 Huntington Ave, Boston, MA 02115 USA
[3] Cold Spring Harbor Lab, Funct Genom Grp, 1 Bungtown Rd, Cold Spring Harbor, NY 11724 USA
[4] Ctr Genom Regulat CRG, Bioinformat & Genom Programme, Doctor Aiguader 88, Barcelona 08003, Spain
[5] UPF, Doctor Aiguader 88, Barcelona 08003, Spain
[6] UConn Hlth Ctr, Inst Syst Genom, Dept Genet & Genome Sci, Farmington, CT 06030 USA
[7] Weill Cornell Med Coll, Dept Physiol & Biophys, New York, NY USA
[8] Stanford Univ, Dept Genet, 300 Pasteur Dr, Stanford, CA 94305 USA
[9] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150006, Peoples R China
Source
GENOME BIOLOGY | 2016 / Vol. 17
Keywords
GENE-EXPRESSION; CELL; TRANSCRIPTOMES; NORMALIZATION; ABUNDANCE; ALIGNMENT;
DOI
10.1186/s13059-016-0940-1
Chinese Library Classification number
Q81 [Bioengineering (biotechnology)]; Q93 [Microbiology];
Discipline classification code
071005; 0836; 090102; 100705;
Abstract
Obtaining RNA-seq measurements involves a complex data analytical process with a large number of competing algorithms as options. There is much debate about which of these methods provides the best approach. Unfortunately, it is currently difficult to evaluate their performance due in part to a lack of sensitive assessment metrics. We present a series of statistical summaries and plots to evaluate the performance in terms of specificity and sensitivity, available as an R/Bioconductor package (http://bioconductor.org/packages/rnaseqcomp). Using two independent datasets, we assessed seven competing pipelines. Performance was generally poor, with two methods clearly underperforming and RSEM slightly outperforming the rest.
Pages: 12
Related papers
50 in total
  • [21] NSMAP: A method for spliced isoforms identification and quantification from RNA-Seq
    Xia, Zheng
    Wen, Jianguo
    Chang, Chung-Che
    Zhou, Xiaobo
    BMC BIOINFORMATICS, 2011, 12
  • [22] Integrative analysis with ChIP-seq advances the limits of transcript quantification from RNA-seq
    Liu, Peng
    Sanalkumar, Rajendran
    Bresnick, Emery H.
    Keles, Sunduz
    Dewey, Colin N.
    GENOME RESEARCH, 2016, 26 (08) : 1124 - 1133
  • [23] Principles of transcriptome analysis and gene expression quantification: an RNA-seq tutorial
    Wolf, Jochen B. W.
    MOLECULAR ECOLOGY RESOURCES, 2013, 13 (04) : 559 - 572
  • [24] Reproducible RNA-seq analysis using recount2
    Collado-Torres, Leonardo
    Nellore, Abhinav
    Kammers, Kai
    Ellis, Shannon E.
    Taub, Margaret A.
    Hansen, Kasper D.
    Jaffe, Andrew E.
    Langmead, Ben
    Leek, Jeffrey T.
    NATURE BIOTECHNOLOGY, 2017, 35 (04) : 319 - 321
  • [25] BADGE: A novel Bayesian model for accurate abundance quantification and differential analysis of RNA-Seq data
    Gu, Jinghua
    Wang, Xiao
    Halakivi-Clarke, Leena
    Clarke, Robert
    Xuan, Jianhua
    BMC BIOINFORMATICS, 2014, 15
  • [26] Grape RNA-Seq analysis pipeline environment
    Knowles, David G.
    Roeder, Maik
    Merkel, Angelika
    Guigo, Roderic
    BIOINFORMATICS, 2013, 29 (05) : 614 - 621
  • [27] RNA-Seq optimization with eQTL gold standards
    Ellis, Shannon E.
    Gupta, Simone
    Ashar, Foram N.
    Bader, Joel S.
    West, Andrew B.
    Arking, Dan E.
    BMC GENOMICS, 2013, 14
  • [28] An integrative method to normalize RNA-Seq data
    Filloux, Cyril
    Meersseman, Cédric
    Philippe, Romain
    Forestier, Lionel
    Klopp, Christophe
    Rocha, Dominique
    Maftah, Abderrahman
    Petit, Daniel
    BMC BIOINFORMATICS, 15
  • [29] Computational analysis of bacterial RNA-Seq data
    McClure, Ryan
    Balasubramanian, Divya
    Sun, Yan
    Bobrovskyy, Maksym
    Sumby, Paul
    Genco, Caroline A.
    Vanderpool, Carin K.
    Tjaden, Brian
    NUCLEIC ACIDS RESEARCH, 2013, 41 (14) : e140
  • [30] Estimating accuracy of RNA-Seq and microarrays with proteomics
    Fu, Xing
    Fu, Ning
    Guo, Song
    Yan, Zheng
    Xu, Ying
    Hu, Hao
    Menzel, Corinna
    Chen, Wei
    Li, Yixue
    Zeng, Rong
    Khaitovich, Philipp
    BMC GENOMICS, 2009, 10