A benchmark for RNA-seq quantification pipelines

Cited by: 114
Authors
Teng, Mingxiang [1, 2, 9]
Love, Michael I. [1, 2]
Davis, Carrie A. [3]
Djebali, Sarah [4, 5]
Dobin, Alexander [3]
Graveley, Brenton R. [6]
Li, Sheng [7]
Mason, Christopher E. [7]
Olson, Sara [6]
Pervouchine, Dmitri [4, 5]
Sloan, Cricket A. [8]
Wei, Xintao [6]
Zhan, Lijun [6]
Irizarry, Rafael A. [1, 2]
Affiliations
[1] Dana Farber Canc Inst, Dept Biostat & Computat Biol, 450 Brookline Ave, Boston, MA 02215 USA
[2] Harvard Univ, TH Chan Sch Publ Hlth, Dept Biostat, 677 Huntington Ave, Boston, MA 02115 USA
[3] Cold Spring Harbor Lab, Funct Genom Grp, 1 Bungtown Rd, Cold Spring Harbor, NY 11724 USA
[4] Ctr Genom Regulat CRG, Bioinformat & Genom Programme, Doctor Aiguader 88, Barcelona 08003, Spain
[5] UPF, Doctor Aiguader 88, Barcelona 08003, Spain
[6] UConn Hlth Ctr, Inst Syst Genom, Dept Genet & Genome Sci, Farmington, CT 06030 USA
[7] Weill Cornell Med Coll, Dept Physiol & Biophys, New York, NY USA
[8] Stanford Univ, Dept Genet, 300 Pasteur Dr, Stanford, CA 94305 USA
[9] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150006, Peoples R China
Source
GENOME BIOLOGY | 2016 / Volume 17
Keywords
GENE-EXPRESSION; CELL; TRANSCRIPTOMES; NORMALIZATION; ABUNDANCE; ALIGNMENT;
DOI
10.1186/s13059-016-0940-1
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology];
Discipline classification code
071005 ; 0836 ; 090102 ; 100705 ;
Abstract
Obtaining RNA-seq measurements involves a complex data analytical process with a large number of competing algorithms as options. There is much debate about which of these methods provides the best approach. Unfortunately, it is currently difficult to evaluate their performance due in part to a lack of sensitive assessment metrics. We present a series of statistical summaries and plots to evaluate the performance in terms of specificity and sensitivity, available as an R/Bioconductor package (http://bioconductor.org/packages/rnaseqcomp). Using two independent datasets, we assessed seven competing pipelines. Performance was generally poor, with two methods clearly underperforming and RSEM slightly outperforming the rest.
Pages: 12