Sensitivity, specificity, and reproducibility of RNA-Seq differential expression calls

被引:25
作者
Labaj, Pawel P. [1 ,2 ]
Kreil, David P. [2 ]
机构
[1] Austrian Acad Sci, Vienna, Austria
[2] Boku Univ, Bioinformat Res Grp, Vienna, Austria
关键词
RNA-seq; Sensitivity; Specificity; Reproducibility; Differential expression calling; GENE; PACKAGE;
D O I
10.1186/s13062-016-0169-7
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: The MAQC/SEQC consortium has recently compiled a key benchmark that can serve for testing the latest developments in analysis tools for microarray and RNA-seq expression profiling. Such objective benchmarks are required for basic and applied research, and can be critical for clinical and regulatory outcomes. Going beyond the first comparisons presented in the original SEQC study, we here present extended benchmarks including effect strengths typical of common experiments. Results: With artefacts removed by factor analysis and additional filters, for genome scale surveys, the reproducibility of differential expression calls typically exceed 80% for all tool combinations examined. This directly reflects the robustness of results and reproducibility across different studies. Similar improvements are observed for the top ranked candidates with the strongest relative expression change, although here some tools clearly perform better than others, with typical reproducibility ranging from 60 to 93%. Conclusions: In our benchmark of alternative tools for RNA-seq data analysis we demonstrated the benefits that can be gained by analysing results in the context of other experiments employing a reference standard sample. This allowed the computational identification and removal of hidden confounders, for instance, by factor analysis. In itself, this already substantially improved the empirical False Discovery Rate (eFDR) without changing the overall landscape of sensitivity. Further filtering of false positives, however, is required to obtain acceptable eFDR levels. Appropriate filters noticeably improved agreement of differentially expressed genes both across sites and between alternative differential expression analysis pipelines.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Best practices on the differential expression analysis of multi-species RNA-seq
    Chung, Matthew
    Bruno, Vincent M.
    Rasko, David A.
    Cuomo, Christina A.
    Munoz, Jose F.
    Livny, Jonathan
    Shetty, Amol C.
    Mahurkar, Anup
    Dunning Hotopp, Julie C.
    [J]. GENOME BIOLOGY, 2021, 22 (01)
  • [42] An iteration normalization and test method for differential expression analysis of RNA-seq data
    Yan Zhou
    Nan Lin
    Baoxue Zhang
    [J]. BioData Mining, 7
  • [43] High heterogeneity undermines generalization of differential expression results in RNA-Seq analysis
    Weitong Cui
    Huaru Xue
    Lei Wei
    Jinghua Jin
    Xuewen Tian
    Qinglu Wang
    [J]. Human Genomics, 15
  • [44] contamDE: differential expression analysis of RNA-seq data for contaminated tumor samples
    Shen, Qi
    Hu, Jiyuan
    Jiang, Ning
    Hu, Xiaohua
    Luo, Zewei
    Zhang, Hong
    [J]. BIOINFORMATICS, 2016, 32 (05) : 705 - 712
  • [45] Best practices on the differential expression analysis of multi-species RNA-seq
    Matthew Chung
    Vincent M. Bruno
    David A. Rasko
    Christina A. Cuomo
    José F. Muñoz
    Jonathan Livny
    Amol C. Shetty
    Anup Mahurkar
    Julie C. Dunning Hotopp
    [J]. Genome Biology, 22
  • [46] An iteration normalization and test method for differential expression analysis of RNA-seq data
    Zhou, Yan
    Lin, Nan
    Zhang, Baoxue
    [J]. BIODATA MINING, 2014, 7
  • [47] Measuring differential gene expression with RNA-seq: challenges and strategies for data analysis
    Finotello, Francesca
    Di Camillo, Barbara
    [J]. BRIEFINGS IN FUNCTIONAL GENOMICS, 2015, 14 (02) : 130 - 142
  • [48] A general workflow for differential expression analysis of RNA-seq and introductions on related tools
    Zhang, Zhong
    [J]. PROCEEDINGS OF THE 2016 4TH INTERNATIONAL CONFERENCE ON ELECTRICAL & ELECTRONICS ENGINEERING AND COMPUTER SCIENCE (ICEEECS 2016), 2016, 50 : 328 - 338
  • [49] Interpretation of differential gene expression results of RNA-seq data: review and integration
    McDermaid, Adam
    Monier, Brandon
    Zhao, Jing
    Liu, Bingqiang
    Ma, Qin
    [J]. BRIEFINGS IN BIOINFORMATICS, 2019, 20 (06) : 2044 - 2054
  • [50] Error estimates for the analysis of differential expression from RNA-seq count data
    Burden, Conrad J.
    Qureshi, Sumaira E.
    Wilson, Susan R.
    [J]. PEERJ, 2014, 2