Sensitivity, specificity, and reproducibility of RNA-Seq differential expression calls

被引:25
作者
Labaj, Pawel P. [1 ,2 ]
Kreil, David P. [2 ]
机构
[1] Austrian Acad Sci, Vienna, Austria
[2] Boku Univ, Bioinformat Res Grp, Vienna, Austria
关键词
RNA-seq; Sensitivity; Specificity; Reproducibility; Differential expression calling; GENE; PACKAGE;
D O I
10.1186/s13062-016-0169-7
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: The MAQC/SEQC consortium has recently compiled a key benchmark that can serve for testing the latest developments in analysis tools for microarray and RNA-seq expression profiling. Such objective benchmarks are required for basic and applied research, and can be critical for clinical and regulatory outcomes. Going beyond the first comparisons presented in the original SEQC study, we here present extended benchmarks including effect strengths typical of common experiments. Results: With artefacts removed by factor analysis and additional filters, for genome scale surveys, the reproducibility of differential expression calls typically exceed 80% for all tool combinations examined. This directly reflects the robustness of results and reproducibility across different studies. Similar improvements are observed for the top ranked candidates with the strongest relative expression change, although here some tools clearly perform better than others, with typical reproducibility ranging from 60 to 93%. Conclusions: In our benchmark of alternative tools for RNA-seq data analysis we demonstrated the benefits that can be gained by analysing results in the context of other experiments employing a reference standard sample. This allowed the computational identification and removal of hidden confounders, for instance, by factor analysis. In itself, this already substantially improved the empirical False Discovery Rate (eFDR) without changing the overall landscape of sensitivity. Further filtering of false positives, however, is required to obtain acceptable eFDR levels. Appropriate filters noticeably improved agreement of differentially expressed genes both across sites and between alternative differential expression analysis pipelines.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Novel Data Transformations for RNA-seq Differential Expression Analysis
    Zhang, Zeyu
    Yu, Danyang
    Seo, Minseok
    Hersh, Craig P.
    Weiss, Scott T.
    Qiu, Weiliang
    SCIENTIFIC REPORTS, 2019, 9 (1)
  • [22] A comparison of methods for differential expression analysis of RNA-seq data
    Soneson, Charlotte
    Delorenzi, Mauro
    BMC BIOINFORMATICS, 2013, 14
  • [23] A comparison of methods for differential expression analysis of RNA-seq data
    Charlotte Soneson
    Mauro Delorenzi
    BMC Bioinformatics, 14
  • [24] Detecting differential expression from RNA-seq data with expression measurement uncertainty
    Li Zhang
    Songcan Chen
    Xuejun Liu
    Frontiers of Computer Science, 2015, 9 : 652 - 663
  • [25] Current RNA-seq methodology reporting limits reproducibility
    Simoneau, Joel
    Dumontier, Simon
    Gosselin, Ryan
    Scott, Michelle S.
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (01) : 140 - 145
  • [26] RNA-seq reproducibility of Pseudomonas aeruginosa in laboratory models of cystic fibrosis
    Duncan, Rebecca P.
    Lewin, Gina R.
    Cornforth, Daniel M.
    Diggle, Frances L.
    Kapur, Ananya
    Moustafa, Dina A.
    Hilliam, Yasmin
    Bomberger, Jennifer M.
    Whiteley, Marvin
    Goldberg, Joanna B.
    MICROBIOLOGY SPECTRUM, 2025, 13 (01)
  • [27] Finding consistent patterns: A nonparametric approach for identifying differential expression in RNA-Seq data
    Li, Jun
    Tibshirani, Robert
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2013, 22 (05) : 519 - 536
  • [28] RNA-seq differential expression studies: more sequence or more replication?
    Liu, Yuwen
    Zhou, Jie
    White, Kevin P.
    BIOINFORMATICS, 2014, 30 (03) : 301 - 304
  • [29] Impact of human gene annotations on RNA-seq differential expression analysis
    Yu Hamaguchi
    Chao Zeng
    Michiaki Hamada
    BMC Genomics, 22
  • [30] LFCseq: a nonparametric approach for differential expression analysis of RNA-seq data
    Bingqing Lin
    Li-Feng Zhang
    Xin Chen
    BMC Genomics, 15