PROPER: comprehensive power evaluation for differential expression using RNA-seq

被引:67
|
作者
Wu, Hao [1 ]
Wang, Chi [2 ,3 ]
Wu, Zhijin [4 ]
机构
[1] Emory Univ, Dept Biostat & Bioinformat, Atlanta, GA 30322 USA
[2] Univ Kentucky, Dept Biostat, Lexington, KY 40536 USA
[3] Univ Kentucky, Markey Canc Ctr, Lexington, KY 40536 USA
[4] Brown Univ, Dept Biostat, Providence, RI 02806 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
SAMPLE-SIZE CALCULATION; BIOCONDUCTOR; SEQUENCE;
D O I
10.1093/bioinformatics/btu640
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: RNA-seq has become a routine technique in differential expression (DE) identification. Scientists face a number of experimental design decisions, including the sample size. The power for detecting differential expression is affected by several factors, including the fraction of DE genes, distribution of the magnitude of DE, distribution of gene expression level, sequencing coverage and the choice of type I error control. The complexity and flexibility of RNA-seq experiments, the high-throughput nature of transcriptome-wide expression measurements and the unique characteristics of RNA-seq data make the power assessment particularly challenging. Results: We propose prospective power assessment instead of a direct sample size calculation by making assumptions on all of these factors. Our power assessment tool includes two components: (i) a semi-parametric simulation that generates data based on actual RNA-seq experiments with flexible choices on baseline expressions, biological variations and patterns of DE; and (ii) a power assessment component that provides a comprehensive view of power. We introduce the concepts of stratified power and false discovery cost, and demonstrate the usefulness of our method in experimental design (such as sample size and sequencing depth), as well as analysis plan (gene filtering).
引用
收藏
页码:233 / 241
页数:9
相关论文
共 50 条
  • [1] Comprehensive evaluation of differential gene expression analysis methods for RNA-seq data
    Franck Rapaport
    Raya Khanin
    Yupu Liang
    Mono Pirun
    Azra Krek
    Paul Zumbo
    Christopher E Mason
    Nicholas D Socci
    Doron Betel
    Genome Biology, 14
  • [2] Comprehensive evaluation of differential gene expression analysis methods for RNA-seq data
    Rapaport, Franck
    Khanin, Raya
    Liang, Yupu
    Pirun, Mono
    Krek, Azra
    Zumbo, Paul
    Mason, Christopher E.
    Socci, Nicholas D.
    Betel, Doron
    GENOME BIOLOGY, 2013, 14 (09):
  • [3] Power analysis for RNA-Seq differential expression studies
    Yu, Lianbo
    Fernandez, Soledad
    Brock, Guy
    BMC BIOINFORMATICS, 2017, 18
  • [4] Power analysis for RNA-Seq differential expression studies
    Lianbo Yu
    Soledad Fernandez
    Guy Brock
    BMC Bioinformatics, 18
  • [5] Erratum to: Comprehensive evaluation of differential gene expression analysis methods for RNA-seq data
    Franck Rapaport
    Raya Khanin
    Yupu Liang
    Mono Pirun
    Azra Krek
    Paul Zumbo
    Christopher E. Mason
    Nicholas D. Socci
    Doron Betel
    Genome Biology, 16
  • [6] On Differential Gene Expression Using RNA-Seq Data
    Lee, Juhee
    Ji, Yuan
    Liang, Shoudan
    Cai, Guoshuai
    Mueller, Peter
    CANCER INFORMATICS, 2011, 10 : 205 - 215
  • [7] Power analysis and sample size estimation for RNA-Seq differential expression
    Ching, Travers
    Huang, Sijia
    Garmire, Lana X.
    RNA, 2014, 20 (11) : 1684 - 1696
  • [8] Differential expression in RNA-seq: A matter of depth
    Tarazona, Sonia
    Garcia-Alcalde, Fernando
    Dopazo, Joaquin
    Ferrer, Alberto
    Conesa, Ana
    GENOME RESEARCH, 2011, 21 (12) : 2213 - 2223
  • [9] An evaluation of RNA-seq differential analysis methods
    Li, Dongmei
    Zand, Martin S.
    Dye, Timothy D.
    Goniewicz, Maciej L.
    Rahman, Irfan
    Xie, Zidian
    PLOS ONE, 2022, 17 (09):
  • [10] Differential gene expression analysis using coexpression and RNA-Seq data
    Yang, Ei-Wen
    Girke, Thomas
    Jiang, Tao
    BIOINFORMATICS, 2013, 29 (17) : 2153 - 2161