PROPER: comprehensive power evaluation for differential expression using RNA-seq

被引:67
|
作者
Wu, Hao [1 ]
Wang, Chi [2 ,3 ]
Wu, Zhijin [4 ]
机构
[1] Emory Univ, Dept Biostat & Bioinformat, Atlanta, GA 30322 USA
[2] Univ Kentucky, Dept Biostat, Lexington, KY 40536 USA
[3] Univ Kentucky, Markey Canc Ctr, Lexington, KY 40536 USA
[4] Brown Univ, Dept Biostat, Providence, RI 02806 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
SAMPLE-SIZE CALCULATION; BIOCONDUCTOR; SEQUENCE;
D O I
10.1093/bioinformatics/btu640
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: RNA-seq has become a routine technique in differential expression (DE) identification. Scientists face a number of experimental design decisions, including the sample size. The power for detecting differential expression is affected by several factors, including the fraction of DE genes, distribution of the magnitude of DE, distribution of gene expression level, sequencing coverage and the choice of type I error control. The complexity and flexibility of RNA-seq experiments, the high-throughput nature of transcriptome-wide expression measurements and the unique characteristics of RNA-seq data make the power assessment particularly challenging. Results: We propose prospective power assessment instead of a direct sample size calculation by making assumptions on all of these factors. Our power assessment tool includes two components: (i) a semi-parametric simulation that generates data based on actual RNA-seq experiments with flexible choices on baseline expressions, biological variations and patterns of DE; and (ii) a power assessment component that provides a comprehensive view of power. We introduce the concepts of stratified power and false discovery cost, and demonstrate the usefulness of our method in experimental design (such as sample size and sequencing depth), as well as analysis plan (gene filtering).
引用
收藏
页码:233 / 241
页数:9
相关论文
共 50 条
  • [11] Differential Expression Analysis in RNA-seq Data Using a Geometric Approach
    Tambonis, Tiago
    Boareto, Marcelo
    Leite, Vitor B. P.
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2018, 25 (11) : 1257 - 1265
  • [12] Comprehensive Evaluation of RNA-seq Quantification Methods for Linearity
    Jin, Haijing
    Wan, Ying-Wooi
    Liu, Zhandong
    PROCEEDINGS OF THE 7TH ACM INTERNATIONAL CONFERENCE ON BIOINFORMATICS, COMPUTATIONAL BIOLOGY, AND HEALTH INFORMATICS, 2016, : 539 - 539
  • [13] Comprehensive evaluation of RNA-seq quantification methods for linearity
    Jin, Haijing
    Wan, Ying-Wooi
    Liu, Zhandong
    BMC BIOINFORMATICS, 2017, 18
  • [14] Comprehensive evaluation of RNA-seq quantification methods for linearity
    Haijing Jin
    Ying-Wooi Wan
    Zhandong Liu
    BMC Bioinformatics, 18
  • [15] A Comprehensive Evaluation of Alignment Algorithms in the Context of RNA-Seq
    Lindner, Robert
    Friedel, Caroline C.
    PLOS ONE, 2012, 7 (12):
  • [16] Differential expression analysis for paired RNA-seq data
    Chung, Lisa M.
    Ferguson, John P.
    Zheng, Wei
    Qian, Feng
    Bruno, Vincent
    Montgomery, Ruth R.
    Zhao, Hongyu
    BMC BIOINFORMATICS, 2013, 14 : 110
  • [17] Power analysis for RNA-Seq differential expression studies using generalized linear mixed effects models
    Yu, Lianbo
    Fernandez, Soledad
    Brock, Guy N.
    BMC BIOINFORMATICS, 2020, 21 (01)
  • [18] The impact of amplification on differential expression analyses by RNA-seq
    Swati Parekh
    Christoph Ziegenhain
    Beate Vieth
    Wolfgang Enard
    Ines Hellmann
    Scientific Reports, 6
  • [19] Identifying differential expression for RNA-seq data with no replication
    Gim, Jungsoo
    Park, Taesung
    2012 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS (BIBMW), 2012,
  • [20] From RNA-seq reads to differential expression results
    Alicia Oshlack
    Mark D Robinson
    Matthew D Young
    Genome Biology, 11