Differential expression in RNA-seq: A matter of depth

被引:1182
|
作者
Tarazona, Sonia [1 ,2 ]
Garcia-Alcalde, Fernando [1 ]
Dopazo, Joaquin [1 ]
Ferrer, Alberto
Conesa, Ana [1 ]
机构
[1] Ctr Invest Principe Felipe, Bioinformat & Genom Dept, Valencia 46012, Spain
[2] Univ Politecn Valencia, Dept Appl Stat Operat Res & Qual, Valencia 46022, Spain
关键词
TRANSCRIPTIONAL LANDSCAPE; GENE; REPRODUCIBILITY; POLYADENYLATION; GENOME;
D O I
10.1101/gr.124321.111
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Next-generation sequencing (NGS) technologies are revolutionizing genome research, and in particular, their application to transcriptomics (RNA-seq) is increasingly being used for gene expression profiling as a replacement for microarrays. However, the properties of RNA-seq data have not been yet fully established, and additional research is needed for understanding how these data respond to differential expression analysis. In this work, we set out to gain insights into the characteristics of RNA-seq data analysis by studying an important parameter of this technology: the sequencing depth. We have analyzed how sequencing depth affects the detection of transcripts and their identification as differentially expressed, looking at aspects such as transcript biotype, length, expression level, and fold-change. We have evaluated different algorithms available for the analysis of RNA-seq and proposed a novel approach-NOISeq-that differs from existing methods in that it is data-adaptive and nonparametric. Our results reveal that most existing methodologies suffer from a strong dependency on sequencing depth for their differential expression calls and that this results in a considerable number of false positives that increases as the number of reads grows. In contrast, our proposed method models the noise distribution from the actual data, can therefore better adapt to the size of the data set, and is more effective in controlling the rate of false discoveries. This work discusses the true potential of RNA-seq for studying regulation at low expression ranges, the noise within RNA-seq data, and the issue of replication.
引用
收藏
页码:2213 / 2223
页数:11
相关论文
共 50 条
  • [41] LFCseq: a nonparametric approach for differential expression analysis of RNA-seq data
    Lin, Bingqing
    Zhang, Li-Feng
    Chen, Xin
    BMC GENOMICS, 2014, 15
  • [42] RNA-Seq differential expression analysis: An extended review and a software tool
    Costa-Silva, Juliana
    Domingues, Douglas
    Lopes, Fabricio Martins
    PLOS ONE, 2017, 12 (12):
  • [43] A fuzzy method for RNA-Seq differential expression analysis in presence of multireads
    Arianna Consiglio
    Corrado Mencar
    Giorgio Grillo
    Flaviana Marzano
    Mariano Francesco Caratozzolo
    Sabino Liuni
    BMC Bioinformatics, 17
  • [44] DEXUS: identifying differential expression in RNA-Seq studies with unknown conditions
    Klambauer, Guenter
    Unterthiner, Thomas
    Hochreiter, Sepp
    NUCLEIC ACIDS RESEARCH, 2013, 41 (21)
  • [45] PROPER: comprehensive power evaluation for differential expression using RNA-seq
    Wu, Hao
    Wang, Chi
    Wu, Zhijin
    BIOINFORMATICS, 2015, 31 (02) : 233 - 241
  • [46] An evaluation of RNA-seq differential analysis methods
    Li, Dongmei
    Zand, Martin S.
    Dye, Timothy D.
    Goniewicz, Maciej L.
    Rahman, Irfan
    Xie, Zidian
    PLOS ONE, 2022, 17 (09):
  • [47] Evaluation of the coverage and depth of transcriptome by RNA-Seq in chickens
    Ying Wang
    Noushin Ghaffari
    Charles D Johnson
    Ulisses M Braga-Neto
    Hui Wang
    Rui Chen
    Huaijun Zhou
    BMC Bioinformatics, 12
  • [48] Accuracy of RNA-Seq and its dependence on sequencing depth
    Guoshuai Cai
    Hua Li
    Yue Lu
    Xuelin Huang
    Juhee Lee
    Peter Müller
    Yuan Ji
    Shoudan Liang
    BMC Bioinformatics, 13
  • [49] Differential expression analysis of RNA-seq data at single-base resolution
    Frazee, Alyssa C.
    Sabunciyan, Sarven
    Hansen, Kasper D.
    Irizarry, Rafael A.
    Leek, Jeffrey T.
    BIOSTATISTICS, 2014, 15 (03) : 413 - 426
  • [50] Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks
    Cole Trapnell
    Adam Roberts
    Loyal Goff
    Geo Pertea
    Daehwan Kim
    David R Kelley
    Harold Pimentel
    Steven L Salzberg
    John L Rinn
    Lior Pachter
    Nature Protocols, 2012, 7 : 562 - 578