Differential expression in RNA-seq: A matter of depth

Cited by: 1180
Authors
Tarazona, Sonia [1, 2]
Garcia-Alcalde, Fernando [1 ]
Dopazo, Joaquin [1 ]
Ferrer, Alberto
Conesa, Ana [1 ]
Affiliations
[1] Ctr Invest Principe Felipe, Bioinformat & Genom Dept, Valencia 46012, Spain
[2] Univ Politecn Valencia, Dept Appl Stat Operat Res & Qual, Valencia 46022, Spain
Keywords
TRANSCRIPTIONAL LANDSCAPE; GENE; REPRODUCIBILITY; POLYADENYLATION; GENOME;
DOI
10.1101/gr.124321.111
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010; 081704;
Abstract
Next-generation sequencing (NGS) technologies are revolutionizing genome research, and in particular, their application to transcriptomics (RNA-seq) is increasingly being used for gene expression profiling as a replacement for microarrays. However, the properties of RNA-seq data have not yet been fully established, and additional research is needed to understand how these data respond to differential expression analysis. In this work, we set out to gain insights into the characteristics of RNA-seq data analysis by studying an important parameter of this technology: the sequencing depth. We have analyzed how sequencing depth affects the detection of transcripts and their identification as differentially expressed, looking at aspects such as transcript biotype, length, expression level, and fold-change. We have evaluated different algorithms available for the analysis of RNA-seq and proposed a novel approach, NOISeq, that differs from existing methods in that it is data-adaptive and nonparametric. Our results reveal that most existing methodologies suffer from a strong dependency on sequencing depth for their differential expression calls and that this results in a considerable number of false positives that increases as the number of reads grows. In contrast, our proposed method models the noise distribution from the actual data, can therefore better adapt to the size of the data set, and is more effective in controlling the rate of false discoveries. This work discusses the true potential of RNA-seq for studying regulation at low expression ranges, the noise within RNA-seq data, and the issue of replication.
Pages: 2213-2223
Page count: 11