Differential expression in RNA-seq: A matter of depth

Cited by: 1180
Authors
Tarazona, Sonia [1,2]
Garcia-Alcalde, Fernando [1]
Dopazo, Joaquin [1]
Ferrer, Alberto
Conesa, Ana [1]
Affiliations
[1] Ctr Invest Principe Felipe, Bioinformat & Genom Dept, Valencia 46012, Spain
[2] Univ Politecn Valencia, Dept Appl Stat Operat Res & Qual, Valencia 46022, Spain
Keywords
TRANSCRIPTIONAL LANDSCAPE; GENE; REPRODUCIBILITY; POLYADENYLATION; GENOME
DOI
10.1101/gr.124321.111
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Subject classification code
071010; 081704
Abstract
Next-generation sequencing (NGS) technologies are revolutionizing genome research, and in particular, their application to transcriptomics (RNA-seq) is increasingly being used for gene expression profiling as a replacement for microarrays. However, the properties of RNA-seq data have not yet been fully established, and additional research is needed to understand how these data respond to differential expression analysis. In this work, we set out to gain insights into the characteristics of RNA-seq data analysis by studying an important parameter of this technology: the sequencing depth. We have analyzed how sequencing depth affects the detection of transcripts and their identification as differentially expressed, looking at aspects such as transcript biotype, length, expression level, and fold-change. We have evaluated different algorithms available for the analysis of RNA-seq and proposed a novel approach, NOISeq, that differs from existing methods in that it is data-adaptive and nonparametric. Our results reveal that most existing methodologies suffer from a strong dependency on sequencing depth for their differential expression calls and that this results in a considerable number of false positives that increases as the number of reads grows. In contrast, our proposed method models the noise distribution from the actual data, can therefore better adapt to the size of the data set, and is more effective in controlling the rate of false discoveries. This work discusses the true potential of RNA-seq for studying regulation at low expression ranges, the noise within RNA-seq data, and the issue of replication.
Pages: 2213-2223
Page count: 11
Related papers
50 items in total
  • [31] Evaluation of methods for differential expression analysis on multi-group RNA-seq count data
    Tang, Min
    Sun, Jianqiang
    Shimizu, Kentaro
    Kadota, Koji
    BMC BIOINFORMATICS, 2015, 16
  • [32] Impact of sequencing depth and technology on de novo RNA-Seq assembly
    Patterson, Jordan
    Carpenter, Eric J.
    Zhu, Zhenzhen
    An, Dan
    Liang, Xinming
    Geng, Chunyu
    Drmanac, Radoje
    Wong, Gane Ka-Shu
    BMC GENOMICS, 2019, 20 (1)
  • [33] SplicingCompass: differential splicing detection using RNA-Seq data
    Aschoff, Moritz
    Hotz-Wagenblatt, Agnes
    Glatting, Karl-Heinz
    Fischer, Matthias
    Eils, Roland
    Koenig, Rainer
    BIOINFORMATICS, 2013, 29 (09) : 1141 - 1148
  • [34] AuPairWise: A Method to Estimate RNA-Seq Replicability through Co-expression
    Ballouz, Sara
    Gillis, Jesse
    PLOS COMPUTATIONAL BIOLOGY, 2016, 12 (04)
  • [35] Gene dispersion is the key determinant of the read count bias in differential expression analysis of RNA-seq data
    Yoon, Sora
    Nam, Dougu
    BMC GENOMICS, 2017, 18
  • [36] RNA-Seq gene expression estimation with read mapping uncertainty
    Li, Bo
    Ruotti, Victor
    Stewart, Ron M.
    Thomson, James A.
    Dewey, Colin N.
    BIOINFORMATICS, 2010, 26 (04) : 493 - 500
  • [37] Identification of Prostate Cancer LncRNAs by RNA-Seq
    Hu, Cheng-Cheng
    Gan, Ping
    Zhang, Rui-Ying
    Xue, Jin-Xia
    Ran, Long-Ke
    ASIAN PACIFIC JOURNAL OF CANCER PREVENTION, 2014, 15 (21) : 9439 - 9444
  • [38] Differential gene network analysis from single cell RNA-seq
    Wang, Yikai
    Wu, Hao
    Yu, Tianwei
    JOURNAL OF GENETICS AND GENOMICS, 2017, 44 (06) : 331 - 334
  • [39] RNA-Seq Transcriptome Analysis of Potato with Differential Tolerance to Bentazone Herbicide
    Guo, Jing
    Song, Xiuli
    Sun, Shiqi
    Shao, Baihui
    Tao, Bo
    Zhang, Lili
    AGRONOMY-BASEL, 2021, 11 (05)
  • [40] Advancing RNA-Seq analysis
    Haas, Brian J.
    Zody, Michael C.
    NATURE BIOTECHNOLOGY, 2010, 28 (05) : 421 - 423