Differential expression in RNA-seq: A matter of depth

Cited by: 1180
Authors
Tarazona, Sonia [1, 2]
Garcia-Alcalde, Fernando [1]
Dopazo, Joaquin [1]
Ferrer, Alberto
Conesa, Ana [1]
Affiliations
[1] Ctr Invest Principe Felipe, Bioinformat & Genom Dept, Valencia 46012, Spain
[2] Univ Politecn Valencia, Dept Appl Stat Operat Res & Qual, Valencia 46022, Spain
Keywords
TRANSCRIPTIONAL LANDSCAPE; GENE; REPRODUCIBILITY; POLYADENYLATION; GENOME;
DOI
10.1101/gr.124321.111
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
Next-generation sequencing (NGS) technologies are revolutionizing genome research, and in particular, their application to transcriptomics (RNA-seq) is increasingly being used for gene expression profiling as a replacement for microarrays. However, the properties of RNA-seq data have not yet been fully established, and additional research is needed to understand how these data respond to differential expression analysis. In this work, we set out to gain insights into the characteristics of RNA-seq data analysis by studying an important parameter of this technology: the sequencing depth. We have analyzed how sequencing depth affects the detection of transcripts and their identification as differentially expressed, looking at aspects such as transcript biotype, length, expression level, and fold-change. We have evaluated different algorithms available for the analysis of RNA-seq and proposed a novel approach, NOISeq, that differs from existing methods in that it is data-adaptive and nonparametric. Our results reveal that most existing methodologies suffer from a strong dependency on sequencing depth for their differential expression calls and that this results in a considerable number of false positives that increases as the number of reads grows. In contrast, our proposed method models the noise distribution from the actual data, can therefore better adapt to the size of the data set, and is more effective in controlling the rate of false discoveries. This work discusses the true potential of RNA-seq for studying regulation at low expression ranges, the noise within RNA-seq data, and the issue of replication.
Pages: 2213-2223
Number of pages: 11