Parametric analysis of RNA-seq expression data

被引:4
|
作者
Konishi, Tomokazu [1 ]
机构
[1] Akita Prefectural Univ, Fac Bioresource Sci, Akita 0100195, Japan
关键词
DIFFERENTIAL EXPRESSION; NORMALIZATION; MODEL; SAGE;
D O I
10.1111/gtc.12372
中图分类号
Q2 [细胞生物学];
学科分类号
071009 ; 090102 ;
摘要
Various methods had been introduced for normalization and comparison of RNA-seq count data. However, they lacked objectivity because they based on ad hoc assumptions that were never verified their appropriateness. Here, we introduced a method that assumes parsimony models on data distribution; the assumptions were verified according to exploratory data analysis. As was expected, count data were lognormally distributed. The level of noise in recent data appeared to be much higher than those of microarrays. Still, the appropriate distribution model would improve certainty and accuracy of normalization, by finding out the reliable range of data. Primary cause of noise was not the principle of the methodology; that is, each read is a trial that which transcript is read. Rather, the cause would be overlooking of transcripts, and the overlooking occurred more often among lower range of data. To find out genes likely to be overlooked, number of replications would be more important than read depth, which will not prevent overlooking. Both signal and noise in the reliable range of data were distributed normally, showing the suitability to use generalized linear model to evaluate differences in expression levels. In the framework, normalized data can be compared and combined freely beyond studies.
引用
收藏
页码:639 / 647
页数:9
相关论文
共 50 条
  • [21] ExpressionPlot: a web-based framework for analysis of RNA-Seq and microarray gene expression data
    Friedman, Brad A.
    Maniatis, Tom
    GENOME BIOLOGY, 2011, 12 (07):
  • [22] Differential expression analysis of RNA-seq data at single-base resolution
    Frazee, Alyssa C.
    Sabunciyan, Sarven
    Hansen, Kasper D.
    Irizarry, Rafael A.
    Leek, Jeffrey T.
    BIOSTATISTICS, 2014, 15 (03) : 413 - 426
  • [23] Measuring differential gene expression with RNA-seq: challenges and strategies for data analysis
    Finotello, Francesca
    Di Camillo, Barbara
    BRIEFINGS IN FUNCTIONAL GENOMICS, 2015, 14 (02) : 130 - 142
  • [24] An iteration normalization and test method for differential expression analysis of RNA-seq data
    Zhou, Yan
    Lin, Nan
    Zhang, Baoxue
    BIODATA MINING, 2014, 7
  • [25] Comprehensive evaluation of differential gene expression analysis methods for RNA-seq data
    Rapaport, Franck
    Khanin, Raya
    Liang, Yupu
    Pirun, Mono
    Krek, Azra
    Zumbo, Paul
    Mason, Christopher E.
    Socci, Nicholas D.
    Betel, Doron
    GENOME BIOLOGY, 2013, 14 (09):
  • [26] Empirical Bayes Analysis of RNA-seq Data for Detection of Gene Expression Heterosis
    Niemi, Jarad
    Mittman, Eric
    Landau, Will
    Nettleton, Dan
    JOURNAL OF AGRICULTURAL BIOLOGICAL AND ENVIRONMENTAL STATISTICS, 2015, 20 (04) : 614 - 628
  • [27] An integrative method to normalize RNA-Seq data
    Filloux, Cyril
    Cedric, Meersseman
    Romain, Philippe
    Lionel, Forestier
    Christophe, Klopp
    Dominique, Rocha
    Abderrahman, Maftah
    Daniel, Petit
    BMC BIOINFORMATICS, 2014, 15
  • [28] Power analysis and sample size estimation for RNA-Seq differential expression
    Ching, Travers
    Huang, Sijia
    Garmire, Lana X.
    RNA, 2014, 20 (11) : 1684 - 1696
  • [29] Getting the most out of RNA-seq data analysis
    Khang, Tsung Fei
    Lau, Ching Yee
    PEERJ, 2015, 3
  • [30] NPEBseq: nonparametric empirical bayesian-based procedure for differential expression analysis of RNA-seq data
    Bi, Yingtao
    Davuluri, Ramana V.
    BMC BIOINFORMATICS, 2013, 14