Parametric analysis of RNA-seq expression data

被引:4
|
作者
Konishi, Tomokazu [1 ]
机构
[1] Akita Prefectural Univ, Fac Bioresource Sci, Akita 0100195, Japan
关键词
DIFFERENTIAL EXPRESSION; NORMALIZATION; MODEL; SAGE;
D O I
10.1111/gtc.12372
中图分类号
Q2 [细胞生物学];
学科分类号
071009 ; 090102 ;
摘要
Various methods had been introduced for normalization and comparison of RNA-seq count data. However, they lacked objectivity because they based on ad hoc assumptions that were never verified their appropriateness. Here, we introduced a method that assumes parsimony models on data distribution; the assumptions were verified according to exploratory data analysis. As was expected, count data were lognormally distributed. The level of noise in recent data appeared to be much higher than those of microarrays. Still, the appropriate distribution model would improve certainty and accuracy of normalization, by finding out the reliable range of data. Primary cause of noise was not the principle of the methodology; that is, each read is a trial that which transcript is read. Rather, the cause would be overlooking of transcripts, and the overlooking occurred more often among lower range of data. To find out genes likely to be overlooked, number of replications would be more important than read depth, which will not prevent overlooking. Both signal and noise in the reliable range of data were distributed normally, showing the suitability to use generalized linear model to evaluate differences in expression levels. In the framework, normalized data can be compared and combined freely beyond studies.
引用
收藏
页码:639 / 647
页数:9
相关论文
共 50 条
  • [21] LFCseq: a nonparametric approach for differential expression analysis of RNA-seq data
    Lin, Bingqing
    Zhang, Li-Feng
    Chen, Xin
    BMC GENOMICS, 2014, 15
  • [22] Protocol for RNA-seq Expression Analysis in Yeast
    Bohn, Stefan
    BIO-PROTOCOL, 2021, 11 (18):
  • [23] Expression Analysis Stakes Claim on RNA-Seq
    Potera, Carol
    GENETIC ENGINEERING & BIOTECHNOLOGY NEWS, 2012, 32 (12): : 10 - +
  • [24] A semi-parametric Bayesian approach for detection of gene expression heterosis with RNA-seq data
    Bi, Ran
    Liu, Peng
    JOURNAL OF APPLIED STATISTICS, 2023, 50 (01) : 214 - 230
  • [25] Statistical analysis of RNA-seq data at scale
    Leek, Jeff T.
    GENETIC EPIDEMIOLOGY, 2015, 39 (07) : 563 - 563
  • [26] A comprehensive review on RNA-seq data analysis
    Zhang, Li
    Liu, Xuejun
    Transactions of Nanjing University of Aeronautics and Astronautics, 2016, 33 (03) : 339 - 361
  • [27] Dynamic Model for RNA-seq Data Analysis
    Li, Lerong
    Xiong, Momiao
    BIOMED RESEARCH INTERNATIONAL, 2015, 2015
  • [28] Computational analysis of bacterial RNA-Seq data
    McClure, Ryan
    Balasubramanian, Divya
    Sun, Yan
    Bobrovskyy, Maksym
    Sumby, Paul
    Genco, Caroline A.
    Vanderpool, Carin K.
    Tjaden, Brian
    NUCLEIC ACIDS RESEARCH, 2013, 41 (14)
  • [29] RseqFlow: workflows for RNA-Seq data analysis
    Wang, Ying
    Mehta, Gaurang
    Mayani, Rajiv
    Lu, Jingxi
    Souaiaia, Tade
    Chen, Yangho
    Clark, Andrew
    Yoon, Hee Jae
    Wan, Lin
    Evgrafov, Oleg V.
    Knowles, James A.
    Deelman, Ewa
    Chen, Ting
    BIOINFORMATICS, 2011, 27 (18) : 2598 - 2600
  • [30] A Comprehensive Review on RNA-seq Data Analysis
    Zhang Li
    Liu Xuejun
    Transactions of Nanjing University of Aeronautics and Astronautics, 2016, 33 (03) : 339 - 361