Parametric analysis of RNA-seq expression data

Cited by: 4
Author
Konishi, Tomokazu [1]
Affiliation
[1] Akita Prefectural Univ, Fac Bioresource Sci, Akita 0100195, Japan
Keywords
DIFFERENTIAL EXPRESSION; NORMALIZATION; MODEL; SAGE;
DOI
10.1111/gtc.12372
Chinese Library Classification number
Q2 [Cell biology];
Discipline classification code
071009 ; 090102 ;
Abstract
Various methods have been introduced for the normalization and comparison of RNA-seq count data. However, they lack objectivity because they are based on ad hoc assumptions whose appropriateness has never been verified. Here, we introduce a method that assumes parsimonious models of the data distribution; the assumptions were verified by exploratory data analysis. As expected, count data were lognormally distributed. The level of noise in recent data appeared to be much higher than that of microarrays. Still, an appropriate distribution model would improve the certainty and accuracy of normalization by identifying the reliable range of the data. The primary cause of noise was not the principle of the methodology, namely that each read is a trial determining which transcript is read. Rather, the cause would be the overlooking of transcripts, which occurred more often in the lower range of the data. To identify genes likely to be overlooked, the number of replicates would be more important than read depth, which does not prevent overlooking. Both signal and noise in the reliable range of the data were normally distributed, showing that a generalized linear model is suitable for evaluating differences in expression levels. Within this framework, normalized data can be compared and combined freely across studies.
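The abstract's central claims (lognormally distributed counts, and normalization based on a distribution model) can be illustrated with a minimal sketch. This is not the author's procedure, which the abstract does not specify. It assumes a plain two-parameter lognormal model, estimates location and scale robustly from the median and interquartile range of the log counts, and treats zero counts as unobserved. The function names `log_z_scores` and `qq_correlation` and the synthetic data are hypothetical and used only for illustration.

```python
import math
import random
import statistics


def log_z_scores(counts):
    """Z-normalize log counts; zero counts are returned as None (unobserved).

    Assumption: a two-parameter lognormal model, with location and scale
    estimated from the median and IQR of the positive log counts.
    Requires a nonzero IQR among the positive counts.
    """
    logs = [math.log(c) for c in counts if c > 0]
    q1, med, q3 = statistics.quantiles(logs, n=4)
    scale = (q3 - q1) / 1.349  # the IQR of a standard normal is about 1.349
    return [(math.log(c) - med) / scale if c > 0 else None for c in counts]


def qq_correlation(z_scores):
    """Pearson correlation between sorted z-scores and normal quantiles.

    Values near 1 indicate that the log counts are close to normal,
    i.e. that the counts are close to lognormal.
    """
    obs = sorted(z for z in z_scores if z is not None)
    n = len(obs)
    normal = statistics.NormalDist()
    theo = [normal.inv_cdf((i + 0.5) / n) for i in range(n)]
    mean_o = statistics.fmean(obs)
    mean_t = statistics.fmean(theo)
    cov = sum((o - mean_o) * (t - mean_t) for o, t in zip(obs, theo))
    var_o = sum((o - mean_o) ** 2 for o in obs)
    var_t = sum((t - mean_t) ** 2 for t in theo)
    return cov / math.sqrt(var_o * var_t)


# Synthetic counts drawn from a lognormal distribution (illustrative only,
# not real RNA-seq data); rounding produces a few zero counts.
random.seed(0)
synthetic_counts = [round(math.exp(random.gauss(3.0, 1.5))) for _ in range(2000)]
z = log_z_scores(synthetic_counts)
print(f"QQ correlation on synthetic data: {qq_correlation(z):.3f}")
```

On real data, a departure from the straight line in the lower tail of such a quantile comparison would be one way to look for the less reliable low-count range that the abstract describes.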
Pages: 639-647
Number of pages: 9