Novel Data Transformations for RNA-seq Differential Expression Analysis

被引:0
|
作者
Zeyu Zhang
Danyang Yu
Minseok Seo
Craig P. Hersh
Scott T. Weiss
Weiliang Qiu
机构
[1] Tongji University,Department of Bioinformatics, School of Life Sciences and Technology
[2] Hunan University,Department of Information and Computing Science, College of Mathematics and Econometrics
[3] Brigham and Women’s Hospital/Harvard Medical School,Channing Division of Network Medicine
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
We propose eight data transformations (r, r2, rv, rv2, l, l2, lv, and lv2) for RNA-seq data analysis aiming to make the transformed sample mean to be representative of the distribution center since it is not always possible to transform count data to satisfy the normality assumption. Simulation studies showed that for data sets with small (e.g., nCases = nControls = 3) or large sample size (e.g., nCases = nControls = 100) limma based on data from the l, l2, and r2 transformations performed better than limma based on data from the voom transformation in term of accuracy, FDR, and FNR. For datasets with moderate sample size (e.g., nCases = nControls = 30 or 50), limma with the rv and rv2 transformations performed similarly to limma with the voom transformation. Real data analysis results are consistent with simulation analysis results: limma with the r, l, r2, and l2 transformation performed better than limma with the voom transformation when sample sizes are small or large; limma with the rv and rv2 transformations performed similarly to limma with the voom transformation when sample sizes are moderate. We also observed from our data analyses that for datasets with large sample size, the gene-selection via the Wilcoxon rank sum test (a non-parametric two sample test method) based on the raw data outperformed limma based on the transformed data.
引用
收藏
相关论文
共 50 条
  • [31] An iteration normalization and test method for differential expression analysis of RNA-seq data
    Zhou, Yan
    Lin, Nan
    Zhang, Baoxue
    BIODATA MINING, 2014, 7
  • [32] Comprehensive evaluation of differential gene expression analysis methods for RNA-seq data
    Rapaport, Franck
    Khanin, Raya
    Liang, Yupu
    Pirun, Mono
    Krek, Azra
    Zumbo, Paul
    Mason, Christopher E.
    Socci, Nicholas D.
    Betel, Doron
    GENOME BIOLOGY, 2013, 14 (09):
  • [33] Analysis of differential gene expression by RNA-seq data in brain areas of laboratory animals
    Babenko, Vladimir N.
    Bragin, Anatoly O.
    Spitsina, Anastasia M.
    Chadaeva, Irina V.
    Galieva, Elvira R.
    Orlova, Galina V.
    Medvedeva, Irina V.
    Orlov, Yuriy L.
    JOURNAL OF INTEGRATIVE BIOINFORMATICS, 2016, 13 (04) : 292
  • [34] Differential expression analysis of human endogenous retroviruses based on ENCODE RNA-seq data
    Kerstin Haase
    Anja Mösch
    Dmitrij Frishman
    BMC Medical Genomics, 8
  • [35] Enhanced clustering-based differential expression analysis method for RNA-seq data
    Makino, Manon
    Shimizu, Kentaro
    Kadota, Koji
    METHODSX, 2024, 12
  • [36] Differential Expression Analysis on RNA-Seq Count Data Based on Penalized Matrix Decomposition
    Liu, Jin-Xing
    Gao, Ying-Lian
    Xu, Yong
    Zheng, Chun-Hou
    You, Jane
    IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2014, 13 (01) : 12 - 18
  • [37] A Semi-parametric Bayesian Approach for Differential Expression Analysis of RNA-seq Data
    Fangfang Liu
    Chong Wang
    Peng Liu
    Journal of Agricultural, Biological, and Environmental Statistics, 2015, 20 : 555 - 576
  • [38] iDEP: an integrated web application for differential expression and pathway analysis of RNA-Seq data
    Steven Xijin Ge
    Eun Wo Son
    Runan Yao
    BMC Bioinformatics, 19
  • [39] Erratum to: Comprehensive evaluation of differential gene expression analysis methods for RNA-seq data
    Franck Rapaport
    Raya Khanin
    Yupu Liang
    Mono Pirun
    Azra Krek
    Paul Zumbo
    Christopher E. Mason
    Nicholas D. Socci
    Doron Betel
    Genome Biology, 16
  • [40] Gene set enrichment analysis of RNA-Seq data: integrating differential expression and splicing
    Wang, Xi
    Cairns, Murray J.
    BMC BIOINFORMATICS, 2013, 14