An empirical likelihood ratio test robust to individual heterogeneity for differential expression analysis of RNA-seq

被引:5
|
作者
Xu, Maoqi [1 ]
Chen, Liang [1 ]
机构
[1] Univ Southern Calif, Los Angeles, CA USA
基金
美国国家卫生研究院;
关键词
cancer transcriptome; differential expression analysis; empirical likelihood ratio test; heterogeneity; RNA-seq; COMPREHENSIVE MOLECULAR CHARACTERIZATION; GENOMIC CHARACTERIZATION; GENETIC-HETEROGENEITY; CANCER;
D O I
10.1093/bib/bbw103
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The individual sample heterogeneity is one of the biggest obstacles in biomarker identification for complex diseases such as cancers. Current statistical models to identify differentially expressed genes between disease and control groups often overlook the substantial human sample heterogeneity. Meanwhile, traditional nonparametric tests lose detailed data information and sacrifice the analysis power, although they are distribution free and robust to heterogeneity. Here, we propose an empirical likelihood ratio test with a mean-variance relationship constraint (ELTSeq) for the differential expression analysis of RNA sequencing (RNA-seq). As a distribution-free nonparametric model, ELTSeq handles individual heterogeneity by estimating an empirical probability for each observation without making any assumption about read-count distribution. It also incorporates a constraint for the read-count overdispersion, which is widely observed in RNA-seq data. ELTSeq demonstrates a significant improvement over existing methods such as edgeR, DESeq, t-tests, Wilcoxon tests and the classic empirical likelihood-ratio test when handling heterogeneous groups. It will significantly advance the transcriptomics studies of cancers and other complex disease.
引用
收藏
页码:109 / 117
页数:9
相关论文
共 50 条
  • [41] Differential Expression Analysis for RNA-Seq: An Overview of Statistical Methods and Computational Software
    Huang, Huei-Chung
    Niu, Yi
    Qin, Li-Xuan
    CANCER INFORMATICS, 2015, 14 : 57 - 67
  • [42] Two-phase differential expression analysis for single cell RNA-seq
    Wu, Zhijin
    Zhang, Yi
    Stitzel, Michael L.
    Wu, Hao
    BIOINFORMATICS, 2018, 34 (19) : 3340 - 3348
  • [43] Sensitivity, specificity, and reproducibility of RNA-Seq differential expression calls
    Paweł P. Łabaj
    David P. Kreil
    Biology Direct, 11
  • [44] Sensitivity, specificity, and reproducibility of RNA-Seq differential expression calls
    Labaj, Pawel P.
    Kreil, David P.
    BIOLOGY DIRECT, 2016, 11
  • [45] Gene dispersion is the key determinant of the read count bias in differential expression analysis of RNA-seq data
    Yoon, Sora
    Nam, Dougu
    BMC GENOMICS, 2017, 18
  • [46] Gene dispersion is the key determinant of the read count bias in differential expression analysis of RNA-seq data
    Sora Yoon
    Dougu Nam
    BMC Genomics, 18
  • [47] Differential gene expression analysis by RNA-seq reveals the importance of actin cytoskeletal proteins in erythroleukemia cells
    Fernandez-Calleja, Vanessa
    Hernandez, Pablo
    Schvartzman, Jorge B.
    Garcia de lacoba, Mario
    Krimer, Dora B.
    PEERJ, 2017, 5 : e3432
  • [48] RNA-Seq Analysis of Differential Gene Expression in Electroporated Chick Embryonic Spinal Cord
    Vieceli, Felipe M.
    Yan, C. Y. Irene
    JOVE-JOURNAL OF VISUALIZED EXPERIMENTS, 2014, (93):
  • [49] Differential Expression Analysis on RNA-Seq Count Data Based on Penalized Matrix Decomposition
    Liu, Jin-Xing
    Gao, Ying-Lian
    Xu, Yong
    Zheng, Chun-Hou
    You, Jane
    IEEE TRANSACTIONS ON NANOBIOSCIENCE, 2014, 13 (01) : 12 - 18
  • [50] A Semi-parametric Bayesian Approach for Differential Expression Analysis of RNA-seq Data
    Fangfang Liu
    Chong Wang
    Peng Liu
    Journal of Agricultural, Biological, and Environmental Statistics, 2015, 20 : 555 - 576