A tail-based test to detect differential expression in RNA-sequencing data

被引:2
|
作者
Chen, Jiong [1 ]
Mi, Xinlei [2 ]
Ning, Jing [3 ]
He, Xuming [4 ]
Hu, Jianhua [2 ]
机构
[1] LinkedIn, Data Sci, Mountain View, CA USA
[2] Columbia Univ, Dept Biostat, New York, NY 10032 USA
[3] Univ Texas MD Anderson Canc Ctr, Dept Biostat, Houston, TX 77030 USA
[4] Univ Michigan, Dept Stat, Ann Arbor, MI 48109 USA
基金
美国国家科学基金会;
关键词
Correlated data; differential expression analysis; quantile regression; RNA sequencing; robust tail-based test; LUNG-CANCER; REGRESSION;
D O I
10.1177/0962280220951907
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
RNA sequencing data have been abundantly generated in biomedical research for biomarker discovery and other studies. Such data at the exon level are usually heavily tailed and correlated. Conventional statistical tests based on the mean or median difference for differential expression likely suffer from low power when the between-group difference occurs mostly in the upper or lower tail of the distribution of gene expression. We propose a tail-based test to make comparisons between groups in terms of a specific distribution area rather than a single location. The proposed test, which is derived from quantile regression, adjusts for covariates and accounts for within-sample dependence among the exons through a specified correlation structure. Through Monte Carlo simulation studies, we show that the proposed test is generally more powerful and robust in detecting differential expression than commonly used tests based on the mean or a single quantile. An application to TCGA lung adenocarcinoma data demonstrates the promise of the proposed method in terms of biomarker discovery.
引用
收藏
页码:261 / 276
页数:16
相关论文
共 50 条
  • [21] Evaluation of methods to detect circular RNAs from single-end RNA-sequencing data
    Manh Hung Nguyen
    Ha-Nam Nguyen
    Trung Nghia Vu
    BMC GENOMICS, 2022, 23 (01)
  • [22] Bias detection and correction in RNA-Sequencing data
    Wei Zheng
    Lisa M Chung
    Hongyu Zhao
    BMC Bioinformatics, 12
  • [23] A comparison of survival analysis methods for cancer gene expression RNA-Sequencing data
    Raman, Pichai
    Zimmerman, Samuel
    Rathi, Komal S.
    de Torrente, Laurence
    Sarmady, Mahdi
    Wu, Chao
    Leipzig, Jeremy
    Taylor, Deanne M.
    Tozeren, Aydin
    Mar, Jessica C.
    CANCER GENETICS, 2019, 235 : 1 - 12
  • [24] Bias detection and correction in RNA-Sequencing data
    Zheng, Wei
    Chung, Lisa M.
    Zhao, Hongyu
    BMC BIOINFORMATICS, 2011, 12
  • [25] Efficient experimental design and analysis strategies for the detection of differential expression using RNA-Sequencing
    José A Robles
    Sumaira E Qureshi
    Stuart J Stephen
    Susan R Wilson
    Conrad J Burden
    Jennifer M Taylor
    BMC Genomics, 13
  • [26] Efficient experimental design and analysis strategies for the detection of differential expression using RNA-Sequencing
    Robles, Jose A.
    Qureshi, Sumaira E.
    Stephen, Stuart J.
    Wilson, Susan R.
    Burden, Conrad J.
    Taylor, Jennifer M.
    BMC GENOMICS, 2012, 13
  • [27] Statistical inference of differential RNA-editing sites from RNA-sequencing data by hierarchical modeling
    Tran, Stephen S.
    Zhou, Qing
    Xiao, Xinshu
    BIOINFORMATICS, 2020, 36 (09) : 2796 - 2804
  • [28] Differential gene expression analysis tools exhibit substandard performance for long non-coding RNA-sequencing data
    Assefa, Alemu Takele
    De Paepe, Katrijn
    Everaert, Celine
    Mestdagh, Pieter
    Thas, Olivier
    Vandesompele, Jo
    GENOME BIOLOGY, 2018, 19
  • [29] Differential gene expression analysis tools exhibit substandard performance for long non-coding RNA-sequencing data
    Alemu Takele Assefa
    Katrijn De Paepe
    Celine Everaert
    Pieter Mestdagh
    Olivier Thas
    Jo Vandesompele
    Genome Biology, 19
  • [30] Fast identification of differential distributions in single-cell RNA-sequencing data with waddR
    Schefzik, Roman
    Flesch, Julian
    Goncalves, Angela
    BIOINFORMATICS, 2021, 37 (19) : 3204 - 3211