A tail-based test to detect differential expression in RNA-sequencing data

Cited by: 2
Authors
Chen, Jiong [1]
Mi, Xinlei [2]
Ning, Jing [3]
He, Xuming [4]
Hu, Jianhua [2]
Affiliations
[1] LinkedIn, Data Sci, Mountain View, CA USA
[2] Columbia Univ, Dept Biostat, New York, NY 10032 USA
[3] Univ Texas MD Anderson Canc Ctr, Dept Biostat, Houston, TX 77030 USA
[4] Univ Michigan, Dept Stat, Ann Arbor, MI 48109 USA
Funding
U.S. National Science Foundation;
Keywords
Correlated data; differential expression analysis; quantile regression; RNA sequencing; robust tail-based test; LUNG-CANCER; REGRESSION;
DOI
10.1177/0962280220951907
Chinese Library Classification number
R19 [Health care organizations and services (health services administration)];
Abstract
RNA sequencing data have been abundantly generated in biomedical research for biomarker discovery and other studies. Such data at the exon level are usually heavy-tailed and correlated. Conventional statistical tests based on the mean or median difference for differential expression likely suffer from low power when the between-group difference occurs mostly in the upper or lower tail of the distribution of gene expression. We propose a tail-based test to make comparisons between groups in terms of a specific distribution area rather than a single location. The proposed test, which is derived from quantile regression, adjusts for covariates and accounts for within-sample dependence among the exons through a specified correlation structure. Through Monte Carlo simulation studies, we show that the proposed test is generally more powerful and robust in detecting differential expression than commonly used tests based on the mean or a single quantile. An application to TCGA lung adenocarcinoma data demonstrates the promise of the proposed method in terms of biomarker discovery.
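The intuition in the abstract, that a comparison over a tail region can pick up a difference that a mean comparison dilutes, can be illustrated with a small simulation. The sketch below is not the authors' test: it ignores covariates and exon-level correlation, and it swaps in a plain permutation test on the average difference of empirical quantiles. The function names (`tail_statistic`, `permutation_pvalue`), the quantile grid `taus`, and the simulated data are all assumptions made for illustration.

```python
import numpy as np

def tail_statistic(x, y, taus):
    """Average difference of empirical quantiles (y minus x) over the levels in taus."""
    return float(np.mean(np.quantile(y, taus) - np.quantile(x, taus)))

def permutation_pvalue(x, y, stat, n_perm=2000, seed=0):
    """Two-sided permutation p-value for a two-sample statistic stat(x, y)."""
    rng = np.random.default_rng(seed)
    observed = abs(stat(x, y))
    pooled = np.concatenate([x, y])
    hits = 0
    for _ in range(n_perm):
        perm = rng.permutation(pooled)
        if abs(stat(perm[:len(x)], perm[len(x):])) >= observed:
            hits += 1
    # Add-one correction keeps the p-value strictly positive.
    return (hits + 1) / (n_perm + 1)

# Hypothetical data: heavy-tailed expression values for two groups.
rng = np.random.default_rng(1)
n = 200
control = rng.standard_t(df=3, size=n)
treated = rng.standard_t(df=3, size=n)
# Shift only the top 20% of the treated group, so the groups differ in the upper tail.
treated = np.where(treated > np.quantile(treated, 0.8), treated + 1.5, treated)

# Compare groups over an upper-tail region (assumed grid) versus at the mean.
taus = np.linspace(0.80, 0.95, 7)
p_tail = permutation_pvalue(control, treated, lambda a, b: tail_statistic(a, b, taus))
p_mean = permutation_pvalue(control, treated, lambda a, b: float(np.mean(b) - np.mean(a)))

print(f"tail-region p-value: {p_tail:.4f}")
print(f"mean-difference p-value: {p_mean:.4f}")
```

Averaging over a grid of quantile levels, rather than testing a single quantile, mirrors the abstract's idea of comparing "a specific distribution area rather than a single location"; the regression adjustment and correlation structure described there would need a quantile-regression framework that this sketch does not attempt.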
Pages: 261 - 276
Number of pages: 16
Related papers
50 in total
  • [41] Combining bulk RNA-sequencing and single-cell RNA-sequencing data to reveal the immune microenvironment and metabolic pattern of osteosarcoma
    Huang, Ruichao
    Wang, Xiaohu
    Yin, Xiangyun
    Zhou, Yaqi
    Sun, Jiansheng
    Yin, Zhongxiu
    Zhu, Zhi
    FRONTIERS IN GENETICS, 2022, 13
  • [42] Comprehensive long non-coding RNA expression profiling from the TCGA HNSCC RNA-sequencing data
    Nohata, Nijiro
    Abba, Martin
    Amornphimoltham, Panomwat
    Gutkind, J. Silvio
    CANCER RESEARCH, 2016, 76
  • [43] Analysis of cellulose synthase gene expression strategies in higher plants using RNA-sequencing data
    Ts. A. Padvitski
    D. V. Galinousky
    N. V. Anisimova
    G. Ya. Baer
    Ya. V. Pirko
    A. I. Yemets
    L. V. Khotyleva
    Ya. B. Blume
    A. V. Kilchevsky
    Cytology and Genetics, 2017, 51 : 8 - 17
  • [44] Novel hybrid DCNN-SVM model for classifying RNA-sequencing gene expression data
    Huynh, Phuoc-Hai
    Nguyen, Van-Hoa
    Do, Thanh-Nghi
    JOURNAL OF INFORMATION AND TELECOMMUNICATION, 2019, 3 (04) : 533 - 547
  • [45] Benchmarking of RNA-sequencing analysis workflows using whole-transcriptome RT-qPCR expression data
    Everaert, Celine
    Luypaert, Manuel
    Maag, Jesper L. V.
    Cheng, Quek Xiu
    Dinger, Marcel E.
    Hellemans, Jan
    Mestdagh, Pieter
    SCIENTIFIC REPORTS, 2017, 7
  • [46] ReQTL: identifying correlations between expressed SNVs and gene expression using RNA-sequencing data
    Spurr, Liam F.
    Alomran, Nawaf
    Bousounis, Pavlos
    Reece-Stremtan, Dacian
    Prashant, N. M.
    Liu, Hongyu
    Slowinski, Piotr
    Li, Muzi
    Zhang, Qianqian
    Sein, Justin
    Asher, Gabriel
    Crandall, Keith A.
    Tsaneva-Atanasova, Krasimira
    Horvath, Anelia
    BIOINFORMATICS, 2020, 36 (05) : 1351 - 1359
  • [47] Analysis of cellulose synthase gene expression strategies in higher plants using RNA-sequencing data
    Padvitski, Ts. A.
    Galinousky, D. V.
    Anisimova, N. V.
    Baer, G. Ya.
    Pirko, Ya. V.
    Yemets, A. I.
    Khotyleva, L. V.
    Blume, Ya. B.
    Kilchevsky, A. V.
    CYTOLOGY AND GENETICS, 2017, 51 (01) : 8 - 17
  • [48] Demultiplexing of single-cell RNA-sequencing data using interindividual variation in gene expression
    Nassiri, Isar
    Kwok, Andrew J.
    Bhandari, Aneesha
    Bull, Katherine R.
    Garner, Lucy C.
    Klenerman, Paul
    Webber, Caleb
    Parkkinen, Laura
    Lee, Angela W.
    Wu, Yanxia
    Fairfax, Benjamin
    Knight, Julian C.
    Buck, David
    Piazza, Paolo
    BIOINFORMATICS ADVANCES, 2024, 4 (01)
  • [49] Inferring the kinetics of stochastic gene expression from single-cell RNA-sequencing data
    Kim, Jong Kyoung
    Marioni, John C.
    GENOME BIOLOGY, 2013, 14 (01) : 1 - 12
  • [50] Robust identification of regulatory variants (eQTLs) using a differential expression framework developed for RNA-sequencing
    Mackenzie A. Marrella
    Fernando H. Biase
    Journal of Animal Science and Biotechnology, 2023, 14 (05) : 1869 - 1879