Mixed Distribution Models Based on Single-Cell RNA Sequencing Data

被引:0
|
作者
Min Wu
Junhua Xu
Tao Ding
Jie Gao
机构
[1] Jiangnan University,School of Science
[2] Newcastle University,School of Mathematics Statistics and Physics
来源
Interdisciplinary Sciences: Computational Life Sciences | 2021年 / 13卷
关键词
Colorectal cancer (CRC); Mixed stable-normal distribution (MSND) model; Mixed stable-exponential distribution (MSED) model; Stable distribution; Cauchy distribution;
D O I
暂无
中图分类号
学科分类号
摘要
Progress in single-cell RNA sequencing (scRNA-seq) has yielded a lot of valuable data. Analysis of these data can provide a new perspective for studying the intratumoral heterogeneity and identifying gene markers. In this paper, the scRNA-seq data of colorectal cancer (CRC) are analyzed, and it is found that the shape of the gene expression difference (GED) data shows certain distribution regularity. To study the distribution regularity, mixed stable-normal distribution (MSND) model and mixed stable-exponential distribution (MSED) model are constructed to fit the GED data. And the estimated parameters of MSND and MSED are used to describe some characteristics of their distribution. Through the comparison of root mean square error and the chi-squared goodness of fit test, it is found that the fitting effect of MSED and MSND are both better than that of stable distribution and Cauchy distribution. Considering the given quantile thresholds, MSND and MSED can be used to identify tumor-related genes. The results of functional analysis indicate that the selected genes are highly correlated with CRC. In addition, the parameters of MSND and MSED exhibit a certain trend with the development of CRC. To explore the association, Gene-set enrichment analysis (GSEA) is performed. The results of GSEA reveal that the trend can well characterize the intratumoral heterogeneity of CRC. In addition, the application of MSED model on hepatocellular carcinoma shows that our model can analyze other cancers. Overall, MSND model and MSED model can well fit the GED data in different disease stages, the parameters of the two models can characterize the heterogeneity of CRC tumor cells, and the two models can be used to identify genes highly correlated with tumors.
引用
收藏
页码:362 / 370
页数:8
相关论文
共 50 条
  • [41] Cell-specific network constructed by single-cell RNA sequencing data
    Dai, Hao
    Li, Lin
    Zeng, Tao
    Chen, Luonan
    NUCLEIC ACIDS RESEARCH, 2019, 47 (11)
  • [42] scGEAToolbox: a Matlab toolbox for single-cell RNA sequencing data analysis
    Cai, James J.
    BIOINFORMATICS, 2020, 36 (06) : 1948 - 1949
  • [43] SSBER: removing batch effect for single-cell RNA sequencing data
    Yin Zhang
    Fei Wang
    BMC Bioinformatics, 22
  • [44] splatPop: simulating population scale single-cell RNA sequencing data
    Christina B. Azodi
    Luke Zappia
    Alicia Oshlack
    Davis J. McCarthy
    Genome Biology, 22
  • [45] Significance analysis for clustering with single-cell RNA-sequencing data
    Isabella N. Grabski
    Kelly Street
    Rafael A. Irizarry
    Nature Methods, 2023, 20 : 1196 - 1202
  • [46] Tutorial: guidelines for the computational analysis of single-cell RNA sequencing data
    Andrews, Tallulah S.
    Kiselev, Vladimir Yu
    McCarthy, Davis
    Hemberg, Martin
    NATURE PROTOCOLS, 2021, 16 (01) : 1 - 9
  • [47] Resolution of the curse of dimensionality in single-cell RNA sequencing data analysis
    Imoto, Yusuke
    Nakamura, Tomonori
    Escolar, Emerson G.
    Yoshiwaki, Michio
    Kojima, Yoji
    Yabuta, Yukihiro
    Katou, Yoshitaka
    Yamamoto, Takuya
    Hiraoka, Yasuaki
    Saitou, Mitinori
    LIFE SCIENCE ALLIANCE, 2022, 5 (12)
  • [48] Differential gene expression analysis in single-cell RNA sequencing data
    Wang, Tianyu
    Nabavi, Sheida
    2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2017, : 202 - 207
  • [49] Accelerated dimensionality reduction of single-cell RNA sequencing data with fastglmpca
    Weine, Eric
    Carbonetto, Peter
    Stephens, Matthew
    BIOINFORMATICS, 2024, 40 (08)
  • [50] SAREV: A review on statistical analytics of single-cell RNA sequencing data
    Ellis, Dorothy
    Wu, Dongyuan
    Datta, Susmita
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2022, 14 (04):