Mixed Distribution Models Based on Single-Cell RNA Sequencing Data

被引:0
|
作者
Min Wu
Junhua Xu
Tao Ding
Jie Gao
机构
[1] Jiangnan University,School of Science
[2] Newcastle University,School of Mathematics Statistics and Physics
来源
Interdisciplinary Sciences: Computational Life Sciences | 2021年 / 13卷
关键词
Colorectal cancer (CRC); Mixed stable-normal distribution (MSND) model; Mixed stable-exponential distribution (MSED) model; Stable distribution; Cauchy distribution;
D O I
暂无
中图分类号
学科分类号
摘要
Progress in single-cell RNA sequencing (scRNA-seq) has yielded a lot of valuable data. Analysis of these data can provide a new perspective for studying the intratumoral heterogeneity and identifying gene markers. In this paper, the scRNA-seq data of colorectal cancer (CRC) are analyzed, and it is found that the shape of the gene expression difference (GED) data shows certain distribution regularity. To study the distribution regularity, mixed stable-normal distribution (MSND) model and mixed stable-exponential distribution (MSED) model are constructed to fit the GED data. And the estimated parameters of MSND and MSED are used to describe some characteristics of their distribution. Through the comparison of root mean square error and the chi-squared goodness of fit test, it is found that the fitting effect of MSED and MSND are both better than that of stable distribution and Cauchy distribution. Considering the given quantile thresholds, MSND and MSED can be used to identify tumor-related genes. The results of functional analysis indicate that the selected genes are highly correlated with CRC. In addition, the parameters of MSND and MSED exhibit a certain trend with the development of CRC. To explore the association, Gene-set enrichment analysis (GSEA) is performed. The results of GSEA reveal that the trend can well characterize the intratumoral heterogeneity of CRC. In addition, the application of MSED model on hepatocellular carcinoma shows that our model can analyze other cancers. Overall, MSND model and MSED model can well fit the GED data in different disease stages, the parameters of the two models can characterize the heterogeneity of CRC tumor cells, and the two models can be used to identify genes highly correlated with tumors.
引用
收藏
页码:362 / 370
页数:8
相关论文
共 50 条
  • [31] A novel prognostic model based on single-cell RNA sequencing data for hepatocellular carcinoma
    Lu, Juan
    Chen, Yanfei
    Zhang, Xiaoqian
    Guo, Jing
    Xu, Kaijin
    Li, Lanjuan
    CANCER CELL INTERNATIONAL, 2022, 22 (01)
  • [32] Identification of immune subtypes of melanoma based on single-cell and bulk RNA sequencing data
    Guo, Linqian
    Meng, Qingrong
    Lin, Wenqi
    Weng, Kaiyuan
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (02) : 2920 - 2936
  • [33] Clustering Single-cell RNA-sequencing Data based on Matching Clusters Structures
    Wang, Yizhang
    Zhou, You
    Pang, Wie
    Liang, Yanchun
    Wang, Shu
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2020, 27 (01): : 89 - 95
  • [34] Regulatory network-based imputation of dropouts in single-cell RNA sequencing data
    Leote, Ana Carolina
    Wu, Xiaohui
    Beyer, Andreas
    PLOS COMPUTATIONAL BIOLOGY, 2022, 18 (02)
  • [35] Clustering single-cell rna-sequencing data based on matching clusters structures
    Wang, Yizhang
    Zhou, You
    Pang, Wie
    Liang, Yanchun
    Wang, Shu
    Tehnicki Vjesnik, 2020, 27 (01): : 89 - 95
  • [36] A novel prognostic model based on single-cell RNA sequencing data for hepatocellular carcinoma
    Juan Lu
    Yanfei Chen
    Xiaoqian Zhang
    Jing Guo
    Kaijin Xu
    Lanjuan Li
    Cancer Cell International, 22
  • [37] A Model for Detecting Type 2 Diabetes Using Mixed Single-Cell RNA Sequencing with Optimized Data
    Padmaja K.
    Mukhopadhyay D.
    SN Computer Science, 4 (6)
  • [38] Cell Classification Based on Stacked Autoencoder for Single-Cell RNA Sequencing
    Qi, Rong
    Zheng, Chun-Hou
    Ji, Cun-Mei
    Yu, Ning
    Ni, Jian-Cheng
    Wang, Yu-Tian
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2022, PT II, 2022, 13394 : 245 - 259
  • [39] A comparison of automatic cell identification methods for single-cell RNA sequencing data
    Abdelaal, Tamim
    Michielsen, Lieke
    Cats, Davy
    Hoogduin, Dylan
    Mei, Hailiang
    Reinders, Marcel J. T.
    Mahfouz, Ahmed
    GENOME BIOLOGY, 2019, 20 (01)
  • [40] A comparison of automatic cell identification methods for single-cell RNA sequencing data
    Tamim Abdelaal
    Lieke Michielsen
    Davy Cats
    Dylan Hoogduin
    Hailiang Mei
    Marcel J. T. Reinders
    Ahmed Mahfouz
    Genome Biology, 20