Mixed Distribution Models Based on Single-Cell RNA Sequencing Data

被引:0
|
作者
Min Wu
Junhua Xu
Tao Ding
Jie Gao
机构
[1] Jiangnan University,School of Science
[2] Newcastle University,School of Mathematics Statistics and Physics
来源
Interdisciplinary Sciences: Computational Life Sciences | 2021年 / 13卷
关键词
Colorectal cancer (CRC); Mixed stable-normal distribution (MSND) model; Mixed stable-exponential distribution (MSED) model; Stable distribution; Cauchy distribution;
D O I
暂无
中图分类号
学科分类号
摘要
Progress in single-cell RNA sequencing (scRNA-seq) has yielded a lot of valuable data. Analysis of these data can provide a new perspective for studying the intratumoral heterogeneity and identifying gene markers. In this paper, the scRNA-seq data of colorectal cancer (CRC) are analyzed, and it is found that the shape of the gene expression difference (GED) data shows certain distribution regularity. To study the distribution regularity, mixed stable-normal distribution (MSND) model and mixed stable-exponential distribution (MSED) model are constructed to fit the GED data. And the estimated parameters of MSND and MSED are used to describe some characteristics of their distribution. Through the comparison of root mean square error and the chi-squared goodness of fit test, it is found that the fitting effect of MSED and MSND are both better than that of stable distribution and Cauchy distribution. Considering the given quantile thresholds, MSND and MSED can be used to identify tumor-related genes. The results of functional analysis indicate that the selected genes are highly correlated with CRC. In addition, the parameters of MSND and MSED exhibit a certain trend with the development of CRC. To explore the association, Gene-set enrichment analysis (GSEA) is performed. The results of GSEA reveal that the trend can well characterize the intratumoral heterogeneity of CRC. In addition, the application of MSED model on hepatocellular carcinoma shows that our model can analyze other cancers. Overall, MSND model and MSED model can well fit the GED data in different disease stages, the parameters of the two models can characterize the heterogeneity of CRC tumor cells, and the two models can be used to identify genes highly correlated with tumors.
引用
收藏
页码:362 / 370
页数:8
相关论文
共 50 条
  • [1] Mixed Distribution Models Based on Single-Cell RNA Sequencing Data
    Wu, Min
    Xu, Junhua
    Ding, Tao
    Gao, Jie
    INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2021, 13 (03) : 362 - 370
  • [2] Analysis of single-cell RNA sequencing data based on autoencoders
    Andrea Tangherloni
    Federico Ricciuti
    Daniela Besozzi
    Pietro Liò
    Ana Cvejic
    BMC Bioinformatics, 22
  • [3] Analysis of single-cell RNA sequencing data based on autoencoders
    Tangherloni, Andrea
    Ricciuti, Federico
    Besozzi, Daniela
    Lio, Pietro
    Cvejic, Ana
    BMC BIOINFORMATICS, 2021, 22 (01)
  • [4] Evaluation of single-cell classifiers for single-cell RNA sequencing data sets
    Zhao, Xinlei
    Wu, Shuang
    Fang, Nan
    Sun, Xiao
    Fan, Jue
    BRIEFINGS IN BIOINFORMATICS, 2020, 21 (05) : 1581 - 1595
  • [5] Complex Analysis of Single-Cell RNA Sequencing Data
    Khozyainova, Anna A. A.
    Valyaeva, Anna A. A.
    Arbatsky, Mikhail S. S.
    Isaev, Sergey V. V.
    Iamshchikov, Pavel S. S.
    Volchkov, Egor V. V.
    Sabirov, Marat S. S.
    Zainullina, Viktoria R. R.
    Chechekhin, Vadim I. I.
    Vorobev, Rostislav S. S.
    Menyailo, Maxim E. E.
    Tyurin-Kuzmin, Pyotr A. A.
    Denisov, Evgeny V. V.
    BIOCHEMISTRY-MOSCOW, 2023, 88 (02) : 231 - 252
  • [6] Splatter: simulation of single-cell RNA sequencing data
    Zappia, Luke
    Phipson, Belinda
    Oshlack, Alicia
    GENOME BIOLOGY, 2017, 18
  • [7] Complex Analysis of Single-Cell RNA Sequencing Data
    Anna A. Khozyainova
    Anna A. Valyaeva
    Mikhail S. Arbatsky
    Sergey V. Isaev
    Pavel S. Iamshchikov
    Egor V. Volchkov
    Marat S. Sabirov
    Viktoria R. Zainullina
    Vadim I. Chechekhin
    Rostislav S. Vorobev
    Maxim E. Menyailo
    Pyotr A. Tyurin-Kuzmin
    Evgeny V. Denisov
    Biochemistry (Moscow), 2023, 88 : 231 - 252
  • [8] Splatter: simulation of single-cell RNA sequencing data
    Luke Zappia
    Belinda Phipson
    Alicia Oshlack
    Genome Biology, 18
  • [9] The Poisson distribution model fits UMI-based single-cell RNA-sequencing data
    Yue Pan
    Justin T. Landis
    Razia Moorad
    Di Wu
    J. S. Marron
    Dirk P. Dittmer
    BMC Bioinformatics, 24
  • [10] The Poisson distribution model fits UMI-based single-cell RNA-sequencing data
    Pan, Yue
    Landis, Justin T.
    Moorad, Razia
    Wu, Di
    Marron, J. S.
    Dittmer, Dirk P.
    BMC BIOINFORMATICS, 2023, 24 (01)