A deep matrix factorization based approach for single-cell RNA-seq data clustering

被引:2
作者
Liang, Zhenlan [1 ]
Zheng, Ruiqing [1 ]
Chen, Siqi [1 ]
Yan, Xuhua [1 ]
Li, Min [1 ]
机构
[1] Cent South Univ, Sch Comp Sci & Engn, Changsha 410083, Peoples R China
关键词
Single-cellRNA-seq; Clustering; Autoencoder; Matrixfactorization; GENE-EXPRESSION; HETEROGENEITY; PROFILES;
D O I
10.1016/j.ymeth.2022.06.010
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The rapid development of single-cell sequencing technologies makes it possible to analyze cellular heterogeneity at the single-cell level. Cell clustering is one of the most fundamental and common steps in the heterogeneity analysis. However, due to the high noise level, high dimensionality and high sparsity, accurate cell clustering is still challengeable. Here, we present DeepCI, a new clustering approach for scRNA-seq data. Using two autoencoders to obtain cell embedding and gene embedding, DeepCI can simultaneously learn cell low -dimensional representation and clustering. In addition, the recovered gene expression matrix can be obtained by the matrix multiplication of cell and gene embedding. To evaluate the performance of DeepCI, we performed it on several real scRNA-seq datasets for clustering and visualization analysis. The experimental results show that DeepCI obtains the overall better performance than several popular single cell analysis methods. We also eval-uated the imputation performance of DeepCI by a dedicated experiment. The corresponding results show that the imputed gene expression of known specific marker genes can greatly improve the accuracy of cell type classification.
引用
收藏
页码:114 / 122
页数:9
相关论文
共 55 条
  • [1] Angerer Philipp, 2017, Current Opinion in Systems Biology, V4, P85, DOI 10.1016/j.coisb.2017.07.004
  • [2] xCell: digitally portraying the tissue cellular heterogeneity landscape
    Aran, Dvir
    Hu, Zicheng
    Butte, Atul J.
    [J]. GENOME BIOLOGY, 2017, 18
  • [3] Bandyopadhyay G, 2019, FASEB J, V33
  • [4] Computational analysis of cell-to-cell heterogeneity in single-cell RNA-sequencing data reveals hidden subpopulations of cells
    Buettner, Florian
    Natarajan, Kedar N.
    Casale, F. Paolo
    Proserpio, Valentina
    Scialdone, Antonio
    Theis, Fabian J.
    Teichmann, Sarah A.
    Marioni, John C.
    Stegie, Oliver
    [J]. NATURE BIOTECHNOLOGY, 2015, 33 (02) : 155 - 160
  • [5] Integrating single-cell transcriptomic data across different conditions, technologies, and species
    Butler, Andrew
    Hoffman, Paul
    Smibert, Peter
    Papalexi, Efthymia
    Satija, Rahul
    [J]. NATURE BIOTECHNOLOGY, 2018, 36 (05) : 411 - +
  • [6] Comprehensive single-cell transcriptional profiling of a multicellular organism
    Cao, Junyue
    Packer, Jonathan S.
    Ramani, Vijay
    Cusanovich, Darren A.
    Huynh, Chau
    Daza, Riza
    Qiu, Xiaojie
    Lee, Choli
    Furlan, Scott N.
    Steemers, Frank J.
    Adey, Andrew
    Waterston, Robert H.
    Trapnell, Cole
    Shendure, Jay
    [J]. SCIENCE, 2017, 357 (6352) : 661 - 667
  • [7] scRMD: imputation for single cell RNA-seq data via robust matrix decomposition
    Chen, Chong
    Wu, Changjing
    Wu, Linjie
    Wang, Xiaochen
    Deng, Minghua
    Xi, Ruibin
    [J]. BIOINFORMATICS, 2020, 36 (10) : 3156 - 3161
  • [8] Choi YH, 2019, MOL CELLS, V42, P189
  • [9] Single-cell RNA-sequencing analyses identify heterogeneity of CD8+ T cell subpopulations and novel therapy targets in melanoma
    Deng, Weiwei
    Ma, Yubo
    Su, Zhen
    Liu, Yufang
    Liang, Panpan
    Huang, Chen
    Liu, Xiao
    Shao, Jin
    Zhang, Yi
    Zhang, Kai
    Chen, Jian
    Li, Ruoyu
    [J]. MOLECULAR THERAPY-ONCOLYTICS, 2021, 20 : 105 - 118
  • [10] Stochastic gene expression in a single cell
    Elowitz, MB
    Levine, AJ
    Siggia, ED
    Swain, PS
    [J]. SCIENCE, 2002, 297 (5584) : 1183 - 1186