Single-cell RNA-seq data clustering by deep information fusion

被引:3
作者
Ren, Liangrui [2 ]
Wang, Jun [3 ]
Li, Wei [4 ]
Guo, Maozu [5 ]
Yu, Guoxian [1 ,2 ]
机构
[1] Shandong Univ, Sch Software, Jinan 250101, Peoples R China
[2] Shandong Univ, Sch Software, Jinan, Peoples R China
[3] Shandong Univ, Joint SDU NTU Ctr Artificial Intelligence Res C FA, Jinan, Peoples R China
[4] Shandong Univ, Sch Control Sci & Engn, Jinan, Peoples R China
[5] Beijing Univ Civil Engn & Architecture, Sch Elect & Informat Engn, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
single-cell RNA-seq clustering; graph convolution network; deep auto-encoder; ZINB; transcriptomics; VISUALIZATION; COMPLEX;
D O I
10.1093/bfgp/elad017
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Determining cell types by single-cell transcriptomics data is fundamental for downstream analysis. However, cell clustering and data imputation still face the computation challenges, due to the high dropout rate, sparsity and dimensionality of single-cell data. Although some deep learning based solutions have been proposed to handle these challenges, they still can not leverage gene attribute information and cell topology in a sensible way to explore the consistent clustering. In this paper, we present scDeepFC, a deep information fusion-based single-cell data clustering method for cell clustering and data imputation. Specifically, scDeepFC uses a deep auto-encoder (DAE) network and a deep graph convolution network to embed high-dimensional gene attribute information and high-order cell-cell topological information into different low-dimensional representations, and then fuses them to generate a more comprehensive and accurate consensus representation via a deep information fusion network. In addition, scDeepFC integrates the zero-inflated negative binomial (ZINB) into DAE to model the dropout events. By jointly optimizing the ZINB loss and cell graph reconstruction loss, scDeepFC generates a salient embedding representation for clustering cells and imputing missing data. Extensive experiments on real single-cell datasets prove that scDeepFC outperforms other popular single-cell analysis methods. Both the gene attribute and cell topology information can improve the cell clustering.
引用
收藏
页码:128 / 137
页数:10
相关论文
共 50 条
  • [21] An End-to-End Deep Hybrid Autoencoder Based Method for Single-Cell RNA-Seq Data Analysis
    Ji, Cunmei
    Yu, Ning
    Wang, Yutian
    Qi, Rong
    Zheng, Chunhou
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (06) : 3889 - 3900
  • [22] Fast and accurate single-cell RNA-seq analysis by clustering of transcript-compatibility counts
    Ntranos, Vasilis
    Kamath, Govinda M.
    Zhang, Jesse M.
    Pachter, Lior
    Tse, David N.
    GENOME BIOLOGY, 2016, 17
  • [23] Deterministic column subset selection for single-cell RNA-Seq
    McCurdy, Shannon R.
    Ntranos, Vasilis
    Pachter, Lior
    PLOS ONE, 2019, 14 (01):
  • [24] A general and flexible method for signal extraction from single-cell RNA-seq data
    Risso, Davide
    Perraudeau, Fanny
    Gribkova, Svetlana
    Dudoit, Sandrine
    Vert, Jean-Philippe
    NATURE COMMUNICATIONS, 2018, 9
  • [25] ScDA: A Denoising AutoEncoder Based Dimensionality Reduction for Single-cell RNA-seq Data
    Zhu, Xiaoshu
    Lin, Yongchang
    Li, Jian
    Wang, Jianxin
    Peng, Xiaoqing
    BIOINFORMATICS RESEARCH AND APPLICATIONS, ISBRA 2021, 2021, 13064 : 534 - 545
  • [26] scBubbletree: computational approach for visualization of single cell RNA-seq data
    Kitanovski, Simo
    Cao, Yingying
    Ttoouli, Dimitris
    Farahpour, Farnoush
    Wang, Jun
    Hoffmann, Daniel
    BMC BIOINFORMATICS, 2024, 25 (01):
  • [27] Accurately Clustering Single-cell RNA-seq data by Capturing Structural Relations between Cells through Graph Convolutional Network
    Zeng, Yuausong
    Zhou, Xiang
    Rao, Jiahua
    Lu, Yutong
    Yang, Yuedong
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 519 - 522
  • [28] scEFSC: Accurate single-cell RNA-seq data analysis via ensemble consensus clustering based on multiple feature selections
    Bian, Chuang
    Wang, Xubin
    Su, Yanchi
    Wang, Yunhe
    Wong, Ka-chun
    Li, Xiangtao
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2022, 20 : 2181 - 2197
  • [29] scAPAmod: Profiling Alternative Polyadenylation Modalities in Single Cells from Single-Cell RNA-Seq Data
    Qian, Lingwu
    Fu, Hongjuan
    Mou, Yunwen
    Lin, Weixu
    Ye, Lishan
    Ji, Guoli
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2022, 23 (15)
  • [30] Understanding the Adult Mammalian Heart at Single-Cell RNA-Seq Resolution
    Marin-Sedeno, Ernesto
    Martinez de Morentin, Xabier
    Perez-Pomares, Jose M.
    Gomez-Cabrero, David
    Ruiz-Villalba, Adrian
    FRONTIERS IN CELL AND DEVELOPMENTAL BIOLOGY, 2021, 9