Single-cell RNA-seq data clustering by deep information fusion

被引:3
|
作者
Ren, Liangrui [2 ]
Wang, Jun [3 ]
Li, Wei [4 ]
Guo, Maozu [5 ]
Yu, Guoxian [1 ,2 ]
机构
[1] Shandong Univ, Sch Software, Jinan 250101, Peoples R China
[2] Shandong Univ, Sch Software, Jinan, Peoples R China
[3] Shandong Univ, Joint SDU NTU Ctr Artificial Intelligence Res C FA, Jinan, Peoples R China
[4] Shandong Univ, Sch Control Sci & Engn, Jinan, Peoples R China
[5] Beijing Univ Civil Engn & Architecture, Sch Elect & Informat Engn, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
single-cell RNA-seq clustering; graph convolution network; deep auto-encoder; ZINB; transcriptomics; VISUALIZATION; COMPLEX;
D O I
10.1093/bfgp/elad017
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Determining cell types by single-cell transcriptomics data is fundamental for downstream analysis. However, cell clustering and data imputation still face the computation challenges, due to the high dropout rate, sparsity and dimensionality of single-cell data. Although some deep learning based solutions have been proposed to handle these challenges, they still can not leverage gene attribute information and cell topology in a sensible way to explore the consistent clustering. In this paper, we present scDeepFC, a deep information fusion-based single-cell data clustering method for cell clustering and data imputation. Specifically, scDeepFC uses a deep auto-encoder (DAE) network and a deep graph convolution network to embed high-dimensional gene attribute information and high-order cell-cell topological information into different low-dimensional representations, and then fuses them to generate a more comprehensive and accurate consensus representation via a deep information fusion network. In addition, scDeepFC integrates the zero-inflated negative binomial (ZINB) into DAE to model the dropout events. By jointly optimizing the ZINB loss and cell graph reconstruction loss, scDeepFC generates a salient embedding representation for clustering cells and imputing missing data. Extensive experiments on real single-cell datasets prove that scDeepFC outperforms other popular single-cell analysis methods. Both the gene attribute and cell topology information can improve the cell clustering.
引用
收藏
页码:128 / 137
页数:10
相关论文
共 50 条
  • [1] A Global Similarity Learning for Clustering of Single-Cell RNA-Seq Data
    Zhu, Xiaoshu
    Guo, Lilu
    Xu, Yunpei
    Li, Hong-Dong
    Liao, Xingyu
    Wu, Fang-Xiang
    Peng, Xiaoqing
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 261 - 266
  • [2] Impact of similarity metrics on single-cell RNA-seq data clustering
    Kim, Taiyun
    Chen, Irene Rui
    Lin, Yingxin
    Wang, Andy Yi-Yang
    Yang, Jean Yee Hwa
    Yang, Pengyi
    BRIEFINGS IN BIOINFORMATICS, 2019, 20 (06) : 2316 - 2326
  • [3] A hybrid deep clustering approach for robust cell type profiling using single-cell RNA-seq data
    Srinivasan, Suhas
    Leshchyk, Anastasia
    Johnson, Nathan T.
    Korkin, Dmitry
    RNA, 2020, 26 (10) : 1303 - 1319
  • [4] Comparison of Gene Selection Methods for Clustering Single-cell RNA-seq Data
    Zhu, Xiaoshu
    Wang, Jianxin
    Li, Rongruan
    Peng, Xiaoqing
    CURRENT BIOINFORMATICS, 2023, 18 (01) : 1 - 11
  • [5] Improving Single-Cell RNA-seq Clustering by Integrating Pathways
    Zhang, Chenxing
    Gao, Lin
    Wang, Bingbo
    Gao, Yong
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (06)
  • [6] Emerging deep learning methods for single-cell RNA-seq data analysis
    Zheng, Jie
    Wang, Ke
    QUANTITATIVE BIOLOGY, 2019, 7 (04) : 247 - 254
  • [7] An interpretable framework for clustering single-cell RNA-Seq datasets
    Zhang, Jesse M.
    Fan, Jue
    Fan, Christina
    Rosenfeld, David
    Tse, David N.
    BMC BIOINFORMATICS, 2018, 19
  • [8] Review of single-cell RNA-seq data clustering for cell-type identification and characterization
    Zhang, Shixiong
    Li, Xiangtao
    Lin, Jiecong
    Lin, Qiuzhen
    Wong, Ka-Chun
    RNA, 2023, 29 (05) : 517 - 530
  • [9] ScGSLC: An unsupervised graph similarity learning framework for single-cell RNA-seq data clustering
    Li, Junyi
    Jiang, Wei
    Han, Henry
    Liu, Jing
    Liu, Bo
    Wang, Yadong
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2021, 90
  • [10] Dimensionality reduction and visualization of single-cell RNA-seq data with an improved deep variational autoencoder
    Jiang, Jing
    Xu, Junlin
    Liu, Yuansheng
    Song, Bosheng
    Guo, Xiulan
    Zeng, Xiangxiang
    Zou, Quan
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (03)