Consensus clustering of single-cell RNA-seq data by enhancing network affinity

被引:28
|
作者
Cui, Yaxuan [1 ]
Zhang, Shaoqiang [1 ]
Liang, Ying [1 ]
Wang, Xiangyun [1 ]
Ferraro, Thomas N. [2 ]
Chen, Yong [3 ]
机构
[1] Tianjin Normal Univ, Coll Comp & Informat Engn, Tianjin 300387, Peoples R China
[2] CMSRU, Dept Biomed Sci, Camden, NJ USA
[3] Rowan Univ, Dept Mol & Cellular Biosci, Camden, NJ 08028 USA
基金
美国国家科学基金会;
关键词
single-cell RNA-seq; clustering algorithm; bioinformatics; cell typing; GENE-EXPRESSION; HETEROGENEITY; EMBRYOS; STATES; ATLAS; FATE;
D O I
10.1093/bib/bbab236
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Elucidation of cell subpopulations at high resolution is a key and challenging goal of single-cell ribonucleic acid (RNA) sequencing (scRNA-seq) data analysis. Although unsupervised clustering methods have been proposed for de novo identification of cell populations, their performance and robustness suffer from the high variability, low capture efficiency and high dropout rates which are characteristic of scRNA-seq experiments. Here, we present a novel unsupervised method for Single-cell Clustering by Enhancing Network Affinity (SCENA), which mainly employed three strategies: selecting multiple gene sets, enhancing local affinity among cells and clustering of consensus matrices. Large-scale validations on 13 real scRNA-seq datasets show that SCENA has high accuracy in detecting cell populations and is robust against dropout noise. When we applied SCENA to large-scale scRNA-seq data of mouse brain cells, known cell types were successfully detected, and novel cell types of interneurons were identified with differential expression of gamma-aminobutyric acid receptor subunits and transporters. SCENA is equipped with CPU+GPU (Central Processing Units+Graphics Processing Units) heterogeneous parallel computing to achieve high running speed. The high performance and running speed of SCENA combine into a new and efficient platform for biological discoveries in clustering analysis of large and diverse scRNA-seq datasets.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] SC3: consensus clustering of single-cell RNA-seq data
    Kiselev, Vladimir Yu
    Kirschner, Kristina
    Schaub, Michael T.
    Andrews, Tallulah
    Yiu, Andrew
    Chandra, Tamir
    Natarajan, Kedar N.
    Reik, Wolf
    Barahona, Mauricio
    Green, Anthony R.
    Hemberg, Martin
    NATURE METHODS, 2017, 14 (05) : 483 - +
  • [2] SC3: Consensus clustering of single-cell RNA-seq data
    Kiselev V.Y.
    Kirschner K.
    Schaub M.T.
    Andrews T.
    Yiu A.
    Chandra T.
    Natarajan K.N.
    Reik W.
    Barahona M.
    Green A.R.
    Hemberg M.
    Nature Methods, 2017, 14 (5) : 483 - 486
  • [3] Analysis of Single-Cell RNA-seq Data by Clustering Approaches
    Zhu, Xiaoshu
    Li, Hong-Dong
    Guo, Lilu
    Wu, Fang-Xiang
    Wang, Jianxin
    CURRENT BIOINFORMATICS, 2019, 14 (04) : 314 - 322
  • [4] Challenges in unsupervised clustering of single-cell RNA-seq data
    Kiselev, Vladimir Yu
    Andrews, Tallulah S.
    Hemberg, Martin
    NATURE REVIEWS GENETICS, 2019, 20 (05) : 273 - 282
  • [5] Deep Learning for Clustering Single-cell RNA-seq Data
    Zhu, Yuan
    Bai, Litai
    Ning, Zilin
    Fu, Wenfei
    Liu, Jie
    Jiang, Linfeng
    Fei, Shihuang
    Gong, Shiyun
    Lu, Lulu
    Deng, Minghua
    Yi, Ming
    CURRENT BIOINFORMATICS, 2024, 19 (03) : 193 - 210
  • [6] HNC: a hybrid neighbourhood-consensus clustering algorithm for single-cell RNA-seq data
    Das, Priyojit
    Saha, Sujay
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2021, 25 (3-4) : 161 - 180
  • [7] Enhancing Clustering of single-cell RNA-seq data by Proximity Learning on Random Projected spaces
    Vrahatis, Aristidis G.
    Dimitrakopoulos, Georgios N.
    Tasoulis, Sotiris K.
    Plagianakos, Vassilis P.
    2019 IEEE 19TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOENGINEERING (BIBE), 2019, : 846 - 849
  • [8] An active learning approach for clustering single-cell RNA-seq data
    Lin, Xiang
    Liu, Haoran
    Wei, Zhi
    Roy, Senjuti Basu
    Gao, Nan
    LABORATORY INVESTIGATION, 2022, 102 (03) : 227 - 235
  • [9] scDFN: enhancing single-cell RNA-seq clustering with deep fusion networks
    Liu, Tianxiang
    Jia, Cangzhi
    Bi, Yue
    Guo, Xudong
    Zou, Quan
    Li, Fuyi
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (06)
  • [10] Single-cell RNA-seq data clustering by deep information fusion
    Ren, Liangrui
    Wang, Jun
    Li, Wei
    Guo, Maozu
    Yu, Guoxian
    BRIEFINGS IN FUNCTIONAL GENOMICS, 2024, 23 (02) : 128 - 137