Consensus clustering of single-cell RNA-seq data by enhancing network affinity

Cited by: 29
Authors
Cui, Yaxuan [1]
Zhang, Shaoqiang [1]
Liang, Ying [1]
Wang, Xiangyun [1]
Ferraro, Thomas N. [2]
Chen, Yong [3]
Affiliations
[1] Tianjin Normal Univ, Coll Comp & Informat Engn, Tianjin 300387, Peoples R China
[2] CMSRU, Dept Biomed Sci, Camden, NJ USA
[3] Rowan Univ, Dept Mol & Cellular Biosci, Camden, NJ 08028 USA
Funding
U.S. National Science Foundation;
Keywords
single-cell RNA-seq; clustering algorithm; bioinformatics; cell typing; GENE-EXPRESSION; HETEROGENEITY; EMBRYOS; STATES; ATLAS; FATE;
DOI
10.1093/bib/bbab236
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Elucidation of cell subpopulations at high resolution is a key and challenging goal of single-cell ribonucleic acid (RNA) sequencing (scRNA-seq) data analysis. Although unsupervised clustering methods have been proposed for de novo identification of cell populations, their performance and robustness suffer from the high variability, low capture efficiency and high dropout rates that are characteristic of scRNA-seq experiments. Here, we present a novel unsupervised method for Single-cell Clustering by Enhancing Network Affinity (SCENA), which employs three main strategies: selecting multiple gene sets, enhancing local affinity among cells and clustering of consensus matrices. Large-scale validations on 13 real scRNA-seq datasets show that SCENA has high accuracy in detecting cell populations and is robust against dropout noise. When we applied SCENA to large-scale scRNA-seq data of mouse brain cells, known cell types were successfully detected, and novel cell types of interneurons were identified with differential expression of gamma-aminobutyric acid receptor subunits and transporters. SCENA is equipped with CPU+GPU (Central Processing Units+Graphics Processing Units) heterogeneous parallel computing to achieve high running speed. The high accuracy and running speed of SCENA together provide a new and efficient platform for biological discoveries in clustering analysis of large and diverse scRNA-seq datasets.
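The three strategies named in the abstract (multiple gene sets, local affinity among cells, clustering of a consensus matrix) can be illustrated with a small NumPy sketch. This is not the SCENA implementation: the abstract does not specify the algorithms, so the variance-based gene selection, the k-nearest-neighbour sparsification used as a stand-in for affinity enhancement, the average-linkage clustering, and all function names and parameters below are assumptions chosen only to show the general shape of such a pipeline on synthetic data.

```python
import numpy as np

def top_variable_genes(X, n_genes):
    """Indices of the n_genes columns (genes) with the highest variance."""
    return np.argsort(X.var(axis=0))[::-1][:n_genes]

def knn_affinity(X, k):
    """Cell-cell correlation, kept only along k-nearest-neighbour edges.
    Assumed stand-in for 'enhancing local affinity'; not the paper's method."""
    S = np.corrcoef(X)                      # rows of X are cells
    np.fill_diagonal(S, -np.inf)            # a cell is not its own neighbour
    nn = np.argsort(S, axis=1)[:, -k:]      # k most similar cells per row
    rows = np.repeat(np.arange(len(S)), k)
    A = np.zeros_like(S)
    A[rows, nn.ravel()] = S[rows, nn.ravel()]
    A = np.maximum(A, A.T)                  # symmetrise the neighbour graph
    return np.clip(A, 0.0, 1.0)             # drop negative correlations

def average_linkage(D, n_clusters):
    """Agglomerative clustering of a symmetric distance matrix D."""
    D = D.astype(float).copy()
    np.fill_diagonal(D, np.inf)
    n = len(D)
    labels = np.arange(n)                   # label = index of representative row
    size = np.ones(n)
    active = np.ones(n, dtype=bool)
    for _ in range(n - n_clusters):
        masked = np.where(active[:, None] & active[None, :], D, np.inf)
        i, j = np.unravel_index(np.argmin(masked), masked.shape)
        # Merge cluster j into cluster i with a size-weighted mean distance.
        merged = (size[i] * D[i] + size[j] * D[j]) / (size[i] + size[j])
        D[i, :] = merged
        D[:, i] = merged
        D[i, i] = np.inf
        active[j] = False
        size[i] += size[j]
        labels[labels == j] = i
    return np.unique(labels, return_inverse=True)[1]

def consensus_cluster(X, gene_set_sizes, k, n_clusters):
    """Cluster once per gene set, then cluster the co-clustering frequencies."""
    n = X.shape[0]
    consensus = np.zeros((n, n))
    for n_genes in gene_set_sizes:
        genes = top_variable_genes(X, n_genes)
        A = knn_affinity(X[:, genes], k)
        run = average_linkage(1.0 - A, n_clusters)
        consensus += run[:, None] == run[None, :]
    consensus /= len(gene_set_sizes)
    return average_linkage(1.0 - consensus, n_clusters), consensus

# Synthetic data (an assumption, not from the paper): 3 groups of 15 cells,
# 20 marker genes per group, and 20% of entries zeroed to mimic dropout.
rng = np.random.default_rng(0)
n_per_group, n_total_genes = 15, 200
truth = np.repeat(np.arange(3), n_per_group)
X = rng.poisson(1.0, size=(truth.size, n_total_genes)).astype(float)
for g in range(3):
    X[truth == g, g * 20:(g + 1) * 20] += rng.poisson(10.0, size=(n_per_group, 20))
X[rng.random(X.shape) < 0.2] = 0.0
X = np.log1p(X)

labels, consensus = consensus_cluster(X, gene_set_sizes=(40, 60, 80), k=8, n_clusters=3)
print(labels)
```

Each entry of `consensus` is the fraction of gene sets under which two cells were placed in the same cluster, so the final step clusters agreement between runs rather than raw expression. In this toy setup the number of clusters is supplied by hand, which a de novo method would have to infer.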
Pages: 14