Consensus clustering of single-cell RNA-seq data by enhancing network affinity

被引:29
作者
Cui, Yaxuan [1 ]
Zhang, Shaoqiang [1 ]
Liang, Ying [1 ]
Wang, Xiangyun [1 ]
Ferraro, Thomas N. [2 ]
Chen, Yong [3 ]
机构
[1] Tianjin Normal Univ, Coll Comp & Informat Engn, Tianjin 300387, Peoples R China
[2] CMSRU, Dept Biomed Sci, Camden, NJ USA
[3] Rowan Univ, Dept Mol & Cellular Biosci, Camden, NJ 08028 USA
基金
美国国家科学基金会;
关键词
single-cell RNA-seq; clustering algorithm; bioinformatics; cell typing; GENE-EXPRESSION; HETEROGENEITY; EMBRYOS; STATES; ATLAS; FATE;
D O I
10.1093/bib/bbab236
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Elucidation of cell subpopulations at high resolution is a key and challenging goal of single-cell ribonucleic acid (RNA) sequencing (scRNA-seq) data analysis. Although unsupervised clustering methods have been proposed for de novo identification of cell populations, their performance and robustness suffer from the high variability, low capture efficiency and high dropout rates which are characteristic of scRNA-seq experiments. Here, we present a novel unsupervised method for Single-cell Clustering by Enhancing Network Affinity (SCENA), which mainly employed three strategies: selecting multiple gene sets, enhancing local affinity among cells and clustering of consensus matrices. Large-scale validations on 13 real scRNA-seq datasets show that SCENA has high accuracy in detecting cell populations and is robust against dropout noise. When we applied SCENA to large-scale scRNA-seq data of mouse brain cells, known cell types were successfully detected, and novel cell types of interneurons were identified with differential expression of gamma-aminobutyric acid receptor subunits and transporters. SCENA is equipped with CPU+GPU (Central Processing Units+Graphics Processing Units) heterogeneous parallel computing to achieve high running speed. The high performance and running speed of SCENA combine into a new and efficient platform for biological discoveries in clustering analysis of large and diverse scRNA-seq datasets.
引用
收藏
页数:14
相关论文
共 74 条
  • [11] Single-Cell RNA-Seq Reveals Dynamic, Random Monoallelic Gene Expression in Mammalian Cells
    Deng, Qiaolin
    Ramskold, Daniel
    Reinius, Bjorn
    Sandberg, Rickard
    [J]. SCIENCE, 2014, 343 (6167) : 193 - 196
  • [12] Resolution limit in community detection
    Fortunato, Santo
    Barthelemy, Marc
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2007, 104 (01) : 36 - 41
  • [13] Clustering by passing messages between data points
    Frey, Brendan J.
    Dueck, Delbert
    [J]. SCIENCE, 2007, 315 (5814) : 972 - 976
  • [14] Heterogeneity in Oct4 and Sox2 Targets Biases Cell Fate in 4-Cell Mouse Embryos
    Goolam, Mubeen
    Scialdone, Antonio
    Graham, Sarah J. L.
    Macaulay, Iain C.
    Jedrusik, Agnieszka
    Hupalowska, Anna
    Voet, Thierry
    Marioni, John C.
    Zernicka-Goetz, Magdalena
    [J]. CELL, 2016, 165 (01) : 61 - 74
  • [15] Single-cell messenger RNA sequencing reveals rare intestinal cell types
    Grun, Dominic
    Lyubimova, Anna
    Kester, Lennart
    Wiebrands, Kay
    Basak, Onur
    Sasaki, Nobuo
    Clevers, Hans
    van Oudenaarden, Alexander
    [J]. NATURE, 2015, 525 (7568) : 251 - +
  • [16] SINCERA: A Pipeline for Single-Cell RNA-Seq Profiling Analysis
    Guo, Minzhe
    Wang, Hui
    Potter, S. Steven
    Whitsett, Jeffrey A.
    Xu, Yan
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2015, 11 (11)
  • [17] Mapping the Mouse Cell Atlas by Microwell-Seq
    Han, Xiaoping
    Wang, Renying
    Zhou, Yincong
    Fei, Lijiang
    Sun, Huiyu
    Lai, Shujing
    Saadatpour, Assieh
    Zhou, Zimin
    Chen, Haide
    Ye, Fang
    Huang, Daosheng
    Xu, Yang
    Huang, Wentao
    Jiang, Mengmeng
    Jiang, Xinyi
    Mao, Jie
    Chen, Yao
    Lu, Chenyu
    Xie, Jin
    Fang, Qun
    Wang, Yibin
    Yue, Rui
    Li, Tiefeng
    Huang, He
    Orkin, Stuart H.
    Yuan, Guo-Cheng
    Chen, Ming
    Guo, Guoji
    [J]. CELL, 2018, 172 (05) : 1091 - +
  • [18] Advanced Applications of RNA Sequencing and Challenges
    Han, Yixing
    Gao, Shouguo
    Muegge, Kathrin
    Zhang, Wei
    Zhou, Bing
    [J]. BIOINFORMATICS AND BIOLOGY INSIGHTS, 2015, 9 : 29 - 46
  • [19] A practical guide to single-cell RNA-sequencing for biomedical research and clinical applications
    Haque, Ashraful
    Engel, Jessica
    Teichmann, Sarah A.
    Lonnberg, Tapio
    [J]. GENOME MEDICINE, 2017, 9
  • [20] A benchmark of batch-effect correction methods for single-cell RNA sequencing data
    Hoa Thi Nhu Tran
    Ang, Kok Siong
    Chevrier, Marion
    Zhang, Xiaomeng
    Lee, Nicole Yee Shin
    Goh, Michelle
    Chen, Jinmiao
    [J]. GENOME BIOLOGY, 2020, 21 (01)