scASGC: An adaptive simplified graph convolution model for clustering single-cell RNA-seq data

被引:4
作者
Wang, Shudong [1 ]
Zhang, Yu [1 ]
Zhang, Yulin [2 ]
Wu, Wenhao [1 ]
Ye, Lan [3 ]
Li, Yunyin [1 ]
Su, Jionglong [4 ]
Pang, Shanchen [1 ]
机构
[1] China Univ Petr, Qingdao Inst Software, Coll Comp Sci & Technol, Qingdao 266580, Peoples R China
[2] Shandong Univ Sci & Technol, Coll Math & Syst Sci, Qingdao 266590, Peoples R China
[3] Shandong Univ, Hosp 2, Ctr Lab, Jinan 250033, Peoples R China
[4] Xian Jiaotong Liverpool Univ, XJTLU Entrepreneur Coll Taicang, Sch AI & Adv Comp, Suzhou 215123, Peoples R China
关键词
ScRNA-seq; Clustering; Bioinformatics; Graph convolution; Computational biology; Machine learning; GENE-EXPRESSION; HETEROGENEITY; FATE;
D O I
10.1016/j.compbiomed.2023.107152
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Single-cell RNA sequencing (scRNA-seq) is now a successful technique for identifying cellular heterogeneity, revealing novel cell subpopulations, and forecasting developmental trajectories. A crucial component of the processing of scRNA-seq data is the precise identification of cell subpopulations. Although many unsupervised clustering methods have been developed to cluster cell subpopulations, the performance of these methods is vulnerable to dropouts and high dimensionality. In addition, most existing methods are time-consuming and fail to adequately account for potential associations between cells. In the manuscript, we present an unsupervised clustering method based on an adaptive simplified graph convolution model called scASGC. The proposed method builds plausible cell graphs, aggregates neighbor information using a simplified graph convolution model, and adaptively determines the most optimal number of convolution layers for various graphs. Experiments on 12 public datasets show that scASGC outperforms both classical and state-of-the-art clustering methods. In addition, in a study of mouse intestinal muscle containing 15,983 cells, we identified distinct marker genes based on the clustering results of scASGC. The source code of scASGC is available at https://github.com/ZzzOctopus/scASGC.
引用
收藏
页数:8
相关论文
共 41 条
[11]   Single-Cell RNA-Seq Reveals Dynamic, Random Monoallelic Gene Expression in Mammalian Cells [J].
Deng, Qiaolin ;
Ramskold, Daniel ;
Reinius, Bjorn ;
Sandberg, Rickard .
SCIENCE, 2014, 343 (6167) :193-196
[12]   Single-cell RNA-seq denoising using a deep count autoencoder [J].
Eraslan, Goekcen ;
Simon, Lukas M. ;
Mircea, Maria ;
Mueller, Nikola S. ;
Theis, Fabian J. .
NATURE COMMUNICATIONS, 2019, 10 (1)
[13]   Single-cell reconstruction of developmental trajectories during zebrafish embryogenesis [J].
Farrell, Jeffrey A. ;
Wang, Yiqun ;
Riesenfeld, Samantha J. ;
Shekhar, Karthik ;
Regev, Aviv ;
Schier, Alexander F. .
SCIENCE, 2018, 360 (6392) :979-+
[14]   Heterogeneity in Oct4 and Sox2 Targets Biases Cell Fate in 4-Cell Mouse Embryos [J].
Goolam, Mubeen ;
Scialdone, Antonio ;
Graham, Sarah J. L. ;
Macaulay, Iain C. ;
Jedrusik, Agnieszka ;
Hupalowska, Anna ;
Voet, Thierry ;
Marioni, John C. ;
Zernicka-Goetz, Magdalena .
CELL, 2016, 165 (01) :61-74
[15]  
Hartigan J. A., 1979, Applied Statistics, V28, P100, DOI 10.2307/2346830
[16]   Single-cell RNA sequencing: Technical advancements and biological applications [J].
Hedlund, Eva ;
Deng, Qiaolin .
MOLECULAR ASPECTS OF MEDICINE, 2018, 59 :36-46
[17]   COMPARING PARTITIONS [J].
HUBERT, L ;
ARABIE, P .
JOURNAL OF CLASSIFICATION, 1985, 2 (2-3) :193-218
[18]   Independent component analysis:: algorithms and applications [J].
Hyvärinen, A ;
Oja, E .
NEURAL NETWORKS, 2000, 13 (4-5) :411-430
[19]   Dissecting Immune Circuits by Linking CRISPR-Pooled Screens with Single-Cell RNA-Seq [J].
Jaitin, Diego Adhemar ;
Weiner, Assaf ;
Yofe, Ido ;
Lara-Astiaso, David ;
Keren-Shaul, Hadas ;
David, Eyal ;
Salame, Tomer Meir ;
Tanay, Amos ;
van Oudenaarden, Alexander ;
Amit, Ido .
CELL, 2016, 167 (07) :1883-+
[20]  
Kipf T N, 2017, INT C LEARN REPR, ppp1, DOI DOI 10.48550/ARXIV.1609.02907