scSFCL:Deep clustering of scRNA-seq data with subspace feature confidence learning

被引:0
作者
Meng, Xiaokun [1 ]
Zhang, Yuanyuan [1 ]
Xu, Xiaoyu [1 ]
Zhang, Kaihao [1 ]
Feng, Baoming [1 ]
机构
[1] Qingdao Univ Technol, Sch Informat & Control Engn, Qingdao 266520, Shandong, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep Clustering; ScRNA-seq; GCN; Subspace Clustering; Feature Confidence Learning;
D O I
10.1016/j.compbiolchem.2024.108292
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The rapid development of single-cell RNA sequencing(scRNA-seq) technology has spawned a variety of singlecell clustering methods. These methods combine statistics and bioinformatics to reveal differences in gene expression between cells and the diversity of cell types. Deep exploration of single-cell data is more challenging due to the high dimensionality, sparsity and noise of scRNA-seq data. Discriminative attribute information is often difficult to be fully utilised, while traditional clustering methods may not accurately capture the diversity of cell types. Therefore, a deep clustering method is proposed for scRNA-seq data based on subspace feature confidence learning called scSFCL. By dividing the subspace based on kernel density, discriminative feature subsets are filtered. The feature confidence of the subset is learned by combining the graph convolutional network (GCN) with weighting. Also, scSFCL facilitates the complementary fusion of generic structural and idiosyncratic information through a mutually supervised clustering that integrates GCN and a denoising variational autoencoder based on zero-inflated negative binomials (DVAE-ZINB). By validation on multiple scRNA-seq datasets, it is shown that the clustering performance of scSFCL is significantly improved compared with traditional methods, providing an effective solution for deep clustering of scRNA-seq data.
引用
收藏
页数:14
相关论文
共 51 条
[1]  
[Anonymous], 2007, Comparing clusterings: an overview
[2]  
Bezdek J. C., 1981, Pattern recognition with fuzzy objective function algorithms
[3]   Hierarchical Density Estimates for Data Clustering, Visualization, and Outlier Detection [J].
Campello, Ricardo J. G. B. ;
Moulavi, Davoud ;
Zimek, Arthur ;
Sander, Joerg .
ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2015, 10 (01)
[4]   Deep soft K-means clustering with self-training for single-cell RNA sequence data [J].
Chen, Liang ;
Wang, Weinan ;
Zhai, Yuyao ;
Deng, Minghua .
NAR GENOMICS AND BIOINFORMATICS, 2020, 2 (02)
[5]   Single-cell RNA-seq enables comprehensive tumour and immune cell profiling in primary breast cancer [J].
Chung, Woosung ;
Eum, Hye Hyeon ;
Lee, Hae-Ock ;
Lee, Kyung-Min ;
Lee, Han-Byoel ;
Kim, Kyu-Tae ;
Ryu, Han Suk ;
Kim, Sangmin ;
Lee, Jeong Eon ;
Park, Yeon Hee ;
Kan, Zhengyan ;
Han, Wonshik ;
Park, Woong-Yang .
NATURE COMMUNICATIONS, 2017, 8
[6]   Single-cell RNA-seq denoising using a deep count autoencoder [J].
Eraslan, Goekcen ;
Simon, Lukas M. ;
Mircea, Maria ;
Mueller, Nikola S. ;
Theis, Fabian J. .
NATURE COMMUNICATIONS, 2019, 10 (1)
[7]   scMAE: a masked autoencoder for single-cell RNA-seq clustering [J].
Fang, Zhaoyu ;
Zheng, Ruiqing ;
Li, Min .
BIOINFORMATICS, 2024, 40 (01)
[8]   Heterogeneity in Oct4 and Sox2 Targets Biases Cell Fate in 4-Cell Mouse Embryos [J].
Goolam, Mubeen ;
Scialdone, Antonio ;
Graham, Sarah J. L. ;
Macaulay, Iain C. ;
Jedrusik, Agnieszka ;
Hupalowska, Anna ;
Voet, Thierry ;
Marioni, John C. ;
Zernicka-Goetz, Magdalena .
CELL, 2016, 165 (01) :61-74
[9]   dbscan: Fast Density-Based Clustering with R [J].
Hahsler, Michael ;
Piekenbrock, Matthew ;
Doran, Derek .
JOURNAL OF STATISTICAL SOFTWARE, 2019, 91 (01) :1-30
[10]  
Hartigan J. A., 1979, Applied Statistics, V28, P100, DOI 10.2307/2346830