Single-cell RNA sequencing data analysis utilizing multi-type graph neural networks

被引:0
作者
Xu, Li [1 ]
Li, Zhenpeng [1 ]
Ren, Jiaxu [1 ]
Liu, Shuaipeng [1 ]
Xu, Yiming [2 ]
机构
[1] College of Computer Science and Technology, Harbin Engineering University, Heilongjiang, Harbin
[2] College of Engineering, Tokyo Institute of Technology, Tokyo, Tokyo
基金
中国国家自然科学基金;
关键词
Cell clustering; Denoising autoencoder; Graph neural network; Single-cell RNA-seq;
D O I
10.1016/j.compbiomed.2024.108921
中图分类号
学科分类号
摘要
Single-cell RNA sequencing (scRNA-seq) is the sequencing technology of a single cell whose expression reflects the overall characteristics of the individual cell, facilitating the research of problems at the cellular level. However, the problems of scRNA-seq such as dimensionality reduction processing of massive data, technical noise in data, and visualization of single-cell type clustering cause great difficulties for analyzing and processing scRNA-seq data. In this paper, we propose a new single-cell data analysis model using denoising autoencoder and multi-type graph neural networks (scDMG), which learns cell–cell topology information and latent representation of scRNA-seq data. scDMG introduces the zero-inflated negative binomial (ZINB) model into a denoising autoencoder (DAE) to perform dimensionality reduction and denoising on the raw data. scDMG integrates multiple-type graph neural networks as the encoder to further train the preprocessed data, which better deals with various types of scRNA-seq datasets, resolves dropout events in scRNA-seq data, and enables preliminary classification of scRNA-seq data. By employing TSNE and PCA algorithms for the trained data and invoking Louvain algorithm, scDMG has better dimensionality reduction and clustering optimization. Compared with other mainstream scRNA-seq clustering algorithms, scDMG outperforms other state-of-the-art methods in various clustering performance metrics and shows better scalability, shorter runtime, and great clustering results. © 2024 Elsevier Ltd
引用
收藏
相关论文
共 62 条
[1]  
Jiang J., Xu J., Liu Y., Song B., Guo X., Zeng X., Zou Q., Dimensionality reduction and visualization of single-cell RNA-seq data with an improved deep variational autoencoder, Brief. Bioinform., 24, 3, (2023)
[2]  
Simmons S.K., Lithwick-Yanai G., Adiconis X., Oberstrass F., Iremadze N., Geiger-Schuller K., Thakore P.I., Frangieh C.J., Barad O., Almogy G., Et al., Mostly natural sequencing-by-synthesis for scRNA-seq using ultima sequencing, Nat. Biotechnol., 41, 2, pp. 204-211, (2023)
[3]  
Qi R., Ma A., Ma Q., Zou Q., Clustering and classification methods for single-cell RNA-sequencing data, Brief. Bioinform., 21, 4, pp. 1196-1208, (2020)
[4]  
Xu J., Xu J., Meng Y., Lu C., Cai L., Zeng X., Nussinov R., Cheng F., Graph embedding and Gaussian mixture variational autoencoder network for end-to-end analysis of single-cell RNA sequencing data, Cell Rep. Methods, 3, 1, (2023)
[5]  
Chowdhury H.A., Effective clustering of scRNA-seq data to identify biomarkers without user input, Proceedings of the AAAI Conference on Artificial Intelligence, 35, pp. 15710-15711, (2021)
[6]  
Yu Z., Su Y., Lu Y., Yang Y., Wang F., Zhang S., Chang Y., Wong K.C., Li X., Topological identification and interpretation for single-cell gene regulation elucidation across multiple platforms using scMGCA, Nature Commun., 14, 1, (2023)
[7]  
Sha Y., Qiu Y., Zhou P., Nie Q., Reconstructing growth and dynamic trajectories from single-cell transcriptomics data, Nat. Mach. Intell., 6, 1, pp. 25-39, (2024)
[8]  
Zhao M., He W., Tang J., Zou Q., Guo F., A hybrid deep learning framework for gene regulatory network inference from single-cell transcriptomic data, Brief. Bioinform., 23, 2, (2022)
[9]  
Kim H., Chang W., Chae S.J., Park J.-E., Seo M., Kim J.K., scLENS: data-driven signal detection for unbiased scRNA-seq data analysis, Nature Commun., 15, 1, (2024)
[10]  
Zou Z., Hua K., Zhang X., HGC: Fast hierarchical clustering for large-scale single-cell data, Bioinformatics, 37, 21, pp. 3964-3965, (2021)