scE2EGAE: enhancing single-cell RNA-Seq data analysis through an end-to-end cell-graph-learnable graph autoencoder with differentiable edge sampling

Times cited: 0
Authors
Wang, Shuo [1 ,2 ]
Liu, Yuanning [1 ,2 ]
Zhang, Hao [1 ,2 ]
Liu, Zhen [3 ]
Affiliations
[1] Jilin Univ, Coll Comp Sci & Technol, 2699 Qianjin St, Changchun 130012, Jilin, Peoples R China
[2] Jilin Univ, Key Lab Symbol Computat & Knowledge Engn, Minist Educ, 2699 Qianjin St, Changchun 130012, Jilin, Peoples R China
[3] Nagasaki Inst Appl Sci, Grad Sch Engn, 536 Aba machi, Nagasaki, Japan
Funding
National Natural Science Foundation of China;
Keywords
Single-cell RNA-Seq; Bioinformatics; End-to-end; Graph neural networks; Deep learning; Autoencoder; MICROGLIA;
DOI
10.1186/s13062-025-00616-z
Chinese Library Classification number
Q [Biological sciences];
Discipline classification codes
07 ; 0710 ; 09 ;
Abstract
Background: Single-cell RNA sequencing (scRNA-Seq) technology reveals biological processes and molecular-level genomic information across individual cells. Numerous computational methods, including methods based on graph neural networks (GNNs), have been developed to enhance scRNA-Seq data analysis. However, existing GNN-based methods usually construct fixed graphs by applying the k-nearest neighbors algorithm, which may result in information loss.

Methods: To address this problem, we propose scE2EGAE, which learns cell graphs during training. First, the scRNA-Seq data are fed into a deep count autoencoder (DCA). Second, the hidden representations of the DCA are extracted and used to generate cell-to-cell graph edges through a straight-through estimator (STE) based on top-k sampling and Gumbel-Softmax. Finally, the generated cell-to-cell graph and the scRNA-Seq data are fed into GNN-based downstream tasks. As the downstream task in this paper, we design a graph autoencoder that denoises scRNA-Seq data.

Results: We evaluate scE2EGAE on eight public scRNA-Seq datasets and compare it with seven existing scRNA-Seq data denoising methods. The experiments cover: 1) denoising performance, measured by mean absolute error, Pearson correlation coefficient, and cosine similarity; 2) clustering performance of the denoised results, measured by adjusted Rand index, normalized mutual information, and silhouette score; and 3) cell trajectory inference performance of the denoised results, measured by the pseudo-temporal ordering score. On the denoising task, scE2EGAE outperforms most of the compared methods, indicating that it can learn cell-to-cell graphs that capture real cell-to-cell relationships.

Conclusions: We validate scE2EGAE by applying it to the denoising of scRNA-Seq data. The method learns inter-cellular relationships and constructs cell-to-cell graphs, thereby enhancing the downstream analysis of scRNA-Seq data. Our approach can inform future research on GNN-based scRNA-Seq analysis methods and has broad application prospects.
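The edge-generation step described in the abstract (top-k sampling combined with Gumbel-Softmax inside a straight-through estimator) can be illustrated with a small sketch. This is not the authors' implementation: the dot-product similarity score, the function names `gumbel_noise`, `softmax` and `sample_edges`, the temperature value, and the toy embeddings are all assumptions made for illustration. The sketch uses only the Python standard library, so it shows the forward pass only; the comments note where an automatic-differentiation framework would apply the straight-through trick.

```python
import math
import random


def gumbel_noise(rng):
    """Draw one Gumbel(0, 1) sample via the inverse-CDF transform."""
    # Clamp away from 0 and 1 so both logarithms stay finite.
    u = min(max(rng.random(), 1e-12), 1.0 - 1e-12)
    return -math.log(-math.log(u))


def softmax(values, temperature):
    """Numerically stable softmax of values / temperature."""
    scaled = [v / temperature for v in values]
    peak = max(scaled)
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def sample_edges(embeddings, k, temperature=0.5, seed=0):
    """Return (hard, soft) adjacency rows, one row per cell.

    hard[i][j] is 1.0 when cell j is among the k highest noisy scores of
    cell i, else 0.0. soft[i] is the Gumbel-Softmax relaxation of the same
    noisy scores. With automatic differentiation, a straight-through
    estimator would use hard + soft - stop_gradient(soft): the forward
    value equals hard, while gradients flow through soft.
    """
    n = len(embeddings)
    if not 0 < k < n:
        raise ValueError("k must satisfy 0 < k < number of cells")
    rng = random.Random(seed)
    hard, soft = [], []
    for i in range(n):
        scores = []
        for j in range(n):
            if i == j:
                scores.append(-math.inf)  # exclude self-loops
            else:
                # Assumed score: dot-product similarity plus Gumbel noise.
                sim = sum(a * b for a, b in zip(embeddings[i], embeddings[j]))
                scores.append(sim + gumbel_noise(rng))
        top_k = set(sorted(range(n), key=scores.__getitem__, reverse=True)[:k])
        hard.append([1.0 if j in top_k else 0.0 for j in range(n)])
        soft.append(softmax(scores, temperature))
    return hard, soft


# Toy hidden representations for four cells (made-up numbers).
cells = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
hard, soft = sample_edges(cells, k=2)
for row in hard:
    print(row)
```

Because the noise is resampled at every training step, the selected neighbors can change as the embeddings change, in contrast to a fixed k-nearest-neighbors graph built once before training.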
Pages: 25