scE2EGAE: enhancing single-cell RNA-Seq data analysis through an end-to-end cell-graph-learnable graph autoencoder with differentiable edge sampling

Cited by: 0
Authors
Wang, Shuo [1, 2]
Liu, Yuanning [1, 2]
Zhang, Hao [1, 2]
Liu, Zhen [3]
Affiliations
[1] Jilin Univ, Coll Comp Sci & Technol, 2699 Qianjin St, Changchun 130012, Jilin, Peoples R China
[2] Jilin Univ, Key Lab Symbol Computat & Knowledge Engn, Minist Educ, 2699 Qianjin St, Changchun 130012, Jilin, Peoples R China
[3] Nagasaki Inst Appl Sci, Grad Sch Engn, 536 Aba machi, Nagasaki, Japan
Funding
National Natural Science Foundation of China
Keywords
Single-cell RNA-Seq; Bioinformatics; End-to-end; Graph neural networks; Deep learning; Autoencoder; MICROGLIA;
DOI
10.1186/s13062-025-00616-z
Chinese Library Classification (CLC) number
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
Background: Single-cell RNA sequencing (scRNA-Seq) reveals biological processes and molecular-level genomic information at the level of individual cells. Numerous computational methods, including methods based on graph neural networks (GNNs), have been developed to enhance scRNA-Seq data analysis. However, existing GNN-based methods usually construct a fixed graph with the k-nearest neighbors algorithm, which may result in information loss.
Methods: To address this problem, we propose scE2EGAE, which learns the cell graph during training. First, the scRNA-Seq data are fed into a deep count autoencoder (DCA). Second, the hidden representations of the DCA are extracted and used to generate cell-to-cell graph edges through a straight-through estimator (STE) based on top-k sampling and Gumbel-Softmax. Finally, the generated cell-to-cell graph and the scRNA-Seq data are fed into a GNN-based downstream task. In this paper, the downstream task is a graph autoencoder that denoises scRNA-Seq data.
Results: We evaluate scE2EGAE on eight public scRNA-Seq datasets and compare it with seven existing scRNA-Seq denoising methods. The experiments cover: 1) denoising performance, measured by mean absolute error, Pearson correlation coefficient, and cosine similarity; 2) clustering performance on the denoised data, measured by adjusted Rand index, normalized mutual information, and silhouette score; and 3) cell trajectory inference performance on the denoised data, measured by the pseudo-temporal ordering score. On the denoising task, scE2EGAE outperforms most of the compared methods, which indicates that it can learn cell-to-cell graphs that capture real cell-to-cell relationships.
Conclusions: We validate scE2EGAE by applying it to the denoising of scRNA-Seq data. The method can learn inter-cellular relationships and construct cell-to-cell graphs, thereby enhancing downstream analysis of scRNA-Seq data. The approach may inform future research on GNN-based scRNA-Seq analysis methods and has broad application prospects.
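The edge-generation step named in the Methods (top-k sampling combined with Gumbel-Softmax) can be illustrated with a minimal forward-pass sketch. This is not the authors' implementation: the scoring function (negative squared Euclidean distance between hidden representations), the temperature value, and all function names here are assumptions chosen for illustration. The sketch uses only the Python standard library, so it has no automatic differentiation; the straight-through part is described in a comment.

```python
import math
import random


def gumbel_noise(rng):
    """Draw one sample from the standard Gumbel(0, 1) distribution."""
    u = min(max(rng.random(), 1e-12), 1.0 - 1e-12)  # keep both logs finite
    return -math.log(-math.log(u))


def softmax(values, temperature):
    """Numerically stable softmax of values / temperature."""
    scaled = [v / temperature for v in values]
    peak = max(scaled)
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def sample_edges(embeddings, k, temperature=0.5, seed=0):
    """Sample k neighbours per cell from Gumbel-perturbed similarity scores.

    Returns (hard, soft): `hard` is a 0/1 adjacency matrix with exactly k
    ones per row; `soft` is the Gumbel-Softmax relaxation of the same scores.
    """
    n = len(embeddings)
    if not 0 < k < n:
        raise ValueError("k must satisfy 0 < k < number of cells")
    rng = random.Random(seed)
    hard, soft = [], []
    for i in range(n):
        scores = []
        for j in range(n):
            if i == j:
                scores.append(-math.inf)  # exclude self-loops
                continue
            # Assumed scoring: closer hidden representations score higher.
            dist2 = sum((a - b) ** 2
                        for a, b in zip(embeddings[i], embeddings[j]))
            scores.append(-dist2 + gumbel_noise(rng))
        # Top-k over the perturbed scores gives the discrete edge sample.
        top = set(sorted(range(n), key=scores.__getitem__, reverse=True)[:k])
        hard.append([1.0 if j in top else 0.0 for j in range(n)])
        soft.append(softmax(scores, temperature))
    # In an autodiff framework, a straight-through estimator would return
    # hard + soft - stop_gradient(soft): the forward value equals `hard`,
    # while gradients flow through `soft`.
    return hard, soft


# Toy hidden representations for five cells (made-up values).
cells = [[0.0, 0.0], [0.1, 0.0], [0.0, 0.1], [5.0, 5.0], [5.1, 5.0]]
hard, soft = sample_edges(cells, k=2)
```

Unlike a fixed k-nearest-neighbors graph, the sampled edges depend on the current hidden representations and on the noise draw, so the graph can change as the encoder is trained.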
Pages: 25