Wasserstein Graph Convolutional Network with Attention for Imbalanced scRNA-seq Data Knowledge Discovery

被引:0
作者
Ren, Jie [1 ]
Han, Henry [1 ]
机构
[1] Baylor Univ, Sch Engn & Comp Sci, Lab Data Sci & Artificial Intelligence Innovat, Waco, TX 76798 USA
来源
RECENT ADVANCES IN NEXT-GENERATION DATA SCIENCE, SDSC 2024 | 2024年 / 2158卷
关键词
scRNA-seq; Wasserstein distance; data imbalance; Graph convolutional network; Attention mechanism; semi-supervised learning; SINGLE-CELL;
D O I
10.1007/978-3-031-67871-4_1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Discovering complex molecular patterns in imbalanced scRNA-seq data remains a challenge, despite numerous efforts from different perspectives. In this study, we propose a novel deep learning model: the Wasserstein Graph Convolutional Network with an attention mechanism (twGCN), designed for semi-supervised learning to address this challenge. The proposed model overcomes the weaknesses of traditional Graph Convolutional Networks by capturing more data intricacies and geometry. This is achieved by integrating a Wasserstein distance-based loss function optimization along with an attention mechanism. Unlike traditional scRNA-seq data preprocessing, we employ a robust scaling approach to normalize scRNA-seq data, which generally contains a large number of outliers. Our methods demonstrate significant advantages over peer methods in discovering single-cell patterns in benchmark data. More importantly, the proposed twGCN can handle both low-dimensional and high-dimensional scRNA-seq data obtained after feature selection. To our knowledge, this study will positively impact both deep learning and bioinformatics, inspiring future research.
引用
收藏
页码:1 / 16
页数:16
相关论文
共 50 条
[41]   A Deep Learning Pipeline for the Automatic cell type Assignment of scRNA-seq Data [J].
Riva, Simone G. ;
Myers, Brynelle ;
Cazzaniga, Paolo ;
Tangherloni, Andrea .
2022 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY (IEEE CIBCB 2022), 2022, :67-74
[42]   A clustering method for small scRNA-seq data based on subspace and weighted distance [J].
Ning, Zilan ;
Dai, Zhijun ;
Zhang, Hongyan ;
Chen, Yuan ;
Yuan, Zheming .
PEERJ, 2023, 11 :28-28
[43]   SCDRHA: A scRNA-Seq Data Dimensionality Reduction Algorithm Based on Hierarchical Autoencoder [J].
Zhao, Jianping ;
Wang, Na ;
Wang, Haiyun ;
Zheng, Chunhou ;
Su, Yansen .
FRONTIERS IN GENETICS, 2021, 12
[44]   Digitaldlsorter: Deep-Learning on scRNA-Seq to Deconvolute Gene Expression Data [J].
Torroja, Carlos ;
Sanchez-Cabo, Fatima .
FRONTIERS IN GENETICS, 2019, 10
[45]   scSDSC: Self-supervised Deep Subspace Clustering for scRNA-seq Data [J].
Yang, Bo ;
Wang, Hai-Yun ;
Zhao, Jian-Ping ;
Zheng, Chun-Hou .
CURRENT BIOINFORMATICS, 2024,
[46]   Evaluation of T Cell Receptor Construction Methods from scRNA-Seq Data [J].
Tian, Ruonan ;
Yu, Zhejian ;
Xue, Ziwei ;
Wu, Jiaxin ;
Wu, Lize ;
Cai, Shuo ;
Gao, Bing ;
He, Bing ;
Zhao, Yu ;
Yao, Jianhua ;
Lu, Linrong ;
Liu, Wanlu .
GENOMICS PROTEOMICS & BIOINFORMATICS, 2025, 22 (06)
[47]   scSFCL:Deep clustering of scRNA-seq data with subspace feature confidence learning [J].
Meng, Xiaokun ;
Zhang, Yuanyuan ;
Xu, Xiaoyu ;
Zhang, Kaihao ;
Feng, Baoming .
COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2025, 114
[48]   Accurate and interpretable gene expression imputation on scRNA-seq data using IGSimpute [J].
Xu, Ke ;
Cheong, ChinWang ;
Veldsman, Werner P. ;
Lyu, Aiping ;
Cheung, William K. ;
Zhang, Lu .
BRIEFINGS IN BIOINFORMATICS, 2023, 24 (03)
[49]   Integration of scRNA-seq data by disentangled representation learning with condition domain adaptation [J].
Liu, Renjing ;
Qian, Kun ;
He, Xinwei ;
Li, Hongwei .
BMC BIOINFORMATICS, 2024, 25 (01)
[50]   UIPBC: An effective clustering for scRNA-seq data analysis without user input [J].
Chowdhury, Hussain Ahmed ;
Bhattacharyya, Dhruba Kumar ;
Kalita, Jugal Kumar .
KNOWLEDGE-BASED SYSTEMS, 2022, 248