Wasserstein Graph Convolutional Network with Attention for Imbalanced scRNA-seq Data Knowledge Discovery

被引:0
作者
Ren, Jie [1 ]
Han, Henry [1 ]
机构
[1] Baylor Univ, Sch Engn & Comp Sci, Lab Data Sci & Artificial Intelligence Innovat, Waco, TX 76798 USA
来源
RECENT ADVANCES IN NEXT-GENERATION DATA SCIENCE, SDSC 2024 | 2024年 / 2158卷
关键词
scRNA-seq; Wasserstein distance; data imbalance; Graph convolutional network; Attention mechanism; semi-supervised learning; SINGLE-CELL;
D O I
10.1007/978-3-031-67871-4_1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Discovering complex molecular patterns in imbalanced scRNA-seq data remains a challenge, despite numerous efforts from different perspectives. In this study, we propose a novel deep learning model: the Wasserstein Graph Convolutional Network with an attention mechanism (twGCN), designed for semi-supervised learning to address this challenge. The proposed model overcomes the weaknesses of traditional Graph Convolutional Networks by capturing more data intricacies and geometry. This is achieved by integrating a Wasserstein distance-based loss function optimization along with an attention mechanism. Unlike traditional scRNA-seq data preprocessing, we employ a robust scaling approach to normalize scRNA-seq data, which generally contains a large number of outliers. Our methods demonstrate significant advantages over peer methods in discovering single-cell patterns in benchmark data. More importantly, the proposed twGCN can handle both low-dimensional and high-dimensional scRNA-seq data obtained after feature selection. To our knowledge, this study will positively impact both deep learning and bioinformatics, inspiring future research.
引用
收藏
页码:1 / 16
页数:16
相关论文
共 50 条
[31]   ORIGINS: A protein network-based approach to quantify cell pluripotency from scRNA-seq data [J].
Senra, Daniela ;
Guisoni, Nara ;
Diambra, Luis .
METHODSX, 2022, 9
[32]   scCAT: Single-cell Combined graph Attentional clustering for scRNA-seq analysis [J].
Gao, Yufei ;
Zhang, Wenbo ;
Zhang, Yameng ;
Shi, Yujie ;
Shi, Lei ;
Wang, Hailing ;
Cheng, Guozhen .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 105
[33]   Benchmarking imputation methods for network inference using a novel method of synthetic scRNA-seq data generation [J].
Ayoub Lasri ;
Vahid Shahrezaei ;
Marc Sturrock .
BMC Bioinformatics, 23
[34]   Benchmarking imputation methods for network inference using a novel method of synthetic scRNA-seq data generation [J].
Lasri, Ayoub ;
Shahrezaei, Vahid ;
Sturrock, Marc .
BMC BIOINFORMATICS, 2022, 23 (01)
[35]   Statistics or biology: the zero-inflation controversy about scRNA-seq data [J].
Jiang, Ruochen ;
Sun, Tianyi ;
Song, Dongyuan ;
Li, Jingyi Jessica .
GENOME BIOLOGY, 2022, 23 (01)
[36]   Graph-Regularized Non-Negative Matrix Factorization for Single-Cell Clustering in scRNA-Seq Data [J].
Jiang, Hanjing ;
Wang, Mei-Neng ;
Huang, Yu-An ;
Huang, Yabing .
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (08) :4986-4994
[37]   Integration of scRNA-seq data by disentangled representation learning with condition domain adaptation [J].
Renjing Liu ;
Kun Qian ;
Xinwei He ;
Hongwei Li .
BMC Bioinformatics, 25
[38]   scSDSC: Self-supervised Deep Subspace Clustering for scRNA-seq Data [J].
Yang, Bo ;
Wang, Hai-Yun ;
Zhao, Jian-Ping ;
Zheng, Chun-Hou .
CURRENT BIOINFORMATICS, 2024,
[39]   Evaluation of T Cell Receptor Construction Methods from scRNA-Seq Data [J].
Tian, Ruonan ;
Yu, Zhejian ;
Xue, Ziwei ;
Wu, Jiaxin ;
Wu, Lize ;
Cai, Shuo ;
Gao, Bing ;
He, Bing ;
Zhao, Yu ;
Yao, Jianhua ;
Lu, Linrong ;
Liu, Wanlu .
GENOMICS PROTEOMICS & BIOINFORMATICS, 2025, 22 (06)
[40]   A clustering method for small scRNA-seq data based on subspace and weighted distance [J].
Ning, Zilan ;
Dai, Zhijun ;
Zhang, Hongyan ;
Chen, Yuan ;
Yuan, Zheming .
PEERJ, 2023, 11 :28-28