Predicting gene regulatory links from single-cell RNA-seq data using graph neural networks

被引:30
作者
Mao, Guo [1 ]
Pang, Zhengbin [2 ]
Zuo, Ke
Wang, Qinglin [3 ]
Pei, Xiangdong [1 ]
Chen, Xinhai [1 ]
Liu, Jie [4 ]
机构
[1] Natl Univ Def Technol, Changsha, Peoples R China
[2] Natl Univ Def Technol, Coll Comp, Changsha, Peoples R China
[3] Natl Univ Def Technol, Natl Key Lab Paralle & Ditributed Comp, Changsha, Peoples R China
[4] Natl Univ Def Technol, Sci & Technol Parallel & Distributed Proc Lab, Lab Software Engn Complex Syst, Changsha, Peoples R China
关键词
graph neural network; link prediction; graph convolutional network; gene regulatory networks (GRNs); INFERENCE;
D O I
10.1093/bib/bbad414
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Single-cell RNA-sequencing (scRNA-seq) has emerged as a powerful technique for studying gene expression patterns at the single-cell level. Inferring gene regulatory networks (GRNs) from scRNA-seq data provides insight into cellular phenotypes from the genomic level. However, the high sparsity, noise and dropout events inherent in scRNA-seq data present challenges for GRN inference. In recent years, the dramatic increase in data on experimentally validated transcription factors binding to DNA has made it possible to infer GRNs by supervised methods. In this study, we address the problem of GRN inference by framing it as a graph link prediction task. In this paper, we propose a novel framework called GNNLink, which leverages known GRNs to deduce the potential regulatory interdependencies between genes. First, we preprocess the raw scRNA-seq data. Then, we introduce a graph convolutional network-based interaction graph encoder to effectively refine gene features by capturing interdependencies between nodes in the network. Finally, the inference of GRN is obtained by performing matrix completion operation on node features. The features obtained from model training can be applied to downstream tasks such as measuring similarity and inferring causality between gene pairs. To evaluate the performance of GNNLink, we compare it with six existing GRN reconstruction methods using seven scRNA-seq datasets. These datasets encompass diverse ground truth networks, including functional interaction networks, Loss of Function/Gain of Function data, non-specific ChIP-seq data and cell-type-specific ChIP-seq data. Our experimental results demonstrate that GNNLink achieves comparable or superior performance across these datasets, showcasing its robustness and accuracy. Furthermore, we observe consistent performance across datasets of varying scales. For reproducibility, we provide the data and source code of GNNLink on our GitHub repository: https://github.com/sdesignates/GNNLink.
引用
收藏
页数:11
相关论文
共 46 条
[1]  
Akhand MAH, 2015, 2015 18TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY (ICCIT), P312, DOI 10.1109/ICCITechn.2015.7488088
[2]   AGRN: accurate gene regulatory network inference using ensemble machine learning methods [J].
Alawad, Duaa Mohammad ;
Katebi, Ataur ;
Ul Kabir, Md Wasi ;
Hoque, Md Tamjidul .
NEURO-ONCOLOGY ADVANCES, 2023, 5 (01)
[3]   DeepImpute: an accurate, fast, and scalable deep neural network method to impute single-cell RNA-seq data [J].
Arisdakessian, Cedric ;
Poirion, Olivier ;
Yunits, Breck ;
Zhu, Xun ;
Garmire, Lana X. .
GENOME BIOLOGY, 2019, 20 (01)
[4]   Information theoretic approaches for inference of biological networks from continuous-valued data [J].
Budden, David M. ;
Crampin, Edmund J. .
BMC SYSTEMS BIOLOGY, 2016, 10
[5]   Multilineage communication regulates human liver bud development from pluripotency [J].
Camp, J. Gray ;
Sekine, Keisuke ;
Gerber, Tobias ;
Loeffler-Wirth, Henry ;
Binder, Hans ;
Gac, Malgorzata ;
Kanton, Sabina ;
Kageyama, Jorge ;
Damm, Georg ;
Seehofer, Daniel ;
Belicova, Lenka ;
Bickle, Marc ;
Barsacchi, Rico ;
Okuda, Ryo ;
Yoshizawa, Emi ;
Kimura, Masaki ;
Ayabe, Hiroaki ;
Taniguchi, Hideki ;
Takebe, Takanori ;
Treutlein, Barbara .
NATURE, 2017, 546 (7659) :533-+
[6]   Graph attention network for link prediction of gene regulations from single-cell RNA-sequencing data [J].
Chen, Guangyi ;
Liu, Zhi-Ping .
BIOINFORMATICS, 2022, 38 (19) :4522-4529
[7]  
Chen J., 2021, etal DeepDRIM: a deep neural network to reconstruct cell-type-specific gene regulatory network using single-cell RNA-seq data
[8]   Single-cell RNA-seq reveals novel regulators of human embryonic stem cell differentiation to definitive endoderm [J].
Chu, Li-Fang ;
Leng, Ning ;
Zhang, Jue ;
Hou, Zhonggang ;
Mamott, Daniel ;
Vereide, David T. ;
Choi, Jeea ;
Kendziorski, Christina ;
Stewart, Ron ;
Thomson, James A. .
GENOME BIOLOGY, 2016, 17
[9]   Deep learning: new computational modelling techniques for genomics [J].
Eraslan, Gokcen ;
Avsec, Ziga ;
Gagneur, Julien ;
Theis, Fabian J. .
NATURE REVIEWS GENETICS, 2019, 20 (07) :389-403
[10]   Filter feature selection based Boolean Modelling for Genetic Network Inference [J].
Gamage, Hasini Nakulugamuwa ;
Chetty, Madhu ;
Shatte, Adrian ;
Hallinan, Jennifer .
BIOSYSTEMS, 2022, 221