RPI-GGCN: Prediction of RNA-Protein Interaction Based on Interpretability Gated Graph Convolution Neural Network and Co-Regularized Variational Autoencoders

被引:7
作者
Wang, Yifei [1 ]
Ding, Pengju [2 ]
Wang, Congjing [1 ]
He, Shiyue [2 ]
Gao, Xin [3 ]
Yu, Bin [2 ,4 ]
机构
[1] Qingdao Univ Sci & Technol China, Coll Math & Phys, Sch Data Sci, Qingdao 266061, Peoples R China
[2] Qingdao Univ Sci & Technol China, Sch Data Sci, Qingdao 266061, Peoples R China
[3] King Abdullah Univ Sci & Technol KAUST, Computat Biosci Res Ctr CBRC, Comp Elect & Math Sci & Engn Div, Thuwal 23955, Saudi Arabia
[4] Qingdao Univ Sci & Technol, Sch Data Sci, Qingdao 266061, Peoples R China
基金
中国国家自然科学基金;
关键词
Economic indicators; Feature extraction; RNA; Proteins; Predictive models; Computational modeling; Logic gates; Co-regularized variational autoencoders (Co-VAEs); gated graph convolution neural network (GGCN); interpretability; multi-information fusion; RNA-protein interactions (RPIs); SEQUENCE; COVARIANCE;
D O I
10.1109/TNNLS.2024.3390935
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
RNA-protein interactions (RPIs) play an important role in several fundamental cellular physiological processes, including cell motility, chromosome replication, transcription and translation, and signaling. Predicting RPI can guide the exploration of cellular biological functions, intervening in diseases, and designing drugs. Given this, this study proposes the RPI-gated graph convolutional network (RPI-GGCN) method for predicting RPI based on the gated graph convolutional neural network (GGCN) and co-regularized variational autoencoder (Co-VAE). First, different types of feature information were extracted from RNA and protein sequences by nine feature extraction methods. Second, Co-VAEs are used to eliminate the redundancy of fused features and generate optimal features. Finally, this study introduces gated cyclic units into graph convolutional networks (GCNs) to construct a model for RPI prediction, which efficiently extracts topological information and improves the model's interpretable feature learning and expression capabilities. In the fivefold cross-validation test, the RPI-GGCN method achieved prediction accuracies of 97.27%, 97.32%, 96.54%, 95.76%, and 94.98% on the RPI369, RPI488, RPI1446, RPI1807, and RPI2241 datasets. To test the generalization performance of the model, we used the model trained on RPI369 to predict the independent NPInter v3.0 dataset and achieved excellent performance in all six independent validation sets. By visualizing the RPI network graph based on the prediction results, we aim to provide a new perspective and reference for studying RPI mechanisms and exploring new RPIs. Extensive experimental results demonstrate that RPI-GGCN can provide an efficient, accurate, and stable RPI prediction method.
引用
收藏
页码:7681 / 7695
页数:15
相关论文
共 66 条
[1]  
[Anonymous], 1996, SER B METHODOL, V58, P267
[2]   Learning eigenfunctions links spectral embedding and kernel PCA [J].
Bengio, Y ;
Delalleau, O ;
Le Roux, N ;
Paiement, JF ;
Vincent, P ;
Ouimet, M .
NEURAL COMPUTATION, 2004, 16 (10) :2197-2219
[3]   Orthogonal Matching Pursuit for Sparse Signal Recovery With Noise [J].
Cai, T. Tony ;
Wang, Lie .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2011, 57 (07) :4680-4688
[4]   Structural mechanism for rifampicin inhibition of bacterial RNA polymerase [J].
Campbell, EA ;
Korzheva, N ;
Mustaev, A ;
Murakami, K ;
Nair, S ;
Goldfarb, A ;
Darst, SA .
CELL, 2001, 104 (06) :901-912
[5]   iTIS-PseTNC: A sequence-based predictor for identifying translation initiation site in human genes using pseudo trinucleotide composition [J].
Chen, Wei ;
Feng, Peng-Mian ;
Deng, En-Ze ;
Lin, Hao ;
Chou, Kuo-Chen .
ANALYTICAL BIOCHEMISTRY, 2014, 462 :76-83
[6]   DM-RPIs: Predicting ncRNA-protein interactions using stacked ensembling strategy [J].
Cheng, Shuping ;
Zhang, Lu ;
Tan, Jianjun ;
Gong, Weikang ;
Li, Chunhua ;
Zhang, Xiaoyi .
COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2019, 83
[7]   Selecting high-quality negative samples for effectively predicting protein-RNA interactions [J].
Cheng, Zhanzhan ;
Huang, Kai ;
Wang, Yang ;
Liu, Hui ;
Guan, Jihong ;
Zhou, Shuigeng .
BMC SYSTEMS BIOLOGY, 2017, 11
[8]   Prediction of protein subcellular locations by incorporating quasi-sequence-order effect [J].
Chou, KC .
BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2000, 278 (02) :477-483
[9]   Construction of Complex Features for Computational Predicting ncRNA-Protein Interaction [J].
Dai, Qiguo ;
Guo, Maozu ;
Duan, Xiaodong ;
Teng, Zhixia ;
Fu, Yueyue .
FRONTIERS IN GENETICS, 2019, 10
[10]   Graph Convolutional Adversarial Networks for Spatiotemporal Anomaly Detection [J].
Deng, Leyan ;
Lian, Defu ;
Huang, Zhenya ;
Chen, Enhong .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (06) :2416-2428