A Novel Deep Learning Method for Predicting RNA-Protein Binding Sites

被引:1
|
作者
Zhao, Xueru [1 ]
Chang, Furong [2 ]
Lv, Hehe [1 ]
Zou, Guobing [1 ]
Zhang, Bofeng [3 ,4 ]
机构
[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai 200444, Peoples R China
[2] Yangzhou Polytech Inst, Sch Informat Engn, Yangzhou 225127, Peoples R China
[3] Shanghai Polytech Univ, Sch Comp & Commun Engn, Shanghai 201209, Peoples R China
[4] Kashi Univ, Sch Comp Sci & Technol, Kashi 844008, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 05期
基金
国家重点研发计划;
关键词
protein-RNA interaction; RNA-binding sites; deep learning; graph neural network; hierarchical pooling network; RNA secondary structure; SEQUENCE; MOTIFS; IDENTIFICATION; DATABASE; DNA;
D O I
10.3390/app13053247
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The cell cycle and biological processes rely on RNA and RNA-binding protein (RBP) interactions. It is crucial to identify the binding sites on RNA. Various deep-learning methods have been used for RNA-binding site prediction. However, they cannot extract the hierarchical features of the RNA secondary structure. Therefore, this paper proposes HPNet, which can automatically identify RNA-binding sites and -binding preferences. HPNet performs feature learning from the two perspectives of the RNA sequence and the RNA secondary structure. A convolutional neural network (CNN), a deep-learning method, is used to learn RNA sequence features in HPNet. To capture the hierarchical information for RNA, we introduced DiffPool into HPNet, a differentiable pooling graph neural network (GNN). A CNN and DiffPool were combined to improve the binding site prediction accuracy by leveraging both RNA sequence features and hierarchical features of the RNA secondary structure. Binding preferences can be extracted based on model outputs and parameters. Overall, the experimental results showed that HPNet achieved a mean area under the curve (AUC) of 94.5% for the benchmark dataset, which was more accurate than the state-of-the-art methods. Moreover, these results demonstrate that the hierarchical features of RNA secondary structure play an essential role in selecting RNA-binding sites.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] WVDL: Weighted Voting Deep Learning Model for Predicting RNA-Protein Binding Sites
    Pan, Zhengsen
    Zhou, Shusen
    Liu, Tong
    Liu, Chanjuan
    Zang, Mujun
    Wang, Qingjun
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (05) : 3322 - 3328
  • [2] RBPsuite: RNA-protein binding sites prediction suite based on deep learning
    Pan, Xiaoyong
    Fang, Yi
    Li, Xianfeng
    Yang, Yang
    Shen, Hong-Bin
    BMC GENOMICS, 2020, 21 (01)
  • [3] RBPsuite: RNA-protein binding sites prediction suite based on deep learning
    Xiaoyong Pan
    Yi Fang
    Xianfeng Li
    Yang Yang
    Hong-Bin Shen
    BMC Genomics, 21
  • [4] A Method for Predicting RNA-Protein Interaction and Interaction sites
    Wang, Tong
    Lu, Hong
    Li, Hongmei
    Cao, Xiaoxia
    PROCEEDINGS OF THE 2013 8TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION (ICCSE 2013), 2013, : 795 - 798
  • [5] Predicting RNA-protein binding affinity
    Strack, Rita
    NATURE METHODS, 2019, 16 (06) : 460 - 460
  • [6] Learning distributed representations of RNA sequences and its application for predicting RNA-protein binding sites with a convolutional neural network
    Pan, Xiaoyong
    Shen, Hong-Bin
    NEUROCOMPUTING, 2018, 305 : 51 - 58
  • [7] Predicting RNA-protein binding sites and motifs through combining local and global deep convolutional neural networks
    Pan, Xiaoyong
    Shen, Hong-Bin
    BIOINFORMATICS, 2018, 34 (20) : 3427 - 3436
  • [8] Self-Attention Based Neural Network for Predicting RNA-Protein Binding Sites
    Wang, Xinyi
    Zhang, Mingyang
    Long, Chunlin
    Yao, Lin
    Zhu, Min
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (02) : 1469 - 1479
  • [9] DeepPN: a deep parallel neural network based on convolutional neural network and graph convolutional network for predicting RNA-protein binding sites
    Jidong Zhang
    Bo Liu
    Zhihan Wang
    Klaus Lehnert
    Mark Gahegan
    BMC Bioinformatics, 23
  • [10] DeepPN: a deep parallel neural network based on convolutional neural network and graph convolutional network for predicting RNA-protein binding sites
    Zhang, Jidong
    Liu, Bo
    Wang, Zhihan
    Lehnert, Klaus
    Gahegan, Mark
    BMC BIOINFORMATICS, 2022, 23 (01)