MAHyNet: Parallel Hybrid Network for RNA-Protein Binding Sites Prediction Based on Multi-Head Attention and Expectation Pooling

被引:0
作者
Wang, Wei [1 ,2 ,3 ]
Sun, Zhenxi [1 ]
Liu, Dong [1 ,2 ,3 ]
Zhang, Hongjun [4 ]
Li, Juntao [5 ]
Wang, Xianfang
Zhou, Yun [1 ,2 ]
机构
[1] Henan Normal Univ, Coll Comp & Informat Engn, Xinxiang 453007, Peoples R China
[2] Key Lab Artificial Intelligence & Personalized Lea, Xinxiang 453007, Henan, Peoples R China
[3] Big Data Engn Lab Teaching Resources & Assessent E, Xinxiang 453007, Henan, Peoples R China
[4] Henan Polytech Univ, Hebi Instiute Engn & Technol, Hebi 458030, Peoples R China
[5] Henan Normal Univ, Sch Math & Informat Sci, Xinxiang 453007, Peoples R China
关键词
RNA; Feature extraction; Convolutional neural networks; Proteins; Predictive models; Data mining; Training; RNA-protein binding sites; prediction; deep learning; multi-head attention; expectation pooling; TRANSCRIPTOME-WIDE IDENTIFICATION; NEURAL-NETWORK; ARCHITECTURES;
D O I
10.1109/TCBB.2024.3366545
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
RNA-binding proteins (RBPs) can regulate biological functions by interacting with specific RNAs, and play an important role in many life activities. Therefore, the rapid identification of RNA-protein binding sites is crucial for functional annotation and site-directed mutagenesis. In this work, a new parallel network that integrates the multi-head attention mechanism and the expectation pooling is proposed, named MAHyNet. The left-branch network of MAHyNet hybrids convolutional neural networks (CNNs) and gated recurrent neural network (GRU) to extract the features of one-hot. The right-branch network is a two-layer CNN network to analyze physicochemical properties of RNA base. Specifically, the multi-head attention mechanism is a computational collection of multiple independent layers of attention, which can extract feature information from multiple dimensions. The expectation pooling combines probabilistic thinking with global pooling. This approach helps to reduce model parameters and enhance the model performance. The combination of CNN and GRU enables further extraction of high-level features in sequences. In addition, the study shows that appropriate hyperparameters have a positive impact on the model performance. Physicochemical properties can be used to supplement characterization information to improving model performance. The experimental results show that MAHyNet has better performance than other models.
引用
收藏
页码:416 / 427
页数:12
相关论文
共 54 条
  • [1] Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning
    Alipanahi, Babak
    Delong, Andrew
    Weirauch, Matthew T.
    Frey, Brendan J.
    [J]. NATURE BIOTECHNOLOGY, 2015, 33 (08) : 831 - +
  • [2] doRiNA: a database of RNA interactions in post-transcriptional regulation
    Anders, Gerd
    Mackowiak, Sebastian D.
    Jens, Marvin
    Maaskola, Jonas
    Kuntzagk, Andreas
    Rajewsky, Nikolaus
    Landthaler, Markus
    Dieterich, Christoph
    [J]. NUCLEIC ACIDS RESEARCH, 2012, 40 (D1) : D180 - D186
  • [3] DeepLoc: prediction of protein subcellular localization using deep learning
    Armenteros, Jose Juan Almagro
    Sonderby, Casper Kaae
    Sonderby, Soren Kaae
    Nielsen, Henrik
    Winther, Ole
    [J]. BIOINFORMATICS, 2017, 33 (21) : 3387 - 3395
  • [4] Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473
  • [5] Golgi_DF: Golgi proteins classification with deep forest
    Bao, Wenzheng
    Gu, Yujian
    Chen, Baitong
    Yu, Huiping
    [J]. FRONTIERS IN NEUROSCIENCE, 2023, 17
  • [6] Phage_UniR_LGBM: Phage Virion Proteins Classification with UniRep Features and LightGBM Model
    Bao, Wenzheng
    Cui, Qingyu
    Chen, Baitong
    Yang, Bin
    [J]. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2022, 2022
  • [7] Roles for RNA-binding proteins in development and disease
    Brinegar, Amy E.
    Cooper, Thomas A.
    [J]. BRAIN RESEARCH, 2016, 1647 : 1 - 8
  • [8] Chentian M., 2020, J. Phys.: Conf. Ser., V1682
  • [9] Chollet F., 2018, Astrophys Source Code Library:ascl
  • [10] Unsupervised Anomaly Detection With LSTM Neural Networks
    Ergen, Tolga
    Kozat, Suleyman Serdar
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (08) : 3127 - 3141