A convolutional network and attention mechanism-based approach to predict protein-RNA binding residues

被引:1
作者
Li, Ke [1 ,2 ,3 ]
Wu, Hongwei [1 ,3 ]
Yue, Zhenyu [1 ,3 ]
Sun, Yu [1 ,3 ]
Xia, Chuan [3 ]
机构
[1] Anhui Agr Univ, Sch Informat & Comp, Hefei 230036, Anhui, Peoples R China
[2] Anhui Univ, Informat Mat & Intelligent Sensing Lab Anhui Prov, Hefei 230601, Anhui, Peoples R China
[3] Anhui Agr Univ, Anhui Prov Engn Lab Beidou Precis Agr Informat, Hefei 230036, Anhui, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature selection; Protein-RNA interaction; RNA-seq; Deep learning; Attention mechanism; ACCURATE; DATABASE; SITES;
D O I
10.1016/j.compbiolchem.2023.107901
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Protein-RNA interactions play a key role in various biological cellular processes, and many experimental and computational studies have been initiated to analyze their interactions. However, experimental determination is quite complex and expensive. Therefore, researchers have worked to develop efficient computational tools to detect protein-RNA binding residues. The accuracy of existing methods is limited by the features of the target and the performance of the computational models; there remains room for improvement. To solve the problem of the accurate detection of protein-RNA binding residues, we propose a convolutional network model named PBRPre based on improved MobileNet. First, by extracting the position information of the target complex and the 3-mer amino acid feature data, the position-specific scoring matrix (PSSM) is improved by using spatial neighbor smoothing processing and discrete wavelet transform to fully exploit the spatial structure information of the target and enrich the feature dataset. Second, the deep learning model MobileNet is used to integrate and optimize the potential features in the target complexes; then, by introducing the Vision Transformer (ViT) network classification layer, the deep-level information of the target is mined to enhance the processing ability of the model for global information and to improve the detection accuracy of the classifiers. The results show that the AUC value of the model can reach 0.866 in the independent testing dataset, which shows that PBRPre can effectively realize the detection of protein-RNA binding residues. All datasets and resource codes of PBRPre are available at https://github.com/linglewu/PBRPre for academic use.
引用
收藏
页数:8
相关论文
共 46 条
  • [1] Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning
    Alipanahi, Babak
    Delong, Andrew
    Weirauch, Matthew T.
    Frey, Brendan J.
    [J]. NATURE BIOTECHNOLOGY, 2015, 33 (08) : 831 - +
  • [2] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
    Altschul, SF
    Madden, TL
    Schaffer, AA
    Zhang, JH
    Zhang, Z
    Miller, W
    Lipman, DJ
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
  • [3] Continuous Distributed Representation of Biological Sequences for Deep Proteomics and Genomics
    Asgari, Ehsaneddin
    Mofrad, Mohammad R. K.
    [J]. PLOS ONE, 2015, 10 (11):
  • [4] Vibrational dynamics of folded proteins: Significance of slow and fast motions in relation to function and stability
    Bahar, I
    Atilgan, AR
    Demirel, MC
    Erman, B
    [J]. PHYSICAL REVIEW LETTERS, 1998, 80 (12) : 2733 - 2736
  • [5] Direct evaluation of thermal fluctuations in proteins using a single-parameter harmonic potential
    Bahar, I
    Atilgan, AR
    Erman, B
    [J]. FOLDING & DESIGN, 1997, 2 (03): : 173 - 181
  • [6] Sequence-based prediction of protein interaction sites with an integrative method
    Chen, Xue-Wen
    Jeong, Jong Cheol
    [J]. BIOINFORMATICS, 2009, 25 (05) : 585 - 591
  • [7] Identifying RNA-binding residues based on evolutionary conserved structural and energetic features
    Chen, Yao Chi
    Sargsyan, Karen
    Wright, Jon D.
    Huang, Yi-Shuian
    Lim, Carmay
    [J]. NUCLEIC ACIDS RESEARCH, 2014, 42 (03) : e15
  • [8] Dosovitskiy A, 2021, Arxiv, DOI [arXiv:2010.11929, DOI 10.48550/ARXIV.2010.11929]
  • [9] FastRNABindR: Fast and Accurate Prediction of Protein-RNA Interface Residues
    EL-Manzalawy, Yasser
    Abbas, Mostafa
    Malluhi, Qutaibah
    Honavar, Vasant
    [J]. PLOS ONE, 2016, 11 (07):
  • [10] RNAm5CPred: Prediction of RNA 5-Methylcytosine Sites Based on Three Different Kinds of Nucleotide Composition
    Fang, Ting
    Zhang, Zizheng
    Sun, Rui
    Zhu, Lin
    He, Jingjing
    Huang, Bei
    Xiong, Yi
    Zhu, Xiaolei
    [J]. MOLECULAR THERAPY-NUCLEIC ACIDS, 2019, 18 : 739 - 747