Conjoint Feature Representation of GO and Protein Sequence for PPI Prediction Based on an Inception RNN Attention Network

Times cited: 19
Authors
Zhao, Lingling [1]
Wang, Junjie [1]
Hu, Yang [2]
Cheng, Liang [3, 4]
Affiliations
[1] Harbin Inst Technol, Fac Comp, Harbin 150001, Peoples R China
[2] Harbin Inst Technol, Sch Life Sci & Technol, Dept Comp Sci, Harbin 150001, Peoples R China
[3] Harbin Med Univ, NHC & CAMS Key Lab Mol Probe & Targeted Theranost, Harbin 150028, Heilongjiang, Peoples R China
[4] Harbin Med Univ, Coll Bioinformat Sci & Technol, Harbin 150081, Heilongjiang, Peoples R China
Source
Funding
National Natural Science Foundation of China;
Keywords
NEURAL-NETWORK;
DOI
10.1016/j.omtn.2020.08.025
Chinese Library Classification (CLC) number
R-3 [Medical research methods]; R3 [Basic medicine];
Discipline classification code
1001;
Abstract
Protein-protein interactions (PPIs) are pivotal for cellular functions and biological processes. In the past years, computational methods using amino acid sequences and gene ontology (GO) annotations of proteins for prioritizing PPIs have provided important references for biological experiments in the wet lab. Despite the current success, sequence information and ontological annotation in semantic representation have not been integrated into current methods. We propose a deep-learning-based PPI prediction methodology conjointly featuring sequence information and GO annotation. First, we adopt a word-embedding tool, the NCBI-blueBERT model pre-trained on PubMed, to map the GO terms into their semantic vectors. Then, the GO semantic vectors and protein sequence vector serve as the input of the proposed inception recurrent neural network (RNN) attention network (IRAN). The IRAN captures the spatial relationship and the potential sequential feature of the protein sequence and ontological annotation semantics. The extensive experimental results on 12 benchmarks demonstrate that our method achieves superiority over state-of-the-art baselines. In the yeast dataset of a binary PPI prediction, our method improved the performance with the Matthews correlation coefficient increasing from 94.2% to 98.2% and the accuracy from 97.1% to 98.2%. The analogous results were also obtained in other comparison evaluations.
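The abstract reports both accuracy and the Matthews correlation coefficient (MCC) for binary PPI prediction. As a minimal sketch of how these two metrics are computed from confusion-matrix counts, assuming labels are encoded as 1 (interacting pair) and 0 (non-interacting pair): the function names and toy labels below are illustrative and do not come from the paper.

```python
import math


def confusion_counts(y_true, y_pred):
    """Count true/false positives and negatives for binary labels (1 or 0)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    return tp, tn, fp, fn


def accuracy(tp, tn, fp, fn):
    """Fraction of all pairs that were classified correctly."""
    return (tp + tn) / (tp + tn + fp + fn)


def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient, in the range [-1, 1].

    Returns 0.0 when any marginal count is zero, a common convention
    for the otherwise undefined case.
    """
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0
    return (tp * tn - fp * fn) / denom


# Toy example (hypothetical labels, not data from the paper).
y_true = [1, 1, 1, 0, 0, 0]
y_pred = [1, 1, 0, 0, 0, 1]
tp, tn, fp, fn = confusion_counts(y_true, y_pred)
print(f"accuracy = {accuracy(tp, tn, fp, fn):.3f}")
print(f"MCC      = {mcc(tp, tn, fp, fn):.3f}")
```

The abstract states MCC as a percentage, which corresponds to multiplying the value returned here by 100. Unlike accuracy, MCC uses all four confusion-matrix counts, so it stays informative when the interacting and non-interacting classes are imbalanced.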
Pages: 198-208
Page count: 11
Related papers
50 in total
  • [1] Sequence-based prediction of protein–protein interaction using auto-feature engineering of RNN-based model
    Mewara B.
    Lalwani S.
    Research on Biomedical Engineering, 2023, 39 (01) : 259 - 272
  • [2] RNN-based Human Motion Prediction via Differential Sequence Representation
    Wang, Yachuan
    Wang, Xuan
    Jiang, Peilin
    Wang, Fei
    PROCEEDINGS OF 2019 6TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2019, : 138 - 143
  • [3] Gene Ontology Based Function Prediction of Human Protein Using Protein Sequence and Neighborhood Property of PPI Network
    Saha, Sovan
    Chatterjee, Piyali
    Basu, Subhadip
    Nasipuri, Mita
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON FRONTIERS IN INTELLIGENT COMPUTING: THEORY AND APPLICATIONS, (FICTA 2016), VOL 2, 2017, 516 : 109 - 118
  • [4] Prediction of RNA-protein interactions using conjoint triad feature and chaos game representation
    Wang, Hongchu
    Wu, Pengfei
    BIOENGINEERED, 2018, 9 (01) : 242 - 251
  • [5] Combining Sequence Entropy and Subgraph Topology for Complex Prediction in Protein Protein Interaction (PPI) Network
    Sikandar, Aisha
    Anwar, Waqas
    Sikandar, Misba
    CURRENT BIOINFORMATICS, 2019, 14 (06) : 516 - 523
  • [6] SAINT: self-attention augmented inception-inside-inception network improves protein secondary structure prediction
    Uddin, Mostofa Rafid
    Mahbub, Sazan
    Rahman, M. Saifur
    Bayzid, Md Shamsuzzoha
    BIOINFORMATICS, 2020, 36 (17) : 4599 - 4608
  • [7] AFTGAN: prediction of multi-type PPI based on attention free transformer and graph attention network
    Kang, Yanlei
    Elofsson, Arne
    Jiang, Yunliang
    Huang, Weihong
    Yu, Minzhe
    Li, Zhong
    BIOINFORMATICS, 2023, 39 (02)
  • [8] RAANMF: An adaptive sequence feature representation method for predictions of protein thermostability, PPI, and drug-target interaction
    Yan, Qunfang
    Pan, Shuyi
    Cheng, Zhixing
    Ding, Yanrui
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2025, 169
  • [9] Classification of enzyme function from protein sequence based on feature representation
    Lee, Bum Ju
    Lee, Jong Yun
    Lee, Heon Gu
    Ryu, Keun Ho
    PROCEEDINGS OF THE 7TH IEEE INTERNATIONAL SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, VOLS I AND II, 2007, : 741 - +
  • [10] Prediction of Protein Solubility Based on Sequence Feature Fusion and DDcCNN
    Wang, Xianfang
    Liu, Yifeng
    Du, Zhiyong
    Zhu, Mingdong
    Kaushik, Aman Chandra
    Jiang, Xue
    Wei, Dongqing
    INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2021, 13 (04) : 703 - 716