An improved sequence-based prediction protocol for protein-protein interactions using amino acids substitution matrix and rotation forest ensemble classifiers

被引:44
|
作者
You, Zhu-Hong [1 ]
Li, Xiao [1 ]
Chan, Keith C. C. [2 ]
机构
[1] Chinese Acad Sci, Tech Inst Phys & Chem, Urumqi 830011, Peoples R China
[2] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Hong Kong, Peoples R China
基金
美国国家科学基金会;
关键词
Protein-protein interaction; Substitution matrix; Rotation forest; Protein sequence; Ensemble classifier; IDENTIFICATION; HYPERPLANES; COMPLEXES;
D O I
10.1016/j.neucom.2016.10.042
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Protein-protein Interactions (PPIs) play important roles in a wide variety of cellular processes, including metabolic cycles, DNA transcription and replication, and signaling cascades High-throughput biological experiments for identifying PPIs are beginning to provide valuable information about the complexity of PPI networks, but are expensive, cumbersome, and extremely time-consuming. Hence, there is a need for accurate and robust computational methods for predicting PPIs. In this article, a sequence-based approach is proposed by combining a novel amino acid substitution matrix feature representation and Rotation Forest (RF) classifier. Given the protein sequences as input, the proposed method predicts whether or not the pair of proteins interacts. When performed on the PPI data of Saccharomyces cerevisiae, the proposed method achieved 93.74% prediction accuracy with 90.05% sensitivity at the precision of 97.08%. Extensive experiments are performed to compare our method with the existing sequence-based method. Experimental results demonstrate that PPIs can be reliably predicted using only sequence-derived information. Achieved results show that the proposed approach offers an inexpensive method for computational construction of PPI networks, so it can be a useful supplementary tool for future proteomics studies.
引用
收藏
页码:277 / 282
页数:6
相关论文
共 50 条
  • [41] Detection of Protein-Protein Interactions from Amino Acid Sequences Using a Rotation Forest Model with a Novel PR-LPQ Descriptor
    Wong, Leon
    You, Zhu-Hong
    Li, Shuai
    Huang, Yu-An
    Liu, Gang
    ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, ICIC 2015, PT III, 2015, 9227 : 713 - 720
  • [42] Sequence-based prediction of protein protein interaction using a deep-learning algorithm
    Tanlin Sun
    Bo Zhou
    Luhua Lai
    Jianfeng Pei
    BMC Bioinformatics, 18
  • [43] Advancing the prediction accuracy of protein-protein interactions by utilizing evolutionary information from position-specific scoring matrix and ensemble classifier
    Wang, Lei
    You, Zhu-Hong
    Xia, Shi-Xiong
    Liu, Feng
    Chen, Xing
    Yan, Xin
    Zhou, Yong
    JOURNAL OF THEORETICAL BIOLOGY, 2017, 418 : 105 - 110
  • [44] Improved Prediction of Protein-Protein Interaction Mapping on Homo Sapiens by Using Amino Acid Sequence Features in a Supervised Learning Framework
    Islam, Md Merajul
    Alam, Md Jahangir
    Ahmed, Fee Faysal
    Hasan, Md Mehedi
    Mollah, Md Nurul Haque
    PROTEIN AND PEPTIDE LETTERS, 2021, 28 (01) : 74 - 83
  • [45] Prediction of biological protein-protein interactions using atom-type and amino acid properties
    Aziz, Md Mominul
    Maleki, Mina
    Rueda, Luis
    Raza, Mohammad
    Banerjee, Sridip
    PROTEOMICS, 2011, 11 (19) : 3802 - 3810
  • [46] Improving protein-protein interactions prediction accuracy using XGBoost feature selection and stacked ensemble classifier
    Chen, Cheng
    Zhang, Qingmei
    Yu, Bin
    Yu, Zhaomin
    Lawrence, Patrick J.
    Ma, Qin
    Zhang, Yan
    COMPUTERS IN BIOLOGY AND MEDICINE, 2020, 123
  • [47] Improved Prediction of Protein-Protein Interactions Using Descriptors Derived From PSSM via Gray Level Co-Occurrence Matrix
    Zhu, Hui-Juan
    You, Zhu-Hong
    Shi, Wei-Lei
    Xu, Shou-Kun
    Jiang, Tong-Hai
    Zhuang, Li-Hua
    IEEE ACCESS, 2019, 7 : 49456 - 49465
  • [48] Human protein-protein interaction prediction by a novel sequence-based co-evolution method: co-evolutionary divergence
    Liu, Chia Hsin
    Li, Ker-Chau
    Yuan, Shinsheng
    BIOINFORMATICS, 2013, 29 (01) : 92 - 98
  • [49] Prediction and Modeling of Protein-Protein Interactions Using "Spotted" Peptides with a Template-Based Approach
    Gasbarri, Chiara
    Rosignoli, Serena
    Janson, Giacomo
    Boi, Dalila
    Paiardini, Alessandro
    BIOMOLECULES, 2022, 12 (02)
  • [50] Prediction of protein-protein interactions from amino acid sequences using a novel multi-scale continuous and discontinuous feature set
    You, Zhu-Hong
    Zhu, Lin
    Zheng, Chun-Hou
    Yu, Hong-Jie
    Deng, Su-Ping
    Ji, Zhen
    BMC BIOINFORMATICS, 2014, 15