An improved sequence-based prediction protocol for protein-protein interactions using amino acids substitution matrix and rotation forest ensemble classifiers

被引:44
|
作者
You, Zhu-Hong [1 ]
Li, Xiao [1 ]
Chan, Keith C. C. [2 ]
机构
[1] Chinese Acad Sci, Tech Inst Phys & Chem, Urumqi 830011, Peoples R China
[2] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Hong Kong, Peoples R China
基金
美国国家科学基金会;
关键词
Protein-protein interaction; Substitution matrix; Rotation forest; Protein sequence; Ensemble classifier; IDENTIFICATION; HYPERPLANES; COMPLEXES;
D O I
10.1016/j.neucom.2016.10.042
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Protein-protein Interactions (PPIs) play important roles in a wide variety of cellular processes, including metabolic cycles, DNA transcription and replication, and signaling cascades High-throughput biological experiments for identifying PPIs are beginning to provide valuable information about the complexity of PPI networks, but are expensive, cumbersome, and extremely time-consuming. Hence, there is a need for accurate and robust computational methods for predicting PPIs. In this article, a sequence-based approach is proposed by combining a novel amino acid substitution matrix feature representation and Rotation Forest (RF) classifier. Given the protein sequences as input, the proposed method predicts whether or not the pair of proteins interacts. When performed on the PPI data of Saccharomyces cerevisiae, the proposed method achieved 93.74% prediction accuracy with 90.05% sensitivity at the precision of 97.08%. Extensive experiments are performed to compare our method with the existing sequence-based method. Experimental results demonstrate that PPIs can be reliably predicted using only sequence-derived information. Achieved results show that the proposed approach offers an inexpensive method for computational construction of PPI networks, so it can be a useful supplementary tool for future proteomics studies.
引用
收藏
页码:277 / 282
页数:6
相关论文
共 50 条
  • [31] Sequence-based protein-protein interaction prediction via support vector machine
    Wang, Yongcui
    Wang, Jiguang
    Yang, Zhixia
    Deng, Naiyang
    JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2010, 23 (05) : 1012 - 1023
  • [32] Using Two-dimensional Principal Component Analysis and Rotation Forest for Prediction of Protein-Protein Interactions
    Lei Wang
    Zhu-Hong You
    Xin Yan
    Shi-Xiong Xia
    Feng Liu
    Li-Ping Li
    Wei Zhang
    Yong Zhou
    Scientific Reports, 8
  • [33] Using Two-dimensional Principal Component Analysis and Rotation Forest for Prediction of Protein-Protein Interactions
    Wang, Lei
    You, Zhu-Hong
    Yan, Xin
    Xia, Shi-Xiong
    Liu, Feng
    Li, Li-Ping
    Zhang, Wei
    Zhou, Yong
    SCIENTIFIC REPORTS, 2018, 8
  • [34] Prediction of Protein-Protein Interactions with Clustered Amino Acids and Weighted Sparse Representation
    Huang, Qiaoying
    You, Zhuhong
    Zhang, Xiaofeng
    Zhou, Yong
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2015, 16 (05) : 10855 - 10869
  • [35] Identification of potential interaction networks using sequence-based searches for conserved protein-protein interactions or "interologs"
    Matthews, LR
    Vaglio, P
    Reboul, J
    Ge, H
    Davis, BP
    Garrels, J
    Vincent, S
    Vidal, M
    GENOME RESEARCH, 2001, 11 (12) : 2120 - 2126
  • [36] Prediction of Protein-Protein Interactions from Protein Sequence Using Local Descriptors
    Yang, Lei
    Xia, Jun-Feng
    Gui, Jie
    PROTEIN AND PEPTIDE LETTERS, 2010, 17 (09): : 1085 - 1090
  • [37] Prediction of protein-protein interactions using random decision forest framework
    Chen, XW
    Liu, M
    BIOINFORMATICS, 2005, 21 (24) : 4394 - 4400
  • [38] Protein-Protein Interactions Prediction Based on Graph Energy and Protein Sequence Information
    Xu, Da
    Xu, Hanxiao
    Zhang, Yusen
    Chen, Wei
    Gao, Rui
    MOLECULES, 2020, 25 (08):
  • [39] Prediction of protein-protein interactions based on elastic net and deep forest
    Yu, Bin
    Chen, Cheng
    Wang, Xiaolin
    Yu, Zhaomin
    Ma, Anjun
    Liu, Bingqiang
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 176
  • [40] Sequence-based machine learning method for predicting the effects of phosphorylation on protein-protein interactions
    Hong, Xiaokun
    Lv, Jiyang
    Li, Zhengxin
    Xiong, Yi
    Zhang, Jian
    Chen, Hai-Feng
    INTERNATIONAL JOURNAL OF BIOLOGICAL MACROMOLECULES, 2023, 243