An improved sequence-based prediction protocol for protein-protein interactions using amino acids substitution matrix and rotation forest ensemble classifiers

被引:44
|
作者
You, Zhu-Hong [1 ]
Li, Xiao [1 ]
Chan, Keith C. C. [2 ]
机构
[1] Chinese Acad Sci, Tech Inst Phys & Chem, Urumqi 830011, Peoples R China
[2] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Hong Kong, Peoples R China
基金
美国国家科学基金会;
关键词
Protein-protein interaction; Substitution matrix; Rotation forest; Protein sequence; Ensemble classifier; IDENTIFICATION; HYPERPLANES; COMPLEXES;
D O I
10.1016/j.neucom.2016.10.042
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Protein-protein Interactions (PPIs) play important roles in a wide variety of cellular processes, including metabolic cycles, DNA transcription and replication, and signaling cascades High-throughput biological experiments for identifying PPIs are beginning to provide valuable information about the complexity of PPI networks, but are expensive, cumbersome, and extremely time-consuming. Hence, there is a need for accurate and robust computational methods for predicting PPIs. In this article, a sequence-based approach is proposed by combining a novel amino acid substitution matrix feature representation and Rotation Forest (RF) classifier. Given the protein sequences as input, the proposed method predicts whether or not the pair of proteins interacts. When performed on the PPI data of Saccharomyces cerevisiae, the proposed method achieved 93.74% prediction accuracy with 90.05% sensitivity at the precision of 97.08%. Extensive experiments are performed to compare our method with the existing sequence-based method. Experimental results demonstrate that PPIs can be reliably predicted using only sequence-derived information. Achieved results show that the proposed approach offers an inexpensive method for computational construction of PPI networks, so it can be a useful supplementary tool for future proteomics studies.
引用
收藏
页码:277 / 282
页数:6
相关论文
共 50 条
  • [31] Predicting Protein-Protein Interaction Sites Using Sequence Descriptors and Site Propensity of Neighboring Amino Acids
    Kuo, Tzu-Hao
    Li, Kuo-Bin
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2016, 17 (11)
  • [32] Predicting Primary Sequence-Based Protein-Protein Interactions Using a Mercer Series Representation of Nonlinear Support Vector Machine
    Chatrabgoun, Omid
    Daneshkhah, Alireza
    Esmaeilbeigi, Mohsen
    Sohrabi Safa, Nader
    Alenezi, Ali H.
    Rahman, Arafatur
    IEEE ACCESS, 2022, 10 : 124345 - 124354
  • [33] Sequence-based prediction of protein binding regions and drug–target interactions
    Ingoo Lee
    Hojung Nam
    Journal of Cheminformatics, 14
  • [34] Critical assessment of sequence-based protein-protein interaction prediction methods that do not require homologous protein sequences
    Park, Yungki
    BMC BIOINFORMATICS, 2009, 10 : 419
  • [35] A Sequence-Based Dynamic Ensemble Learning System for Protein Ligand-Binding Site Prediction
    Chen, Peng
    Hu, ShanShan
    Zhang, Jun
    Gao, Xin
    Li, Jinyan
    Xia, Junfeng
    Wang, Bing
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2016, 13 (05) : 901 - 912
  • [36] Accurate prediction of protein-protein interactions from sequence alignments using a Bayesian method
    Burger, Lukas
    van Nimwegen, Erik
    MOLECULAR SYSTEMS BIOLOGY, 2008, 4 (1)
  • [37] Performance of rotation forest ensemble classifier and feature extractor in predicting protein interactions using amino acid sequences
    Alhadi Bustamam
    Mohamad I. S. Musti
    Susilo Hartomo
    Shirley Aprilia
    Patuan P. Tampubolon
    Dian Lestari
    BMC Genomics, 20
  • [38] Performance of rotation forest ensemble classifier and feature extractor in predicting protein interactions using amino acid sequences
    Bustamam, Alhadi
    Musti, Mohamad I. S.
    Hartomo, Susilo
    Aprilia, Shirley
    Tampubolon, Patuan P.
    Lestari, Dian
    BMC GENOMICS, 2019, 20 (Suppl 9)
  • [39] Prediction of Protein-Protein Interactions from Amino Acid Sequences using Extreme Learning Machine Combined with Auto Covariance Descriptor
    You, Zhu-Hong
    Li, Liping
    Ji, Zhen
    Li, Min
    Guo, Sen
    2013 IEEE WORKSHOP ON MEMETIC COMPUTING (MC), 2013, : 80 - 85
  • [40] RF-PSSM: A Combination of Rotation Forest Algorithm and Position-Specific Scoring Matrix for Improved Prediction of Protein-Protein Interactions Between Hepatitis C Virus and Human
    Liu, Xin
    Lu, Yaping
    Wang, Liang
    Geng, Wei
    Shi, Xinyi
    Zhang, Xiao
    BIG DATA MINING AND ANALYTICS, 2023, 6 (01) : 21 - 31