An improved sequence-based prediction protocol for protein-protein interactions using amino acids substitution matrix and rotation forest ensemble classifiers

被引:44
|
作者
You, Zhu-Hong [1 ]
Li, Xiao [1 ]
Chan, Keith C. C. [2 ]
机构
[1] Chinese Acad Sci, Tech Inst Phys & Chem, Urumqi 830011, Peoples R China
[2] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Hong Kong, Peoples R China
基金
美国国家科学基金会;
关键词
Protein-protein interaction; Substitution matrix; Rotation forest; Protein sequence; Ensemble classifier; IDENTIFICATION; HYPERPLANES; COMPLEXES;
D O I
10.1016/j.neucom.2016.10.042
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Protein-protein Interactions (PPIs) play important roles in a wide variety of cellular processes, including metabolic cycles, DNA transcription and replication, and signaling cascades High-throughput biological experiments for identifying PPIs are beginning to provide valuable information about the complexity of PPI networks, but are expensive, cumbersome, and extremely time-consuming. Hence, there is a need for accurate and robust computational methods for predicting PPIs. In this article, a sequence-based approach is proposed by combining a novel amino acid substitution matrix feature representation and Rotation Forest (RF) classifier. Given the protein sequences as input, the proposed method predicts whether or not the pair of proteins interacts. When performed on the PPI data of Saccharomyces cerevisiae, the proposed method achieved 93.74% prediction accuracy with 90.05% sensitivity at the precision of 97.08%. Extensive experiments are performed to compare our method with the existing sequence-based method. Experimental results demonstrate that PPIs can be reliably predicted using only sequence-derived information. Achieved results show that the proposed approach offers an inexpensive method for computational construction of PPI networks, so it can be a useful supplementary tool for future proteomics studies.
引用
收藏
页码:277 / 282
页数:6
相关论文
共 50 条
  • [21] Recent developments of sequence-based prediction of protein–protein interactions
    Yoichi Murakami
    Kenji Mizuguchi
    Biophysical Reviews, 2022, 14 : 1393 - 1411
  • [22] Evolution of Sequence-based Bioinformatics Tools for Protein-protein Interaction Prediction
    Khatun, Mst Shamima
    Shoombuatong, Watshara
    Hasan, Md Mehedi
    Kurata, Hiroyuki
    CURRENT GENOMICS, 2020, 21 (06) : 454 - 463
  • [23] PPI-Detect: A Support Vector Machine Model for Sequence-Based Prediction of Protein-Protein Interactions
    Romero-Molina, Sandra
    Ruiz-Blanco, Yasser B.
    Harms, Mirja
    Muench, Jan
    Sanchez-Garcia, Elsa
    JOURNAL OF COMPUTATIONAL CHEMISTRY, 2019, 40 (11) : 1233 - 1242
  • [24] Improved prediction of protein-protein interactions using novel negative samples, features, and an ensemble classifier
    Wei, Leyi
    Xing, Pengwei
    Zeng, Jiancang
    Chen, JinXiu
    Su, Ran
    Guo, Fei
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2017, 83 : 67 - 74
  • [25] Prediction of Protein-Protein Interactions Using An Effective Sequence Based Combined Method
    Goktepe, Yunus Emre
    Kodaz, Halife
    NEUROCOMPUTING, 2018, 303 : 68 - 74
  • [26] Efficient prediction of protein-protein interactions using sequence information
    Guarracino, Mario R.
    Nebbia, Adriano
    Manna, Valeria
    Chinchuluun, Altannar
    Pardalos, Panos M.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPLEX, INTELLIGENT AND SOFTWARE INTENSIVE SYSTEMS (CISIS 2010), 2010, : 677 - 682
  • [27] Cracking the black box of deep sequence-based protein-protein interaction prediction
    Bernett, Judith
    Blumenthal, David B.
    List, Markus
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (02)
  • [28] Sequence-based protein-protein interaction prediction via support vector machine
    Yongcui Wang
    Jiguang Wang
    Zhixia Yang
    Naiyang Deng
    Journal of Systems Science and Complexity, 2010, 23 : 1012 - 1023
  • [29] DeNovo: virus-host sequence-based protein-protein interaction prediction
    Eid, Fatma-Elzahraa
    ElHefnawi, Mahmoud
    Heath, Lenwood S.
    BIOINFORMATICS, 2016, 32 (08) : 1144 - 1150
  • [30] Protein-protein interactions prediction based on ensemble deep neural networks
    Zhang, Long
    Yu, Guoxian
    Xia, Dawen
    Wang, Jun
    NEUROCOMPUTING, 2019, 324 : 10 - 19