An improved sequence-based prediction protocol for protein-protein interactions using amino acids substitution matrix and rotation forest ensemble classifiers

被引:44
|
作者
You, Zhu-Hong [1 ]
Li, Xiao [1 ]
Chan, Keith C. C. [2 ]
机构
[1] Chinese Acad Sci, Tech Inst Phys & Chem, Urumqi 830011, Peoples R China
[2] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Hong Kong, Peoples R China
基金
美国国家科学基金会;
关键词
Protein-protein interaction; Substitution matrix; Rotation forest; Protein sequence; Ensemble classifier; IDENTIFICATION; HYPERPLANES; COMPLEXES;
D O I
10.1016/j.neucom.2016.10.042
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Protein-protein Interactions (PPIs) play important roles in a wide variety of cellular processes, including metabolic cycles, DNA transcription and replication, and signaling cascades High-throughput biological experiments for identifying PPIs are beginning to provide valuable information about the complexity of PPI networks, but are expensive, cumbersome, and extremely time-consuming. Hence, there is a need for accurate and robust computational methods for predicting PPIs. In this article, a sequence-based approach is proposed by combining a novel amino acid substitution matrix feature representation and Rotation Forest (RF) classifier. Given the protein sequences as input, the proposed method predicts whether or not the pair of proteins interacts. When performed on the PPI data of Saccharomyces cerevisiae, the proposed method achieved 93.74% prediction accuracy with 90.05% sensitivity at the precision of 97.08%. Extensive experiments are performed to compare our method with the existing sequence-based method. Experimental results demonstrate that PPIs can be reliably predicted using only sequence-derived information. Achieved results show that the proposed approach offers an inexpensive method for computational construction of PPI networks, so it can be a useful supplementary tool for future proteomics studies.
引用
收藏
页码:277 / 282
页数:6
相关论文
共 50 条
  • [1] Sequence-Based Prediction of Protein-Protein Interactions Using Pseudo Substitution Matrix Representation Features and Ensemble Rotation Forest Classifier in HIV (Human Immunodeficiency Virus)
    Lestari, D.
    Hartomo, S.
    Bustamam, A.
    PROCEEDINGS OF THE 3RD INTERNATIONAL SYMPOSIUM ON CURRENT PROGRESS IN MATHEMATICS AND SCIENCES 2017 (ISCPMS2017), 2018, 2023
  • [2] Sequence-Based Prediction of Protein-Protein Interactions by Means of Rotation Forest and Autocorrelation Descriptor
    Xia, Jun-Feng
    Han, Kyungsook
    Huang, De-Shuang
    PROTEIN AND PEPTIDE LETTERS, 2010, 17 (01) : 137 - 145
  • [3] Sequence-Based Prediction of Plant Protein-Protein Interactions by Combining Discrete Sine Transformation With Rotation Forest
    Pan, Jie
    Li, Li-Ping
    Yu, Chang-Qing
    You, Zhu-Hong
    Guan, Yong-Jian
    Ren, Zhong-Hao
    EVOLUTIONARY BIOINFORMATICS, 2021, 17
  • [4] Recent developments of sequence-based prediction of protein-protein interactions
    Murakami, Yoichi
    Mizuguchi, Kenji
    BIOPHYSICAL REVIEWS, 2022, 14 (06) : 1393 - 1411
  • [5] Improved prediction of protein-protein interactions using novel negative samples, features, and an ensemble classifier
    Wei, Leyi
    Xing, Pengwei
    Zeng, Jiancang
    Chen, JinXiu
    Su, Ran
    Guo, Fei
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2017, 83 : 67 - 74
  • [6] Prediction of Protein-Protein Interactions Using Local Description of Amino Acid Sequence
    Zhou, Yu Zhen
    Gao, Yun
    Zheng, Ying Ying
    ADVANCES IN COMPUTER SCIENCE AND EDUCATION APPLICATIONS, PT II, 2011, 202 : 254 - +
  • [7] Sequence-Based Prediction of Protein-Protein Interactions Using Ensemble Based Classifier Combined with Global Encoding in HIV (Human Immunodeficiency Virus)
    Lestari, D.
    Musti, M. I. S.
    Bustamam, A.
    PROCEEDINGS OF THE 3RD INTERNATIONAL SYMPOSIUM ON CURRENT PROGRESS IN MATHEMATICS AND SCIENCES 2017 (ISCPMS2017), 2018, 2023
  • [8] Using Two-dimensional Principal Component Analysis and Rotation Forest for Prediction of Protein-Protein Interactions
    Wang, Lei
    You, Zhu-Hong
    Yan, Xin
    Xia, Shi-Xiong
    Liu, Feng
    Li, Li-Ping
    Zhang, Wei
    Zhou, Yong
    SCIENTIFIC REPORTS, 2018, 8
  • [9] Sequence-based prediction of protein-protein interactions using weighted sparse representation model combined with global encoding
    Huang, Yu-An
    You, Zhu-Hong
    Chen, Xing
    Chan, Keith
    Luo, Xin
    BMC BIOINFORMATICS, 2016, 17
  • [10] Evolution of Sequence-based Bioinformatics Tools for Protein-protein Interaction Prediction
    Khatun, Mst Shamima
    Shoombuatong, Watshara
    Hasan, Md Mehedi
    Kurata, Hiroyuki
    CURRENT GENOMICS, 2020, 21 (06) : 454 - 463