An Ensemble Classifier with Random Projection for Predicting Protein-Protein Interactions Using Sequence and Evolutionary Information

被引:15
|
作者
Song, Xiao-Yu [1 ]
Chen, Zhan-Heng [2 ]
Sun, Xiang-Yang [1 ]
You, Zhu-Hong [2 ]
Li, Li-Ping [2 ]
Zhao, Yang [1 ]
机构
[1] Lanzhou Jiaotong Univ, Sch Elect & Informat Engn, Lanzhou 730070, Gansu, Peoples R China
[2] Chinese Acad Sci, Xinjiang Tech Inst Phys & Chem, Urumqi 830011, Xinjiang, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2018年 / 8卷 / 01期
基金
美国国家科学基金会;
关键词
protein-protein interactions; position-specific scoring matrix; random projection ensemble classifier; support vector machine; AMINO-ACID-SEQUENCES; MACHINES; DATABASE; SPACES;
D O I
10.3390/app8010089
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Identifying protein-protein interactions (PPIs) is crucial to comprehend various biological processes in cells. Although high-throughput techniques generate many PPI data for various species, they are only a petty minority of the entire PPI network. Furthermore, these approaches are costly and time-consuming and have a high error rate. Therefore, it is necessary to design computational methods for efficiently detecting PPIs. In this study, a random projection ensemble classifier (RPEC) was explored to identify novel PPIs using evolutionary information contained in protein amino acid sequences. The evolutionary information was obtained from a position-specific scoring matrix (PSSM) generated from PSI-BLAST. A novel feature fusion scheme was then developed by combining discrete cosine transform (DCT), fast Fourier transform (FFT), and singular value decomposition (SVD). Finally, via the random projection ensemble classifier, the performance of the presented approach was evaluated on Yeast, Human, and H. pylori PPI datasets using 5-fold cross-validation. Our approach achieved high prediction accuracies of 95.64%, 96.59%, and 87.62%, respectively, effectively outperforming other existing methods. Generally speaking, our approach is quite promising and supplies a practical and effective method for predicting novel PPIs.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] An Efficient Ensemble Learning Approach for Predicting Protein-Protein Interactions by Integrating Protein Primary Sequence and Evolutionary Information
    You, Zhu-Hong
    Huang, Wen-Zhun
    Zhang, Shanwen
    Huang, Yu-An
    Yu, Chang-Qing
    Li, Li-Ping
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2019, 16 (03) : 809 - 817
  • [2] An Ensemble Classifier to Predict Protein-Protein Interactions by Combining PSSM-based Evolutionary Information with Local Binary Pattern Model
    Li, Yang
    Li, Li-Ping
    Wang, Lei
    Yu, Chang-Qing
    Wang, Zheng
    You, Zhu-Hong
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2019, 20 (14)
  • [3] Advancing the prediction accuracy of protein-protein interactions by utilizing evolutionary information from position-specific scoring matrix and ensemble classifier
    Wang, Lei
    You, Zhu-Hong
    Xia, Shi-Xiong
    Liu, Feng
    Chen, Xing
    Yan, Xin
    Zhou, Yong
    JOURNAL OF THEORETICAL BIOLOGY, 2017, 418 : 105 - 110
  • [4] Predicting protein-protein interactions by a supervised learning classifier
    Huang, Y
    Frishman, D
    Muchnik, I
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2004, 28 (04) : 291 - 301
  • [5] Predicting Protein-Protein Interactions via Random Ferns with Evolutionary Matrix Representation
    Li, Yang
    Wang, Zheng
    You, Zhu-Hong
    Li, Li-Ping
    Hu, Xuegang
    COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2022, 2022
  • [6] Accurate prediction of protein-protein interactions by integrating potential evolutionary information embedded in PSSM profile and discriminative vector machine classifier
    Li, Zheng-Wei
    You, Zhu-Hong
    Chen, Xing
    Li, Li-Ping
    Huang, De-Shuang
    Yan, Gui-Ying
    Nie, Ru
    Huang, Yu-An
    ONCOTARGET, 2017, 8 (14) : 23638 - 23649
  • [7] An Ensemble Classifier with Random Projection for Predicting Multi-label Protein Subcellular Localization
    Wan, Shibiao
    Mak, Man-Wai
    Zhang, Bai
    Wang, Yue
    Kung, Sun-Yuan
    2013 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2013,
  • [8] Predicting Protein-Protein Interactions Based on Ensemble Learning-Based Model from Protein Sequence
    Zhan, Xinke
    Xiao, Mang
    You, Zhuhong
    Yan, Chenggang
    Guo, Jianxin
    Wang, Liping
    Sun, Yaoqi
    Shang, Bingwan
    BIOLOGY-BASEL, 2022, 11 (07):
  • [9] Improving protein-protein interactions prediction accuracy using XGBoost feature selection and stacked ensemble classifier
    Chen, Cheng
    Zhang, Qingmei
    Yu, Bin
    Yu, Zhaomin
    Lawrence, Patrick J.
    Ma, Qin
    Zhang, Yan
    COMPUTERS IN BIOLOGY AND MEDICINE, 2020, 123
  • [10] CPIELA: Computational Prediction of Plant Protein-Protein Interactions by Ensemble Learning Approach From Protein Sequences and Evolutionary Information
    Li, Li-Ping
    Zhang, Bo
    Cheng, Li
    FRONTIERS IN GENETICS, 2022, 13