Predicting Protein-Protein Interactions Based on Ensemble Learning-Based Model from Protein Sequence

被引:2
|
作者
Zhan, Xinke [1 ]
Xiao, Mang [2 ]
You, Zhuhong [3 ]
Yan, Chenggang [4 ,5 ]
Guo, Jianxin [1 ]
Wang, Liping [1 ]
Sun, Yaoqi [4 ]
Shang, Bingwan [1 ]
机构
[1] Xijing Univ, Sch Informat Engn, Xian 710123, Peoples R China
[2] Zhejiang Univ, Sir Run Run Shaw Hosp, Hangzhou 310016, Peoples R China
[3] Northwestern Polytech Univ, Sch Comp Sci, Xian 710129, Peoples R China
[4] Hangzhou Dianzi Univ, Sch Automat, Hangzhou 310018, Peoples R China
[5] Shandong Univ, Sch Mech Elect & Informat Engn, Weihai 264209, Peoples R China
来源
BIOLOGY-BASEL | 2022年 / 11卷 / 07期
基金
中国国家自然科学基金;
关键词
locality preserving projections; rotation forest; PSSM; SVM; KNN; RESIDUE CONSERVATION; ROTATION FOREST; DATABASE; PSSM; IDENTIFICATION; HYPERPLANES; NETWORKS; PROFILE; SITES;
D O I
10.3390/biology11070995
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Simple Summary Due to most traditional high-throughput experiments are tedious and laborious in identifying potential protein-protein interaction. To better improve accuracy prediction in protein-protein interactions. We proposed a novel computational method that can identify unknown protein-protein interaction efficiently and hope this method can provide a helpful idea and tool for proteomics research. Protein-protein interactions (PPIs) play an essential role in many biological cellular functions. However, it is still tedious and time-consuming to identify protein-protein interactions through traditional experimental methods. For this reason, it is imperative and necessary to develop a computational method for predicting PPIs efficiently. This paper explores a novel computational method for detecting PPIs from protein sequence, the approach which mainly adopts the feature extraction method: Locality Preserving Projections (LPP) and classifier: Rotation Forest (RF). Specifically, we first employ the Position Specific Scoring Matrix (PSSM), which can remain evolutionary information of biological for representing protein sequence efficiently. Then, the LPP descriptor is applied to extract feature vectors from PSSM. The feature vectors are fed into the RF to obtain the final results. The proposed method is applied to two datasets: Yeast and H. pylori, and obtained an average accuracy of 92.81% and 92.56%, respectively. We also compare it with K nearest neighbors (KNN) and support vector machine (SVM) to better evaluate the performance of the proposed method. In summary, all experimental results indicate that the proposed approach is stable and robust for predicting PPIs and promising to be a useful tool for proteomics research.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Predicting Protein-Protein Interactions based on ensemble classifiers
    Zhou, Zheng-Rong
    Song, Xiao-Feng
    Wang, Ming-Hao
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2010, 38 (06): : 1464 - 1467
  • [2] Predicting protein-protein interactions through sequence-based deep learning
    Hashemifar, Somaye
    Neyshabur, Behnam
    Khan, Aly A.
    Xu, Jinbo
    BIOINFORMATICS, 2018, 34 (17) : 802 - 810
  • [3] A Novel Ensemble Learning-Based Computational Method to Predict Protein-Protein Interactions from Protein Primary Sequences
    Pan, Jie
    Wang, Shiwei
    Yu, Changqing
    Li, Liping
    You, Zhuhong
    Sun, Yanmei
    BIOLOGY-BASEL, 2022, 11 (05):
  • [4] Protein Features Identification for Machine Learning-Based Prediction of Protein-Protein Interactions
    Raza, Khalid
    INFORMATION, COMMUNICATION AND COMPUTING TECHNOLOGY, 2017, 750 : 305 - 317
  • [5] An Efficient Ensemble Learning Approach for Predicting Protein-Protein Interactions by Integrating Protein Primary Sequence and Evolutionary Information
    You, Zhu-Hong
    Huang, Wen-Zhun
    Zhang, Shanwen
    Huang, Yu-An
    Yu, Chang-Qing
    Li, Li-Ping
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2019, 16 (03) : 809 - 817
  • [6] Sequence-based machine learning method for predicting the effects of phosphorylation on protein-protein interactions
    Hong, Xiaokun
    Lv, Jiyang
    Li, Zhengxin
    Xiong, Yi
    Zhang, Jian
    Chen, Hai-Feng
    INTERNATIONAL JOURNAL OF BIOLOGICAL MACROMOLECULES, 2023, 243
  • [7] Predicting the Druggability of Protein-Protein Interactions Based on Sequence and Structure Features of Active Pockets
    Dai, Xu
    Jing, RunYu
    Guo, Yanzhi
    Dong, YongCheng
    Wang, YueLong
    Liu, Yuan
    Pu, XueMei
    Li, Menglong
    CURRENT PHARMACEUTICAL DESIGN, 2015, 21 (21) : 3051 - 3061
  • [8] Predicting protein-protein interactions based on protein-domain relationships
    Wang, B
    Huang, DS
    Chen, P
    Zhu, YP
    Li, YX
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), VOLS 1-5, 2005, : 316 - 319
  • [9] Sequence Representations and Their Utility for Predicting Protein-Protein Interactions
    Kimothi, Dhananjay
    Biyani, Pravesh
    Hogan, James M.
    Davis, Melissa J.
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (01) : 646 - 657
  • [10] Seq-BEL: Sequence-Based Ensemble Learning for Predicting Virus-Human Protein-Protein Interaction
    Ma, Yingjun
    He, Tingting
    Tan, Yuting
    Jiang, Xingpeng
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (03) : 1322 - 1333