CPIELA: Computational Prediction of Plant Protein-Protein Interactions by Ensemble Learning Approach From Protein Sequences and Evolutionary Information

被引:0
作者
Li, Li-Ping [1 ,2 ]
Zhang, Bo [1 ,2 ]
Cheng, Li [3 ]
机构
[1] Xinjiang Agr Univ, Coll Grassland & Environm Sci, Urumqi, Peoples R China
[2] Xinjiang Key Lab Grassland Resources & Ecol, Urumqi, Peoples R China
[3] Chinese Acad Sci, Xinjiang Tech Inst Phys & Chem, Urumqi, Peoples R China
基金
美国国家科学基金会;
关键词
plant; proteinprotein interactions; machine learning; sequence; evolutionary information; PSI-BLAST; DATABASE; BIOLOGY;
D O I
10.3389/fgene.2022.857839
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Identification and characterization of plant protein-protein interactions (PPIs) are critical in elucidating the functions of proteins and molecular mechanisms in a plant cell. Although experimentally validated plant PPIs data have become increasingly available in diverse plant species, the high-throughput techniques are usually expensive and labor-intensive. With the incredibly valuable plant PPIs data accumulating in public databases, it is progressively important to propose computational approaches to facilitate the identification of possible PPIs. In this article, we propose an effective framework for predicting plant PPIs by combining the position-specific scoring matrix (PSSM), local optimal-oriented pattern (LOOP), and ensemble rotation forest (ROF) model. Specifically, the plant protein sequence is firstly transformed into the PSSM, in which the protein evolutionary information is perfectly preserved. Then, the local textural descriptor LOOP is employed to extract texture variation features from PSSM. Finally, the ROF classifier is adopted to infer the potential plant PPIs. The performance of CPIELA is evaluated via cross-validation on three plant PPIs datasets: Arabidopsis thaliana, Zea mays, and Oryza sativa. The experimental results demonstrate that the CPIELA method achieved the high average prediction accuracies of 98.63%, 98.09%, and 94.02%, respectively. To further verify the high performance of CPIELA, we also compared it with the other state-of-the-art methods on three gold standard datasets. The experimental results illustrate that CPIELA is efficient and reliable for predicting plant PPIs. It is anticipated that the CPIELA approach could become a useful tool for facilitating the identification of possible plant PPIs.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Predicting protein-protein interactions via multivariate mutual information of protein sequences
    Yijie Ding
    Jijun Tang
    Fei Guo
    [J]. BMC Bioinformatics, 17
  • [22] Predicting protein-protein interactions via multivariate mutual information of protein sequences
    Ding, Yijie
    Tang, Jijun
    Guo, Fei
    [J]. BMC BIOINFORMATICS, 2016, 17
  • [23] Improving protein-protein interactions prediction accuracy using protein evolutionary information and relevance vector machine model
    An, Ji-Yong
    Meng, Fan-Rong
    You, Zhu-Hong
    Chen, Xing
    Yan, Gui-Ying
    Hu, Ji-Pu
    [J]. PROTEIN SCIENCE, 2016, 25 (10) : 1825 - 1833
  • [24] Accurate prediction of protein-protein interactions by integrating potential evolutionary information embedded in PSSM profile and discriminative vector machine classifier
    Li, Zheng-Wei
    You, Zhu-Hong
    Chen, Xing
    Li, Li-Ping
    Huang, De-Shuang
    Yan, Gui-Ying
    Nie, Ru
    Huang, Yu-An
    [J]. ONCOTARGET, 2017, 8 (14) : 23638 - 23649
  • [25] Protein-protein interactions prediction based on ensemble deep neural networks
    Zhang, Long
    Yu, Guoxian
    Xia, Dawen
    Wang, Jun
    [J]. NEUROCOMPUTING, 2019, 324 : 10 - 19
  • [26] Multi-level machine learning prediction of protein-protein interactions in Saccharomyces cerevisiae
    Zubek, Julian
    Tatjewski, Marcin
    Boniecki, Adam
    Mnich, Maciej
    Basu, Subhadip
    Plewczynski, Dariusz
    [J]. PEERJ, 2015, 3
  • [27] Highly Accurate Prediction of Protein-Protein Interactions via Incorporating Evolutionary Information and Physicochemical Characteristics
    Li, Zheng-Wei
    You, Zhu-Hong
    Chen, Xing
    Gui, Jie
    Nie, Ru
    [J]. INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2016, 17 (09):
  • [28] Evaluation of Machine Learning Algorithms on Protein-Protein Interactions
    Saha, Indrajit
    Klingstrom, Tomas
    Forsberg, Simon
    Wikander, Johan
    Zubek, Julian
    Kierczak, Marcin
    Plewczynski, Dariusz
    [J]. MAN-MACHINE INTERACTIONS 3, 2014, 242 : 211 - 218
  • [29] An ensemble approach for large-scale identification of protein-protein interactions using the alignments of multiple sequences
    Wang, Lei
    You, Zhu-Hong
    Chen, Xing
    Li, Jian-Qiang
    Yan, Xin
    Zhang, Wei
    Huang, Yu-An
    [J]. ONCOTARGET, 2017, 8 (03) : 5149 - 5159
  • [30] Predicting Protein-Protein Interactions Based on Ensemble Learning-Based Model from Protein Sequence
    Zhan, Xinke
    Xiao, Mang
    You, Zhuhong
    Yan, Chenggang
    Guo, Jianxin
    Wang, Liping
    Sun, Yaoqi
    Shang, Bingwan
    [J]. BIOLOGY-BASEL, 2022, 11 (07):