CPIELA: Computational Prediction of Plant Protein-Protein Interactions by Ensemble Learning Approach From Protein Sequences and Evolutionary Information

被引:0
|
作者
Li, Li-Ping [1 ,2 ]
Zhang, Bo [1 ,2 ]
Cheng, Li [3 ]
机构
[1] Xinjiang Agr Univ, Coll Grassland & Environm Sci, Urumqi, Peoples R China
[2] Xinjiang Key Lab Grassland Resources & Ecol, Urumqi, Peoples R China
[3] Chinese Acad Sci, Xinjiang Tech Inst Phys & Chem, Urumqi, Peoples R China
基金
美国国家科学基金会;
关键词
plant; proteinprotein interactions; machine learning; sequence; evolutionary information; PSI-BLAST; DATABASE; BIOLOGY;
D O I
10.3389/fgene.2022.857839
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Identification and characterization of plant protein-protein interactions (PPIs) are critical in elucidating the functions of proteins and molecular mechanisms in a plant cell. Although experimentally validated plant PPIs data have become increasingly available in diverse plant species, the high-throughput techniques are usually expensive and labor-intensive. With the incredibly valuable plant PPIs data accumulating in public databases, it is progressively important to propose computational approaches to facilitate the identification of possible PPIs. In this article, we propose an effective framework for predicting plant PPIs by combining the position-specific scoring matrix (PSSM), local optimal-oriented pattern (LOOP), and ensemble rotation forest (ROF) model. Specifically, the plant protein sequence is firstly transformed into the PSSM, in which the protein evolutionary information is perfectly preserved. Then, the local textural descriptor LOOP is employed to extract texture variation features from PSSM. Finally, the ROF classifier is adopted to infer the potential plant PPIs. The performance of CPIELA is evaluated via cross-validation on three plant PPIs datasets: Arabidopsis thaliana, Zea mays, and Oryza sativa. The experimental results demonstrate that the CPIELA method achieved the high average prediction accuracies of 98.63%, 98.09%, and 94.02%, respectively. To further verify the high performance of CPIELA, we also compared it with the other state-of-the-art methods on three gold standard datasets. The experimental results illustrate that CPIELA is efficient and reliable for predicting plant PPIs. It is anticipated that the CPIELA approach could become a useful tool for facilitating the identification of possible plant PPIs.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] An Efficient Ensemble Learning Approach for Predicting Protein-Protein Interactions by Integrating Protein Primary Sequence and Evolutionary Information
    You, Zhu-Hong
    Huang, Wen-Zhun
    Zhang, Shanwen
    Huang, Yu-An
    Yu, Chang-Qing
    Li, Li-Ping
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2019, 16 (03) : 809 - 817
  • [2] Computational Approaches for the Prediction of Protein-Protein Interactions: A Survey
    Theofilatos, Konstantinos A.
    Dimitrakopoulos, Christos M.
    Tsakalidis, Athanasios K.
    Likothanassis, Spyridon D.
    Papadimitriou, Stergios T.
    Mavroudi, Seferina P.
    CURRENT BIOINFORMATICS, 2011, 6 (04) : 398 - 414
  • [3] Probabilistic prediction of protein-protein interactions from the protein sequences
    Chinnasamy, Arunkumar
    Mittal, Ankush
    Sung, Wing-Kin
    COMPUTERS IN BIOLOGY AND MEDICINE, 2006, 36 (10) : 1143 - 1154
  • [4] Prediction of protein-protein interactions by label propagation with protein evolutionary and chemical information derived from heterogeneous network
    Wen, Yu-Ting
    Lei, Hai-Jun
    You, Zhu-Hong
    Lei, Bai-Ying
    Chen, Xing
    Li, Li-Ping
    JOURNAL OF THEORETICAL BIOLOGY, 2017, 430 : 9 - 20
  • [5] FWHT-RF: A Novel Computational Approach to Predict Plant Protein-Protein Interactions via an Ensemble Learning Method
    Pan, Jie
    Li, Li-Ping
    Yu, Chang-Qing
    You, Zhu-Hong
    Ren, Zhong-Hao
    Tang, Jing-Yu
    SCIENTIFIC PROGRAMMING, 2021, 2021
  • [6] Automated feature engineering improves prediction of protein-protein interactions
    Sumonja, Neven
    Gemovic, Branislava
    Veljkovic, Nevena
    Perovic, Vladimir
    AMINO ACIDS, 2019, 51 (08) : 1187 - 1200
  • [7] Advancing the prediction accuracy of protein-protein interactions by utilizing evolutionary information from position-specific scoring matrix and ensemble classifier
    Wang, Lei
    You, Zhu-Hong
    Xia, Shi-Xiong
    Liu, Feng
    Chen, Xing
    Yan, Xin
    Zhou, Yong
    JOURNAL OF THEORETICAL BIOLOGY, 2017, 418 : 105 - 110
  • [8] Hot spot prediction in protein-protein interactions by an ensemble system
    Liu, Quanya
    Chen, Peng
    Wang, Bing
    Zhang, Jun
    Li, Jinyan
    BMC SYSTEMS BIOLOGY, 2018, 12
  • [9] Computational Methods for the Prediction of Protein-Protein Interactions
    Xia, Jun-Feng
    Wang, Shu-Lin
    Lei, Ying-Ke
    PROTEIN AND PEPTIDE LETTERS, 2010, 17 (09) : 1069 - 1078
  • [10] An Ensemble Classifier with Random Projection for Predicting Protein-Protein Interactions Using Sequence and Evolutionary Information
    Song, Xiao-Yu
    Chen, Zhan-Heng
    Sun, Xiang-Yang
    You, Zhu-Hong
    Li, Li-Ping
    Zhao, Yang
    APPLIED SCIENCES-BASEL, 2018, 8 (01):