Application of Genetic Programming (GP) Formalism for Building Disease Predictive Models from Protein-Protein Interactions (PPI) Data

被引:13
|
作者
Vyas, Renu [1 ]
Bapat, Sanket [2 ]
Goel, Purva [3 ]
Karthikeyan, Muthukumarasamy [2 ]
Tambe, Sanjeev S. [3 ]
Kulkarni, Bhaskar D. [3 ]
机构
[1] MIT Sch Bioengn Sci & Res, Pune 411008, Maharashtra, India
[2] Natl Chem Lab, CSIR, DIRC, Pune 411008, Maharashtra, India
[3] Natl Chem Lab, CSIR, Chem Engn & Proc Dev Div, Pune 411008, Maharashtra, India
关键词
Genetic programming; protein-protein interactions; disease; binding energy; machine learning; cancer; symbolic regression; BREAST-CANCER SURVIVABILITY; SELECTION;
D O I
10.1109/TCBB.2016.2621042
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Protein-protein interactions (PPIs) play a vital role in the biological processes involved in the cell functions and disease pathways. The experimental methods known to predict PPIs require tremendous efforts and the results are often hindered by the presence of a large number of false positives. Herein, we demonstrate the use of a new Genetic Programming (GP) based Symbolic Regression (SR) approach for predicting PPIs related to a disease. In this case study, a dataset consisting of 135 PPI complexes related to cancer was used to construct a generic PPI predicting model with good PPI prediction accuracy and generalization ability. A high correlation coefficient (CC) magnitude of 0.893, and low root mean square error (RMSE), and mean absolute percentage error (MAPE) values of 478.221 and 0.239, respectively, were achieved for both the training and test set outputs. To validate the discriminatory nature of the model, it was applied on a dataset of diabetes complexes where it yielded significantly low CC values. Thus, the GP model developed here serves a dual purpose: (a) a predictor of the binding energy of cancer related PPI complexes, and (b) a classifier for discriminating PPI complexes related to cancer from those of other diseases.
引用
收藏
页码:27 / 37
页数:11
相关论文
共 45 条
  • [11] A Clustering Genetic Algorithm for Inferring Protein-Protein Functional Interactions from Phylogenetic Profiles
    Tapia, Jose Juan
    Vallejo, Edgar E.
    2008 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-8, 2008, : 2757 - 2763
  • [12] Modulating Protein-Protein Interactions: From Structural Determinants of Binding to Druggability Prediction to Application
    Metz, Alexander
    Ciglia, Emanuele
    Gohlke, Holger
    CURRENT PHARMACEUTICAL DESIGN, 2012, 18 (30) : 4630 - 4647
  • [13] Identifying binary protein-protein interactions from affinity purification mass spectrometry data
    Zhang, Xiao-Fei
    Le Ou-Yang
    Hu, Xiaohua
    Dai, Dao-Qing
    BMC GENOMICS, 2015, 16
  • [14] Identifying binary protein-protein interactions from affinity purification mass spectrometry data
    Xiao-Fei Zhang
    Le Ou-Yang
    Xiaohua Hu
    Dao-Qing Dai
    BMC Genomics, 16
  • [15] Message-passing algorithms for the prediction of protein domain interactions from protein-protein interaction data
    Iqbal, Mudassar
    Freitas, Alex A.
    Johnson, Colin G.
    Vergassola, Massimo
    BIOINFORMATICS, 2008, 24 (18) : 2064 - 2070
  • [16] Assembly of exons from unitary transposable genetic elements: Implications for the evolution of protein-protein interactions
    Dwyer, DS
    JOURNAL OF THEORETICAL BIOLOGY, 1998, 194 (01) : 11 - 27
  • [17] Graphical models of protein-protein interaction specificity from correlated mutations and interaction data
    Thomas, John
    Ramakrishnan, Naren
    Bailey-Kellogg, Chris
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2009, 76 (04) : 911 - 929
  • [18] EvoPPI: A Web Application to Compare Protein-Protein Interactions (PPIs) from Different Databases and Species
    Vazquez, Noe
    Rocha, Sara
    Lopez-Fernandez, Hugo
    Torres, Andre
    Camacho, Rui
    Fdez-Riverola, Florentino
    Vieira, Jorge
    Vieira, Cristina P.
    Reboiro-Jato, Miguel
    PRACTICAL APPLICATIONS OF COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2019, 803 : 149 - 156
  • [19] Proteome-wide prediction of protein-protein interactions from high-throughput data
    Liu, Zhi-Ping
    Chen, Luonan
    PROTEIN & CELL, 2012, 3 (07) : 508 - 520
  • [20] Proteome-wide prediction of protein-protein interactions from high-throughput data
    ZhiPing Liu
    Luonan Chen
    Protein & Cell, 2012, 3 (07) : 508 - 520