PANDA: Predicting the change in proteins binding affinity upon mutations by finding a signal in primary structures

被引:3
作者
Abbasi, Wajid Arshad [1 ]
Abbas, Syed Ali [1 ]
Andleeb, Saiqa [2 ]
机构
[1] Univ Azad Jammu & Kashmir, Dept Comp Sci & Informat Technol, Computat Biol & Data Anal Lab, King Abdullah Campus, Muzaffarabad 13100, Aj&k, Pakistan
[2] Univ Azad Jammu & Kashmir, Dept Zool, Biotechnol Lab, King Abdullah Campus, Muzaffarabad 13100, Aj&k, Pakistan
关键词
Protein-protein interaction; protein sequence analysis; machine learning; web services; mutational analysis; binding affinity; FREE-ENERGY; WEB-SERVER; STABILITY; COMPLEXES; PROTEOME;
D O I
10.1142/S0219720021500153
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Accurately determining a change in protein binding affinity upon mutations is important to find novel therapeutics and to assist mutagenesis studies. Determination of change in binding affinity upon mutations requires sophisticated, expensive, and time-consuming wet-lab experiments that can be supported with computational methods. Most of the available computational prediction techniques depend upon protein structures that bound their applicability to only protein complexes with recognized 3D structures. In this work, we explore the sequence-based prediction of change in protein binding affinity upon mutation and question the effectiveness of K-fold cross-validation (CV) across mutations adopted in previous studies to assess the generalization ability of such predictors with no known mutation during training. We have used protein sequence information instead of protein structures along with machine learning techniques to accurately predict the change in protein binding affinity upon mutation. Our proposed sequence-based novel change in protein binding affinity predictor called PANDA performs comparably to the existing methods gauged through an appropriate CV scheme and an external independent test dataset. On an external test dataset, our proposed method gives a maximum Pearson correlation coefficient of 0.52 in comparison to the state-of-the-art existing protein structure-based method called MutaBind which gives a maximum Pearson correlation coefficient of 0.59. Our proposed protein sequence-based method, to predict a change in binding affinity upon mutations, has wide applicability and comparable performance in comparison to existing protein structure-based methods. We made PANDA easily accessible through a cloudbased webserver and python code available at https://sites.google.com/view/wajidarshad/ software and https://github.com/wajidarshad/panda, respectively.
引用
收藏
页数:19
相关论文
共 58 条
[1]   ISLAND: in-silico proteins binding affinity prediction using sequence information [J].
Abbasi, Wajid Arshad ;
Yaseen, Adiba ;
Ul Hassan, Fahad ;
Andleeb, Saiqa ;
Minhas, Fayyaz Ul Amir Afsar .
BIODATA MINING, 2020, 13 (01)
[2]   Learning protein binding affinity using privileged information [J].
Abbasi, Wajid Arshad ;
Asif, Amina ;
Ben-Hur, Asa ;
Minhas, Fayyaz ul Amir Afsar .
BMC BIOINFORMATICS, 2018, 19
[3]   Issues in performance evaluation for host-pathogen protein interaction prediction [J].
Abbasi, Wajid Arshad ;
Minhas, Fayyaz Ul Amir Afsar .
JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2016, 14 (03)
[4]   Machine-learning scoring functions to improve structure-based binding affinity prediction and virtual screening [J].
Ain, Qurrat Ul ;
Aleksandrova, Antoniya ;
Roessler, Florian D. ;
Ballester, Pedro J. .
WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE, 2015, 5 (06) :405-424
[5]   Algorithmic approaches to protein-protein interaction site prediction [J].
Aumentado-Armstrong, Tristan T. ;
Istrate, Bogdan ;
Murgita, Robert A. .
ALGORITHMS FOR MOLECULAR BIOLOGY, 2015, 10
[6]   A machine learning approach to predicting protein-ligand binding affinity with applications to molecular docking [J].
Ballester, Pedro J. ;
Mitchell, John B. O. .
BIOINFORMATICS, 2010, 26 (09) :1169-1175
[7]   Combining Structural Modeling with Ensemble Machine Learning to Accurately Predict Protein Fold Stability and Binding Affinity Effects upon Mutation [J].
Berliner, Niklas ;
Teyra, Joan ;
Colak, Recep ;
Garcia Lopez, Sebastian ;
Kim, Philip M. .
PLOS ONE, 2014, 9 (09)
[8]   Development of a machine-learning model to predict Gibbs free energy of binding for protein-ligand complexes [J].
Bitencourt-Ferreira, Gabriela ;
de Azevedo, Walter Filgueira .
BIOPHYSICAL CHEMISTRY, 2018, 240 :63-69
[9]   Anatomy of hot spots in protein interfaces [J].
Bogan, AA ;
Thorn, KS .
JOURNAL OF MOLECULAR BIOLOGY, 1998, 280 (01) :1-9
[10]  
Breiman L., 2001, IEEE Trans. Broadcast., V45, P5