Protein Binding Affinity Prediction Using Support Vector Regression and Interfecial Features

被引:0
作者
Yaseen, Adiba [1 ]
Abbasi, Wajid Arshad [1 ]
Minhas, Fayyaz ul Amir Afsar [1 ]
机构
[1] PIEAS, Dept Comp & Informat Sci, Islamabad, Pakistan
来源
PROCEEDINGS OF 2018 15TH INTERNATIONAL BHURBAN CONFERENCE ON APPLIED SCIENCES AND TECHNOLOGY (IBCAST) | 2018年
关键词
Support vector machine (SVM); Area under the Receiver Operating Characteristic Curve (AUC-ROC); Area under the Precision-Recall Curve (AUC-PR);
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In understanding biology at the molecular level, analysis of protein interactions and protein binding affinity is a challenge. It is an important problem in computational and structural biology. Experimental measurement of binding affinity in the wet-lab is expensive and time consuming. Therefore, machine learning approaches are widely used to predict protein interactions and binding affinities by learning from specific properties of existing complexes. In this work, we propose an innovative computational model to predict binding affinities and interaction based on sequence, structural and interface features of the interacting proteins that are robust to binding associated conformational changes. We modeled the prediction of binding affinity asclassification and regression problem with least-squared and support vector regression models using structure and sequence features of proteins. Specifically, we have used the number and composition of interacting residues at protein complexes interface as features and sequence features. We evaluated the performance of our prediction models using Affinity Benchmark Dataset version 2.0 which contains a diverse set of both bound and unbound protein complex structures with known binding affinities. We evaluated our regression performance results with root mean square error (RMSE) as well as Spearman and Pearson's correlation coefficients using a leave-one-out cross-validation protocol. We evaluate classification results with AUC-ROC and AUC-PR. Our results show that Support Vector Regression performs significantly better than other models with a Spearman Correlation coefficient of 0.58, Pearson Correlation score of 0.55 and RMSE of 2.41 using 3-mer and sequence feature. It is interesting to note that simple features based on 3-mer features and the properties of the interface of a protein complex are predictive of its binding affinity. These features, together with support vector regression achieve higher accuracy than existing sequence based methods.
引用
收藏
页码:194 / 198
页数:5
相关论文
共 20 条
[1]   Machine-learning scoring functions to improve structure-based binding affinity prediction and virtual screening [J].
Ain, Qurrat Ul ;
Aleksandrova, Antoniya ;
Roessler, Florian D. ;
Ballester, Pedro J. .
WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE, 2015, 5 (06) :405-424
[2]  
Alberts B., 2002, Molecular Biology of the Cell. (4th edition), V4th ed
[3]   propy: a tool to generate various modes of Chou's PseAAC [J].
Cao, Dong-Sheng ;
Xu, Qing-Song ;
Liang, Yi-Zeng .
BIOINFORMATICS, 2013, 29 (07) :960-962
[4]   Where did the BLOSUM62 alignment score matrix come from? [J].
Eddy, SR .
NATURE BIOTECHNOLOGY, 2004, 22 (08) :1035-1036
[5]   How Structure Defines Affinity in Protein-Protein Interactions [J].
Erijman, Ariel ;
Rosenthal, Eran ;
Shifman, Julia M. .
PLOS ONE, 2014, 9 (10)
[6]  
Hassan Sergio A., 2005, V305, P451
[7]   Flexibility and small pockets at protein-protein interfaces: New insights into druggability [J].
Jubb, Harry ;
Blundell, Tom L. ;
Ascher, David B. .
PROGRESS IN BIOPHYSICS & MOLECULAR BIOLOGY, 2015, 119 (01) :2-9
[8]   A structure-based benchmark for protein-protein binding affinity [J].
Kastritis, Panagiotis L. ;
Moal, Iain H. ;
Hwang, Howook ;
Weng, Zhiping ;
Bates, Paul A. ;
Bonvin, Alexandre M. J. J. ;
Janin, Joel .
PROTEIN SCIENCE, 2011, 20 (03) :482-491
[9]   PaPI: pseudo amino acid composition to score human protein-coding variants [J].
Limongelli, Ivan ;
Marini, Simone ;
Bellazzi, Riccardo .
BMC BIOINFORMATICS, 2015, 16
[10]   Protein-protein binding affinity prediction on a diverse set of structures [J].
Moal, Iain H. ;
Agius, Rudi ;
Bates, Paul A. .
BIOINFORMATICS, 2011, 27 (21) :3002-3009