Sequence-Based Prediction of Protein-Peptide Binding Sites Using Support Vector Machine

被引:85
作者
Taherzadeh, Ghazaleh [1 ]
Yang, Yuedong [1 ,2 ]
Zhang, Tuo [3 ]
Liew, Alan Wee-Chung [1 ]
Zhou, Yaoqi [1 ,2 ]
机构
[1] Griffith Univ, Sch Informat & Commun Technol, Parklands Dr, Southport, Qld 4215, Australia
[2] Griffith Univ, Inst Glycom, Parklands Dr, Southport, Qld 4215, Australia
[3] Weill Cornell Med Coll, 1300 York Ave, New York, NY 10065 USA
基金
澳大利亚研究理事会; 中国国家自然科学基金; 英国医学研究理事会;
关键词
protein-peptide; binding site; sequence-based; prediction; features; machine learning; support vector machine; IDENTIFICATION; SURFACES; GENERATION; DATABASE;
D O I
10.1002/jcc.24314
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Protein-peptide interactions are essential for all cellular processes including DNA repair, replication, gene-expression, and metabolism. As most protein-peptide interactions are uncharacterized, it is cost effective to investigate them computationally as the first step. All existing approaches for predicting protein-peptide binding sites, however, are based on protein structures despite the fact that the structures for most proteins are not yet solved. This article proposes the first machine-learning method called SPRINT to make Sequence-based prediction of Protein-peptide Residue-level Interactions. SPRINT yields a robust and consistent performance for 10-fold cross validations and independent test. The most important feature is evolution-generated sequence profiles. For the test set (1056 binding and non-binding residues), it yields a Matthews' Correlation Coefficient of 0.326 with a sensitivity of 64% and a specificity of 68%. This sequence-based technique shows comparable or more accurate than structure-based methods for peptide-binding site prediction. SPRINT is available as an online server at: http://sparks-lab.org/. (C) 2016 Wiley Periodicals, Inc.
引用
收藏
页码:1223 / 1229
页数:7
相关论文
共 40 条
[31]   Rosetta FlexPepDockab-initio: Simultaneous Folding, Docking and Refinement of Peptides onto Their Receptors [J].
Raveh, Barak ;
London, Nir ;
Zimmerman, Lior ;
Schueler-Furman, Ora .
PLOS ONE, 2011, 6 (04)
[32]   PEP-SiteFinder: a tool for the blind identification of peptide binding sites on protein surfaces [J].
Saladin, Adrien ;
Rey, Julien ;
Thevenet, Pierre ;
Zacharias, Martin ;
Moroy, Gautier ;
Tuffery, Pierre .
NUCLEIC ACIDS RESEARCH, 2014, 42 (W1) :W221-W226
[33]  
Vapnik, 2000, NATURE STAT LEARNING
[34]   Protein-Peptide Complex Prediction through Fragment Interaction Patterns [J].
Verschueren, Erik ;
Vanhee, Peter ;
Rousseau, Frederic ;
Schymkowitz, Joost ;
Serrano, Luis .
STRUCTURE, 2013, 21 (05) :789-797
[35]   Synthetic therapeutic peptides: science and market [J].
Vlieghe, Patrick ;
Lisowski, Vincent ;
Martinez, Jean ;
Khrestchatisky, Michel .
DRUG DISCOVERY TODAY, 2010, 15 (1-2) :40-56
[36]   Predicting Peptide Binding Sites on Protein Surfaces by Clustering Chemical Interactions [J].
Yan, Chengfei ;
Zou, Xiaoqin .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 2015, 36 (01) :49-61
[37]   BioLiP: a semi-manually curated database for biologically relevant ligand-protein interactions [J].
Yang, Jianyi ;
Roy, Ambrish ;
Zhang, Yang .
NUCLEIC ACIDS RESEARCH, 2013, 41 (D1) :D1096-D1103
[38]  
Yen SJ, 2006, LECT NOTES CONTR INF, V344, P731
[39]   The PickPocket method for predicting binding specificities for receptors based on receptor pocket similarities: application to MHC-peptide binding [J].
Zhang, Hao ;
Lund, Ole ;
Nielsen, Morten .
BIOINFORMATICS, 2009, 25 (10) :1293-1299
[40]   Analysis and Prediction of RNA-Binding Residues Using Sequence, Evolutionary Conservation, and Predicted Secondary Structure and Solvent Accessibility [J].
Zhang, Tuo ;
Zhang, Hua ;
Chen, Ke ;
Ruan, Jishou ;
Shen, Shiyi ;
Kurgan, Lukasz .
CURRENT PROTEIN & PEPTIDE SCIENCE, 2010, 11 (07) :609-628