Identification of binding pockets in protein structures using a knowledge-based potential derived from local structural similarities

被引:16
作者
Bianchi, Valerio [1 ]
Gherardini, Pier Federico [1 ]
Helmer-Citterich, Manuela [1 ]
Ausiello, Gabriele [1 ]
机构
[1] Univ Roma Tor Vergata, Dept Biol, Ctr Mol Bioinformat, I-00133 Rome, Italy
关键词
SITE PREDICTION; FUNCTIONAL RESIDUES; SERVER; CONSERVATION; SURFACE; WEB;
D O I
10.1186/1471-2105-13-S4-S17
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The identification of ligand binding sites is a key task in the annotation of proteins with known structure but uncharacterized function. Here we describe a knowledge-based method exploiting the observation that unrelated binding sites share small structural motifs that bind the same chemical fragments irrespective of the nature of the ligand as a whole. Results: PDBinder compares a query protein against a library of binding and non-binding protein surface regions derived from the PDB. The results of the comparison are used to derive a propensity value for each residue which is correlated with the likelihood that the residue is part of a ligand binding site. The method was applied to two different problems: i) the prediction of ligand binding residues and ii) the identification of which surface cleft harbours the binding site. In both cases PDBinder performed consistently better than existing methods. PDBinder has been trained on a non-redundant set of 1356 high-quality protein-ligand complexes and tested on a set of 239 holo and apo complex pairs. We obtained an MCC of 0.313 on the holo set with a PPV of 0.413 while on the apo set we achieved an MCC of 0.271 and a PPV of 0.372. Conclusions: We show that PDBinder performs better than existing methods. The good performance on the unbound proteins is extremely important for real-world applications where the location of the binding site is unknown. Moreover, since our approach is orthogonal to those used in other programs, the PDBinder propensity value can be integrated in other algorithms further increasing the final performance.
引用
收藏
页数:13
相关论文
共 34 条
[1]   Network analysis of protein structures identifies functional residues [J].
Amitai, G ;
Shemesh, A ;
Sitbon, E ;
Shklar, M ;
Netanely, D ;
Venger, I ;
Pietrokovski, S .
JOURNAL OF MOLECULAR BIOLOGY, 2004, 344 (04) :1135-1146
[2]  
[Anonymous], BLASTCLUST SEQUENCE
[3]  
[Anonymous], Q SITEFINDER WEBSERV
[4]   Query3d: a new method for high-throughput analysis of functional residues in protein structures [J].
Ausiello, G ;
Via, A ;
Helmer-Citterich, M .
BMC BIOINFORMATICS, 2005, 6
[5]   pdbFun: mass selection and fast comparison of annotated PDB residues [J].
Ausiello, G ;
Zanzoni, A ;
Peluso, D ;
Via, A ;
Helmer-Citterich, M .
NUCLEIC ACIDS RESEARCH, 2005, 33 :W133-W137
[6]   Structural motifs recurring in different folds recognize the same ligand fragments [J].
Ausiello, Gabriele ;
Gherardini, Pier Federico ;
Gatti, Elena ;
Incani, Ottaviano ;
Helmer-Citterich, Manuela .
BMC BIOINFORMATICS, 2009, 10
[7]   A threading-based method (FINDSITE) for ligand-binding site prediction and functional annotation [J].
Brylinski, Michal ;
Skolnick, Jeffrey .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2008, 105 (01) :129-134
[8]   Predicting Protein Ligand Binding Sites by Combining Evolutionary Sequence Conservation and 3D Structure [J].
Capra, John A. ;
Laskowski, Roman A. ;
Thornton, Janet M. ;
Singh, Mona ;
Funkhouser, Thomas A. .
PLOS COMPUTATIONAL BIOLOGY, 2009, 5 (12)
[9]   LigASite -: a database of biologically relevant binding sites in proteins with known apo-structures [J].
Dessailly, Benoit H. ;
Lensink, Marc F. ;
Orengo, Christine A. ;
Wodak, Shoshana J. .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D667-D673
[10]   Superpose3D: A Local Structural Comparison Program That Allows for User-Defined Structure Representations [J].
Gherardini, Pier Federico ;
Ausiello, Gabriele ;
Helmer-Citterich, Manuela .
PLOS ONE, 2010, 5 (08)