A new protein binding pocket similarity measure based on comparison of clouds of atoms in 3D: application to ligand prediction

Cited by: 68
Authors
Hoffmann, Brice [1 ,3 ,4 ]
Zaslavskiy, Mikhail [1 ,2 ,3 ,4 ]
Vert, Jean-Philippe [1 ,3 ,4 ]
Stoven, Veronique [1 ,3 ,4 ]
Affiliations
[1] Mines ParisTech, Ctr Computat Biol, F-77300 Fontainebleau, France
[2] Ctr Math Morphol, F-77300 Fontainebleau, France
[3] Inst Curie, F-75248 Paris, France
[4] INSERM, U900, F-75248 Paris, France
Source
BMC BIOINFORMATICS | 2010 / Vol. 11
Keywords
SITES; RECOGNITION; ALIGNMENT; SEARCH;
DOI
10.1186/1471-2105-11-99
Chinese Library Classification number
Q5 [Biochemistry];
Subject classification code
071010 ; 081704 ;
Abstract
Background: Predicting which molecules can bind to a given binding site of a protein with known 3D structure is important for deciphering the protein's function, and useful in drug design. A classical assumption in structural biology is that proteins with similar 3D structures have related molecular functions, and therefore may bind similar ligands. However, proteins that do not display any overall sequence or structure similarity may also bind similar ligands if they contain similar binding sites. Quantitatively assessing the similarity between binding sites may therefore be useful to propose new ligands for a given pocket, based on those known for similar pockets.

Results: We propose a new method to quantify the similarity between binding pockets, and explore its relevance for ligand prediction. We represent each pocket by a cloud of atoms, and assess the similarity between two pockets by aligning their atoms in 3D space and comparing the resulting configurations with a convolution kernel. Pocket alignment and comparison are possible even when the corresponding proteins share no sequence or overall structure similarity. To predict ligands for a given target pocket, we compare it to an ensemble of pockets with known ligands to identify the most similar pockets. We discuss two criteria to evaluate the performance of a binding pocket similarity measure in the context of ligand prediction, namely the area under the ROC curve (AUC scores) and classification-based scores. We show that the latter is better suited to evaluating the methods with respect to ligand prediction, and demonstrate the relevance of our new binding site similarity measure compared to existing similarity measures.

Conclusions: This study demonstrates the relevance of the proposed method for identifying ligands that bind to known binding pockets. We also provide a new benchmark for future work in this field. The new method and the benchmark are available at http://cbio.ensmp.fr/paris/.
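The idea of comparing two atom clouds with a convolution kernel and then ranking known pockets by similarity can be illustrated with a minimal sketch. It is not the authors' implementation: the Gaussian form of the kernel, the `sigma` bandwidth, the function names, and the ligand labels are all assumptions made for illustration, and the clouds are assumed to be already superimposed (the alignment step described in the abstract is not shown).

```python
import math

def cloud_kernel(a, b, sigma=1.0):
    """Sum of Gaussian terms over all atom pairs, one atom from each cloud.

    Assumption: a Gaussian of the inter-atom distance; the abstract only
    states that a convolution kernel is used.
    """
    total = 0.0
    for (x1, y1, z1) in a:
        for (x2, y2, z2) in b:
            d2 = (x1 - x2) ** 2 + (y1 - y2) ** 2 + (z1 - z2) ** 2
            total += math.exp(-d2 / (2.0 * sigma ** 2))
    return total

def similarity(a, b, sigma=1.0):
    """Normalized kernel; at most 1, and equal to 1 for identical clouds."""
    return cloud_kernel(a, b, sigma) / math.sqrt(
        cloud_kernel(a, a, sigma) * cloud_kernel(b, b, sigma)
    )

def predict_ligand(target, library, sigma=1.0):
    """Return the ligand label of the library pocket most similar to target.

    `library` is a list of (atom_cloud, ligand_label) pairs (hypothetical layout).
    """
    best = max(library, key=lambda entry: similarity(target, entry[0], sigma))
    return best[1]

# Toy data: coordinates and labels are invented for the example.
target = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0)]
near = [(0.1, 0.0, 0.0), (1.1, 0.0, 0.0), (0.0, 1.1, 0.0)]
far = [(10.0, 10.0, 10.0), (11.0, 10.0, 10.0)]
library = [(near, "ligand_A"), (far, "ligand_B")]
predicted = predict_ligand(target, library)
```

Normalizing by the self-similarities keeps the score comparable across pockets with different numbers of atoms, which matters when ranking a whole library against one target.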
Pages: 16