FINDSITELHM: A Threading-Based Approach to Ligand Homology Modeling

被引:68
作者
Brylinski, Michal [1 ]
Skolnick, Jeffrey [1 ]
机构
[1] Georgia Inst Technol, Sch Biol, Ctr Study Syst Biol, Atlanta, GA 30332 USA
基金
美国国家卫生研究院;
关键词
PREDICTING PROTEIN FUNCTION; MOLECULAR-DOCKING; MECHANICAL CALCULATIONS; ESCHERICHIA-COLI; AUTOMATED-METHOD; LOW-RESOLUTION; FORCE-FIELD; DATA FUSION; SEQUENCE; PROGRAMS;
D O I
10.1371/journal.pcbi.1000405
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Ligand virtual screening is a widely used tool to assist in new pharmaceutical discovery. In practice, virtual screening approaches have a number of limitations, and the development of new methodologies is required. Previously, we showed that remotely related proteins identified by threading often share a common binding site occupied by chemically similar ligands. Here, we demonstrate that across an evolutionarily related, but distant family of proteins, the ligands that bind to the common binding site contain a set of strongly conserved anchor functional groups as well as a variable region that accounts for their binding specificity. Furthermore, the sequence and structure conservation of residues contacting the anchor functional groups is significantly higher than those contacting ligand variable regions. Exploiting these insights, we developed FINDSITELHM that employs structural information extracted from weakly related proteins to perform rapid ligand docking by homology modeling. In large scale benchmarking, using the predicted anchor-binding mode and the crystal structure of the receptor, FINDSITELHM outperforms classical docking approaches with an average ligand RMSD from native of similar to 2.5 angstrom. For weakly homologous receptor protein models, using FINDSITELHM, the fraction of recovered binding residues and specific contacts is 0.66 (0.55) and 0.49 (0.38) for highly confident (all) targets, respectively. Finally, in virtual screening for HIV-1 protease inhibitors, using similarity to the ligand anchor region yields significantly improved enrichment factors. Thus, the rather accurate, computationally inexpensive FINDSITELHM algorithm should be a useful approach to assist in the discovery of novel biopharmaceuticals.
引用
收藏
页数:21
相关论文
共 80 条
  • [71] Docking of small ligands to low-resolution and theoretically predicted receptor structures
    Wojciechowski, M
    Skolnick, J
    [J]. JOURNAL OF COMPUTATIONAL CHEMISTRY, 2002, 23 (01) : 189 - 197
  • [72] LOMETS: A local meta-threading-server for protein structure prediction
    Wu, Sitao
    Zhang, Yang
    [J]. NUCLEIC ACIDS RESEARCH, 2007, 35 (10) : 3375 - 3382
  • [73] In silico elucidation of the molecular mechanism defining the adverse effect of selective estrogen receptor modulators
    Xie, Lei
    Wang, Jian
    Bourne, Philip E.
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2007, 3 (11) : 2324 - 2332
  • [74] Similarity search profiling reveals effects of fingerprint scaling in virtual screening
    Xue, L
    Stahura, FL
    Bajorath, E
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2004, 44 (06): : 2032 - 2039
  • [75] Similarity search profiles as a diagnostic tool for the analysis of virtual screening calculations
    Xue, L
    Godden, JW
    Stahura, FL
    Bajorath, J
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2004, 44 (04): : 1275 - 1281
  • [76] Design and evaluation of a molecular fingerprint involving the transformation of property descriptor values into a binary classification scheme
    Xue, L
    Godden, JW
    Stahura, FL
    Bajorath, J
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2003, 43 (04): : 1151 - 1157
  • [77] TASSER: An automated method for the prediction of protein tertiary structures in CASP6
    Zhang, Y
    Arakaki, AK
    Skolnick, JR
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2005, 61 : 91 - 98
  • [78] On the origin and highly likely completeness of single-domain protein structures
    Zhang, Y
    Hubner, IA
    Arakaki, AK
    Shakhnovich, E
    Skolnick, J
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2006, 103 (08) : 2605 - 2610
  • [79] TM-align: a protein structure alignment algorithm based on the TM-score
    Zhang, Y
    Skolnick, J
    [J]. NUCLEIC ACIDS RESEARCH, 2005, 33 (07) : 2302 - 2309
  • [80] MDL DRUG DATA REPORT, P32901