FINDSITEcomb2.0: A New Approach for Virtual Ligand Screening of Proteins and Virtual Target Screening of Biomolecules

被引:32
作者
Zhou, Hongyi [1 ]
Cao, Hongnan [1 ]
Skolnick, Jeffrey [1 ]
机构
[1] Georgia Inst Technol, Sch Biol Sci, Ctr Study Syst Biol, 950 Atlantic Dr NW, Atlanta, GA 30332 USA
基金
美国国家卫生研究院;
关键词
STRUCTURAL ALIGNMENT; KINASE INHIBITORS; BENCHMARKING SETS; SCORING FUNCTION; DRUG DISCOVERY; DOCKING; PREDICTION; INTEGRATION; SEARCH;
D O I
10.1021/acs.jcim.8b00309
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Computational approaches for predicting protein ligand interactions can facilitate drug lead discovery and drug target determination. We have previously developed a threading/structural-based approach, FINDSITEcomb, for the virtual ligand screening of proteins that has been extensively experimentally validated. Even when low resolution predicted protein structures are employed, FINDSITEcomb has the advantage of being faster and more accurate than traditional high-resolution structure-based docking methods. It also overcomes the limitations of traditional QSAR methods that require a known set of seed ligands that bind to the given protein target. Here, we further improve FINDSITEcomb by enhancing its template ligand selection from the PDB/DrugBank/ChEMBL libraries of known protein ligand interactions by (1) parsing the template proteins and their corresponding binding ligands in the DrugBank and ChEMBL libraries into domains so that the ligands with falsely matched domains to the targets will not be selected as template ligands; (2) applying various thresholds to filter out falsely matched template structures in the structure comparison process and thus their corresponding ligands for template ligand selection. With a sequence identity cutoff of 30% of target to templates and modeled target structures, FINDSITEcomb is shown to significantly improve upon FINDSITEcomb on the DUD-E benchmark set by increasing the 1% enrichment factor from 16.7 to 22.1, with a p-value of 4.3 X 10(-3) by the Student t-test. With an 80% sequence identity cutoff of target to templates for the DUD-E set and modeled target structures, FINDSITEcomb2.0, having a 1% ROC enrichment factor of 52.39, also outperforms state-of-the-art methods that employ machine learning such as a deep convolutional neural network, CNN, with an enrichment of 29.65. Thus, FINDSITEcomb2.0 represents a significant improvement in the state-of-the-art.
引用
收藏
页码:2343 / 2354
页数:12
相关论文
共 57 条
[1]   ICM - A NEW METHOD FOR PROTEIN MODELING AND DESIGN - APPLICATIONS TO DOCKING AND STRUCTURE PREDICTION FROM THE DISTORTED NATIVE CONFORMATION [J].
ABAGYAN, R ;
TOTROV, M ;
KUZNETSOV, D .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 1994, 15 (05) :488-506
[2]  
[Anonymous], 2015, 151002855 ARXIV
[3]  
Bateman A, 2004, NUCLEIC ACIDS RES, V32, pD138, DOI [10.1093/nar/gkr1065, 10.1093/nar/gkh121, 10.1093/nar/gkp985]
[4]   PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES [J].
BERNSTEIN, FC ;
KOETZLE, TF ;
WILLIAMS, GJB ;
MEYER, EF ;
BRICE, MD ;
RODGERS, JR ;
KENNARD, O ;
SHIMANOUCHI, T ;
TASUMI, M .
JOURNAL OF MOLECULAR BIOLOGY, 1977, 112 (03) :535-542
[5]   CHARMM - A PROGRAM FOR MACROMOLECULAR ENERGY, MINIMIZATION, AND DYNAMICS CALCULATIONS [J].
BROOKS, BR ;
BRUCCOLERI, RE ;
OLAFSON, BD ;
STATES, DJ ;
SWAMINATHAN, S ;
KARPLUS, M .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 1983, 4 (02) :187-217
[6]   Q-Dock: Low-resolution flexible ligand docking with pocket-specific threading restraints [J].
Brylinski, Michal ;
Skolnick, Jeffrey .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 2008, 29 (10) :1574-1588
[7]   A threading-based method (FINDSITE) for ligand-binding site prediction and functional annotation [J].
Brylinski, Michal ;
Skolnick, Jeffrey .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2008, 105 (01) :129-134
[8]   Comprehensive Structural and Functional Characterization of the Human Kinome by Protein Structure Modeling and Ligand Virtual Screening [J].
Brylinski, Michal ;
Skolnick, Jeffrey .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2010, 50 (10) :1839-1854
[9]   Q-DockLHM: Low-Resolution Refinement for Ligand Comparative Modeling [J].
Brylinski, Michal ;
Skolnick, Jeffrey .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 2010, 31 (05) :1093-1105
[10]   FINDSITELHM: A Threading-Based Approach to Ligand Homology Modeling [J].
Brylinski, Michal ;
Skolnick, Jeffrey .
PLOS COMPUTATIONAL BIOLOGY, 2009, 5 (06)