Boosting Docking-Based Virtual Screening with Deep Learning

被引:221
作者
Pereira, Janaina Cruz [1 ]
Caffarena, Ernesto Raul [1 ]
dos Santos, Cicero Nogueira [2 ]
机构
[1] Fiocruz MS, 4365 Ave Brasil, BR-21040900 Rio De Janeiro, RJ, Brazil
[2] IBM Watson, 1101 Kitchawan Rd, Yorktown Hts, NY 10598 USA
关键词
SCORING FUNCTION; DRUG DISCOVERY; PREDICTION; NNSCORE; PAIRS;
D O I
10.1021/acs.jcim.6b00355
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
In this work, we propose a deep learning approach to improve docking-based virtual screening. The deep neural network that is introduced, DeepVS, uses the output of a docking program and learns how to extract relevant features from basic data such as atom and residues types obtained from protein ligand complexes. Our approach introduces the use of atom and amino acid embeddings and implements an effective way of creating distributed vector representations of protein ligand complexes by modeling the compound as a set of atom contexts that is further processed by a convolutional layer. One of the main advantages of the proposed method is that it does not require feature engineering. We evaluate DeepVS on the Directory of Useful Decoys (DUD), using the output of two docking programs: Autodock Vina1.1.2 and Dock 6.6. Using a strict evaluation with leave-one-out cross-validation, DeepVS outperforms the docking programs, with regard to both AUC ROC and enrichment factor. Moreover, using the output of Autodock DeepVS achieves, an AUC ROC of 0.81, which, to the best of our knowledge, is the best AUC reported so far for virtual screening using the 40 receptors from the DUD.
引用
收藏
页码:2495 / 2506
页数:12
相关论文
共 45 条
  • [1] [Anonymous], 2010, P PYTH SCI C
  • [2] [Anonymous], 2014, P COLING 2014 25 INT, DOI DOI 10.1109/ICCAR.2017.7942788
  • [3] Improvement of Virtual Screening Results by Docking Data Feature Analysis
    Arciniega, Marcelino
    Lange, Oliver F.
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2014, 54 (05) : 1401 - 1411
  • [4] A survey of cross-validation procedures for model selection
    Arlot, Sylvain
    Celisse, Alain
    [J]. STATISTICS SURVEYS, 2010, 4 : 40 - 79
  • [5] ElectroShape: fast molecular similarity calculations incorporating shape, chirality and electrostatics
    Armstrong, M. Stuart
    Morris, Garrett M.
    Finn, Paul W.
    Sharma, Raman
    Moretti, Loris
    Cooper, Richard I.
    Richards, W. Graham
    [J]. JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2010, 24 (09) : 789 - 801
  • [6] A machine learning approach to predicting protein-ligand binding affinity with applications to molecular docking
    Ballester, Pedro J.
    Mitchell, John B. O.
    [J]. BIOINFORMATICS, 2010, 26 (09) : 1169 - 1175
  • [7] Representation Learning: A Review and New Perspectives
    Bengio, Yoshua
    Courville, Aaron
    Vincent, Pascal
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) : 1798 - 1828
  • [8] Learning Deep Architectures for AI
    Bengio, Yoshua
    [J]. FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2009, 2 (01): : 1 - 127
  • [9] Protein-based virtual screening of chemical databases. 1. Evaluation of different docking/scoring combinations
    Bissantz, C
    Folkers, G
    Rognan, D
    [J]. JOURNAL OF MEDICINAL CHEMISTRY, 2000, 43 (25) : 4759 - 4767
  • [10] Structure-Based Virtual Screening for Drug Discovery: a Problem-Centric Review
    Cheng, Tiejun
    Li, Qingliang
    Zhou, Zhigang
    Wang, Yanli
    Bryant, Stephen H.
    [J]. AAPS JOURNAL, 2012, 14 (01): : 133 - 141