Nearest Neighbor Gaussian Process for Quantitative Structure-Activity Relationships

被引:5
作者
DiFranzo, Anthony [1 ]
Sheridan, Robert P. [2 ]
Liaw, Andy [3 ]
Tudor, Matthew [1 ]
机构
[1] Merck & Co Inc, Computat & Struct Chem, West Point, PA 19486 USA
[2] Merck & Co Inc, Computat & Struct Chem, Kenilworth, NJ 07033 USA
[3] Merck & Co Inc, Biometr Res, Rahway, NJ 07065 USA
关键词
ACTIVITY-RELATIONSHIP MODELS; LOCAL LAZY REGRESSION; QSAR; IMPROVE;
D O I
10.1021/acs.jcim.0c00678
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
While Gaussian process models are typically restricted to smaller data sets, we propose a variation which extends its applicability to the larger data sets common in the industrial drug discovery space, making it relatively novel in the quantitative structure-activity relationship (QSAR) field. By incorporating locality-sensitive hashing for fast nearest neighbor searches, the nearest neighbor Gaussian process model makes predictions with time complexity that is sub-linear with the sample size. The model can be efficiently built, permitting rapid updates to prevent degradation as new data is collected. Given its small number of hyperparameters, it is robust against overfitting and generalizes about as well as other common QSAR models. Like the usual Gaussian process model, it natively produces principled and well-calibrated uncertainty estimates on its predictions. We compare this new model with implementations of random forest, light gradient boosting, and k-nearest neighbors to highlight these promising advantages. The code for the nearest neighbor Gaussian process is available at https://github.com/Merck/nngp.
引用
收藏
页码:4653 / 4663
页数:11
相关论文
共 50 条
  • [31] Quantitative structure-activity relationships of nitroaromatics toxicity to the algae (Scenedesmus obliguus)
    Yan, XF
    Xiao, HM
    Gong, XD
    Ju, XH
    CHEMOSPHERE, 2005, 59 (04) : 467 - 471
  • [32] Computational methods in developing quantitative structure-activity relationships (QSAR):: A review
    Dudek, AZ
    Arodz, T
    Gálvez, J
    COMBINATORIAL CHEMISTRY & HIGH THROUGHPUT SCREENING, 2006, 9 (03) : 213 - 228
  • [33] Toxicity and quantitative structure-activity relationships of benzoic acids to Pseudokirchneriella subcapitata
    Lee, Po Yi
    Chen, Chung Yuan
    JOURNAL OF HAZARDOUS MATERIALS, 2009, 165 (1-3) : 156 - 161
  • [34] Structure-activity relationships in nitrothiophenes
    Morley, John O.
    Matthews, Thomas P.
    BIOORGANIC & MEDICINAL CHEMISTRY, 2006, 14 (23) : 8099 - 8108
  • [35] Quantitative structure-activity relationships studies for prediction of antimicrobial activity of synthesized disulfonamide derivatives
    Alyar, Saliha
    Ozbek, Neslihan
    Kuzukiran, Kubra
    Karacan, Nurcan
    MEDICINAL CHEMISTRY RESEARCH, 2011, 20 (02) : 175 - 183
  • [36] A perspective on the role of quantitative structure-activity and structure-property relationships in herbicide discovery
    Clark, Robert D.
    PEST MANAGEMENT SCIENCE, 2012, 68 (04) : 513 - 518
  • [37] An Investigation on the Quantitative Structure-Activity Relationships of the Anti-Inflammatory Activity of Diterpenoid Alkaloids
    Li, Xiao
    Li, Ning
    Sui, Zhenyu
    Bi, Kaishun
    Li, Zuojing
    MOLECULES, 2017, 22 (03):
  • [38] Relationships between the structure, cytotoxicity and hydrophobicity of quinazoline derivatives by quantitative structure-activity relationship
    Jantova, S
    Balaz, S
    Stankovsky, S
    Spirkova, K
    Lukacova, V
    FOLIA BIOLOGICA, 1997, 43 (02) : 83 - 89
  • [39] Structure Activity Relationship and Quantitative Structure-Activity Relationships Modeling of Antitrypanosomal Activities of Alkyldiamine Cryptolepine Derivatives
    Belaidi, Salah
    Salah, Toufik
    Melkemi, Nadjib
    Sinha, Leena
    Prasad, Onkar
    JOURNAL OF COMPUTATIONAL AND THEORETICAL NANOSCIENCE, 2015, 12 (09) : 2421 - 2427
  • [40] Quantitative structure-activity relationships (QSAR) for 9-anilinoacridines: a comparative analysis
    Gao, H
    Denny, WA
    Garg, R
    Hansch, C
    CHEMICO-BIOLOGICAL INTERACTIONS, 1998, 116 (03) : 157 - 180