Nearest Neighbor Gaussian Process for Quantitative Structure-Activity Relationships

被引:5
|
作者
DiFranzo, Anthony [1 ]
Sheridan, Robert P. [2 ]
Liaw, Andy [3 ]
Tudor, Matthew [1 ]
机构
[1] Merck & Co Inc, Computat & Struct Chem, West Point, PA 19486 USA
[2] Merck & Co Inc, Computat & Struct Chem, Kenilworth, NJ 07033 USA
[3] Merck & Co Inc, Biometr Res, Rahway, NJ 07065 USA
关键词
ACTIVITY-RELATIONSHIP MODELS; LOCAL LAZY REGRESSION; QSAR; IMPROVE;
D O I
10.1021/acs.jcim.0c00678
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
While Gaussian process models are typically restricted to smaller data sets, we propose a variation which extends its applicability to the larger data sets common in the industrial drug discovery space, making it relatively novel in the quantitative structure-activity relationship (QSAR) field. By incorporating locality-sensitive hashing for fast nearest neighbor searches, the nearest neighbor Gaussian process model makes predictions with time complexity that is sub-linear with the sample size. The model can be efficiently built, permitting rapid updates to prevent degradation as new data is collected. Given its small number of hyperparameters, it is robust against overfitting and generalizes about as well as other common QSAR models. Like the usual Gaussian process model, it natively produces principled and well-calibrated uncertainty estimates on its predictions. We compare this new model with implementations of random forest, light gradient boosting, and k-nearest neighbors to highlight these promising advantages. The code for the nearest neighbor Gaussian process is available at https://github.com/Merck/nngp.
引用
收藏
页码:4653 / 4663
页数:11
相关论文
共 50 条
  • [1] Feature selection in quantitative structure-activity relationships
    Walters, WP
    Goldman, BB
    CURRENT OPINION IN DRUG DISCOVERY & DEVELOPMENT, 2005, 8 (03) : 329 - 333
  • [2] Quantitative structure-activity relationships and the prediction of MHC supermotifs
    Doytchinova, IA
    Guan, PP
    Flower, DR
    METHODS, 2004, 34 (04) : 444 - 453
  • [3] Quantitative structure-activity relationships of sweet isovanillyl derivatives
    Bassoli, A
    Drew, MGB
    Hattotuwagama, CK
    Merlini, L
    Morini, G
    Wilden, GRW
    QUANTITATIVE STRUCTURE-ACTIVITY RELATIONSHIPS, 2001, 20 (01): : 3 - 16
  • [4] Quantitative structure-activity relationships for organophosphates binding to acetylcholinesterase
    Ruark, Christopher D.
    Hack, C. Eric
    Robinson, Peter J.
    Anderson, Paul E.
    Gearhart, Jeffery M.
    ARCHIVES OF TOXICOLOGY, 2013, 87 (02) : 281 - 289
  • [5] Quantitative Structure-Activity Relationships of Aquatic Narcosis: A Review
    Adhikari, Chandana
    Mishra, Bijay Kumar
    CURRENT COMPUTER-AIDED DRUG DESIGN, 2018, 14 (01) : 7 - 28
  • [6] Descriptor Selection Improvements for Quantitative Structure-Activity Relationships
    Xia, Liang-Yong
    Wang, Qing-Yong
    Cao, Zehong
    Liang, Yong
    INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2019, 29 (09)
  • [7] Quantitative structure-activity relationships of α1 adrenergic antagonists
    Eric, S
    Solmajer, T
    Zupan, J
    Novic, M
    Oblak, M
    Agbaba, D
    JOURNAL OF MOLECULAR MODELING, 2004, 10 (02) : 139 - 150
  • [8] QUANTITATIVE STRUCTURE-ACTIVITY RELATIONSHIPS (QSAR) STUDIES OF BISBENZAMIDINES WITH ANTIFUNGAL ACTIVITY
    de Almeida, Vera L.
    Dias Lopes, Julio Cesar
    Oliveira, Sheila Rodrigues
    Donnici, Claudio L.
    Montanari, Carlos A.
    QUIMICA NOVA, 2010, 33 (07): : 1482 - 1489
  • [9] Quantitative Structure-activity Relationships; Studying the Toxicity of Metal Nanoparticles
    Gao, Yuting
    Zhai, Honglin
    She, Xilin
    Si, Hongzong
    CURRENT TOPICS IN MEDICINAL CHEMISTRY, 2020, 20 (27) : 2506 - 2517
  • [10] Quantitative Structure-activity Relationships for μ-conotoxin GIIIA and its Derivatives
    Ding Xiaoqin
    Chen Changying
    Li Yulin
    Chen Jisheng
    Luo Yu
    Lai Luhua
    Xu Xiaojie
    ACTA PHYSICO-CHIMICA SINICA, 1996, 12 (06) : 485 - 490