Nearest Neighbor Gaussian Process for Quantitative Structure-Activity Relationships

被引:5
作者
DiFranzo, Anthony [1 ]
Sheridan, Robert P. [2 ]
Liaw, Andy [3 ]
Tudor, Matthew [1 ]
机构
[1] Merck & Co Inc, Computat & Struct Chem, West Point, PA 19486 USA
[2] Merck & Co Inc, Computat & Struct Chem, Kenilworth, NJ 07033 USA
[3] Merck & Co Inc, Biometr Res, Rahway, NJ 07065 USA
关键词
ACTIVITY-RELATIONSHIP MODELS; LOCAL LAZY REGRESSION; QSAR; IMPROVE;
D O I
10.1021/acs.jcim.0c00678
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
While Gaussian process models are typically restricted to smaller data sets, we propose a variation which extends its applicability to the larger data sets common in the industrial drug discovery space, making it relatively novel in the quantitative structure-activity relationship (QSAR) field. By incorporating locality-sensitive hashing for fast nearest neighbor searches, the nearest neighbor Gaussian process model makes predictions with time complexity that is sub-linear with the sample size. The model can be efficiently built, permitting rapid updates to prevent degradation as new data is collected. Given its small number of hyperparameters, it is robust against overfitting and generalizes about as well as other common QSAR models. Like the usual Gaussian process model, it natively produces principled and well-calibrated uncertainty estimates on its predictions. We compare this new model with implementations of random forest, light gradient boosting, and k-nearest neighbors to highlight these promising advantages. The code for the nearest neighbor Gaussian process is available at https://github.com/Merck/nngp.
引用
收藏
页码:4653 / 4663
页数:11
相关论文
共 50 条
  • [41] Quantitative structure-activity relationships for prediction of the toxicity of hydroxylated and quinoid PCB metabolites
    Niu, Junfeng
    Long, Xingxing
    Shi, Shuqiong
    [J]. JOURNAL OF MOLECULAR MODELING, 2007, 13 (01) : 163 - 169
  • [42] Quantitative structure-activity relationships to predict sweet and non-sweet tastes
    Rojas, Cristian
    Ballabio, Davide
    Consonni, Viviana
    Tripaldi, Piercosimo
    Mauri, Andrea
    Todeschini, Roberto
    [J]. THEORETICAL CHEMISTRY ACCOUNTS, 2016, 135 (03) : 1 - 13
  • [43] Evolutionary neural networks in quantitative structure-activity relationships of dihydrofolate reductase inhibitors
    Kyngas, J
    Valjakka, J
    [J]. QUANTITATIVE STRUCTURE-ACTIVITY RELATIONSHIPS, 1996, 15 (04): : 296 - 301
  • [44] Prediction of environmental toxicity and fate using quantitative structure-activity relationships (QSARs)
    Dearden, JC
    [J]. JOURNAL OF THE BRAZILIAN CHEMICAL SOCIETY, 2002, 13 (06) : 754 - 762
  • [45] Quantitative structure-activity relationships (QSAR) of aroma compounds in different aged Huangjiu
    Feng, Tao
    Hu, Zhongshan
    Chen, Ling
    Chen, Da
    Wang, Xu
    Yao, Lingyun
    Sun, Min
    Song, Shiqing
    Wang, Huatian
    [J]. JOURNAL OF FOOD SCIENCE, 2020, 85 (10) : 3273 - 3281
  • [46] Ortho effects in quantitative structure-activity relationships for acetylcholinesterase inhibition by aryl carbamates
    Lin, G
    Liu, YC
    Lin, YF
    Wu, YG
    [J]. JOURNAL OF ENZYME INHIBITION AND MEDICINAL CHEMISTRY, 2004, 19 (05) : 395 - 401
  • [47] Quantitative Structure-Activity/Property Relationships: The Ubiquitous Links between Cause and Effect
    Berhanu, Workalemahu M.
    Pillai, Girinath G.
    Oliferenko, Alexander A.
    Katritzky, Alan R.
    [J]. CHEMPLUSCHEM, 2012, 77 (07): : 507 - 517
  • [48] Quantitative structure-activity relationships for green algae growth inhibition by polymer particles
    Nolte, Tom M.
    Peijnenburg, Willie J. G. M.
    Hendriks, A. Jan.
    van de Meent, Dik
    [J]. CHEMOSPHERE, 2017, 179 : 49 - 56
  • [49] Quantitative structure-activity relationships for estrogen receptor binding affinity of phenolic chemicals
    Hu, JY
    Aizawa, T
    [J]. WATER RESEARCH, 2003, 37 (06) : 1213 - 1222
  • [50] Meta-heuristics on quantitative structure-activity relationships: study on polychlorinated biphenyls
    Jaentschi, Lorentz
    Bolboaca, Sorana D.
    Sestras, Radu E.
    [J]. JOURNAL OF MOLECULAR MODELING, 2010, 16 (02) : 377 - 386