Gaussian processes:: A method for automatic QSAR Modeling of ADME properties

被引:166
作者
Obrezanova, Olga
Csanyi, Gabor
Gola, Joelle M. R.
Segall, Matthew D.
机构
[1] BioFocus DPI, Cambridge CB4 0GD, England
[2] Univ Cambridge, Cavendish Lab, Cambridge CB3 0HE, England
关键词
D O I
10.1021/ci7000633
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
In this article, we discuss the application of the Gaussian Process method for the prediction of absorption, distribution, metabolism, and excretion (ADME) properties. On the basis of a Bayesian probabilistic approach, the method is widely used in the field of machine learning but has rarely been applied in quantitative structure-activity relationship and ADME modeling. The rnethod is suitable for modeling nonlinear relationships, does not require subjective determination of the model parameters, works for a large number of descriptors, and is inherently resistant to overtraining. The performance of Gaussian Processes compares well with and often exceeds that of artificial neural networks. Due to these features, the Gaussian Processes technique is eminently suitable for automatic model generation-one of the demands of modem drug discovery. Here, we describe the basic concept of the method in the context of regression problems and illustrate its application to the modeling of several ADME properties: blood-brain barrier, hERG inhibition, and aqueous solubility at pH 7.4. We also compare Gaussian Processes with other modeling techniques.
引用
收藏
页码:1847 / 1857
页数:11
相关论文
共 31 条
  • [1] Quantitative structure - Activity relationship studies using gaussian processes
    Burden, FR
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2001, 41 (03): : 830 - 835
  • [2] Use of automatic relevance determination in QSAR studies using Bayesian neural networks
    Burden, FR
    Ford, MG
    Whitley, DC
    Winkler, DA
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2000, 40 (06): : 1423 - 1430
  • [3] Unsupervised data base clustering based on Daylight's fingerprint and Tanimoto similarity: A fast and automated way to cluster small and large data sets
    Butina, D
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1999, 39 (04): : 747 - 750
  • [4] Automated QSPR through competitive workflow
    Cartmell, J
    Enoch, S
    Krstajic, D
    Leahy, DE
    [J]. JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2005, 19 (11) : 821 - 833
  • [5] Gaussian process: An efficient technique to solve quantitative structure-property relationship problems
    Enot, DP
    Gautier, R
    Le Marouille, JY
    [J]. SAR AND QSAR IN ENVIRONMENTAL RESEARCH, 2001, 12 (05) : 461 - 469
  • [6] Everitt B. S, 2001, APPL MULTIVARIATE DA
  • [7] *GAUSS PROC RES, GAUSS PROC
  • [8] QSAR - How good is it in practice? Comparison of descriptor sets on an unbiased cross section of corporate data sets
    Gedeck, Peter
    Rohde, Bernhard
    Bartels, Christian
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2006, 46 (05) : 1924 - 1936
  • [9] Gibbs M., 1997, EFFICIENT IMPLEMENTA
  • [10] Beware of q2!
    Golbraikh, A
    Tropsha, A
    [J]. JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 2002, 20 (04) : 269 - 276