Performance of Deep and Shallow Neural Networks, the Universal Approximation Theorem, Activity Cliffs, and QSAR

被引:83
作者
Winkler, David A. [1 ,2 ,3 ,4 ]
Le, Tu C. [1 ]
机构
[1] CSIRO Mfg, Clayton, Vic 3168, Australia
[2] Monash Univ, Monash Inst Pharmaceut Sci, Parkville, Vic 3052, Australia
[3] La Trobe Univ, Latrobe Inst Mol Sci, Bundoora, Vic 3082, Australia
[4] Flinders Univ S Australia, Sch Chem & Phys Sci, Bedford Pk, SA 5042, Australia
关键词
deep learning; deep neural network; shallow neural network; Bayesian regularized neural network; universal approximation theorem; activity cliff; DESCRIPTOR SELECTION; DISCOVERY; CLASSIFICATION; MODELS;
D O I
10.1002/minf.201600118
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Neural networks have generated valuable Quantitative Structure-Activity/Property Relationships (QSAR/QSPR) models for a wide variety of small molecules and materials properties. They have grown in sophistication and many of their initial problems have been overcome by modern mathematical techniques. QSAR studies have almost always used so-called "shallow" neural networks in which there is a single hidden layer between the input and output layers. Recently, a new and potentially paradigm-shifting type of neural network based on Deep Learning has appeared. Deep learning methods have generated impressive improvements in image and voice recognition, and are now being applied to QSAR and QSAR modelling. This paper describes the differences in approach between deep and shallow neural networks, compares their abilities to predict the properties of test sets for 15 large drug data sets (the kaggle set), discusses the results in terms of the Universal Approximation theorem for neural networks, and describes how DNN may ameliorate or remove troublesome "activity cliffs" in QSAR data sets.
引用
收藏
页数:6
相关论文
共 44 条
  • [1] Beware of R2: Simple, Unambiguous Assessment of the Prediction Accuracy of QSAR and QSPR Models
    Alexander, D. L. J.
    Tropsha, A.
    Winkler, David A.
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2015, 55 (07) : 1316 - 1322
  • [2] Deep learning for computational biology
    Angermueller, Christof
    Parnamaa, Tanel
    Parts, Leopold
    Stegle, Oliver
    [J]. MOLECULAR SYSTEMS BIOLOGY, 2016, 12 (07)
  • [3] A renaissance of neural networks in drug discovery
    Baskin, Igor I.
    Winkler, David
    Tetko, Igor V.
    [J]. EXPERT OPINION ON DRUG DISCOVERY, 2016, 11 (08) : 785 - 795
  • [4] Crowdsourcing in pharma: a strategic framework
    Bentzien, Joerg
    Bharadwaj, Ragu
    Thompson, David C.
    [J]. DRUG DISCOVERY TODAY, 2015, 20 (07) : 874 - 883
  • [5] Optimal Sparse Descriptor Selection for QSAR Using Bayesian Methods
    Burden, F. R.
    Winkler, D. A.
    [J]. QSAR & COMBINATORIAL SCIENCE, 2009, 28 (6-7): : 645 - 653
  • [6] New QSAR methods applied to structure-activity mapping and combinatorial chemistry
    Burden, FR
    Winkler, DA
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1999, 39 (02): : 236 - 242
  • [7] Robust QSAR models using Bayesian regularized neural networks
    Burden, FR
    Winkler, DA
    [J]. JOURNAL OF MEDICINAL CHEMISTRY, 1999, 42 (16) : 3183 - 3187
  • [8] Burden Frank, 2008, V458, P25
  • [9] Relevance Vector Machines: Sparse Classification Methods for QSAR
    Burden, Frank R.
    Winkler, David A.
    [J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2015, 55 (08) : 1529 - 1534
  • [10] An Optimal Self-Pruning Neural Network and Nonlinear Descriptor Selection in QSAR
    Burden, Frank R.
    Winkler, David A.
    [J]. QSAR & COMBINATORIAL SCIENCE, 2009, 28 (10): : 1092 - 1097