Performance of Deep and Shallow Neural Networks, the Universal Approximation Theorem, Activity Cliffs, and QSAR

被引：83

作者：

Winkler, David A. ^{[1
,2
,3
,4
]}

Le, Tu C. ^{[1
]}

机构：

[1] CSIRO Mfg, Clayton, Vic 3168, Australia

[2] Monash Univ, Monash Inst Pharmaceut Sci, Parkville, Vic 3052, Australia

[3] La Trobe Univ, Latrobe Inst Mol Sci, Bundoora, Vic 3082, Australia

[4] Flinders Univ S Australia, Sch Chem & Phys Sci, Bedford Pk, SA 5042, Australia

来源：

MOLECULAR INFORMATICS | 2017年 / 36卷 / 1-2期

关键词：

deep learning; deep neural network; shallow neural network; Bayesian regularized neural network; universal approximation theorem; activity cliff; DESCRIPTOR SELECTION; DISCOVERY; CLASSIFICATION; MODELS;

D O I：

10.1002/minf.201600118

中图分类号：

R914 [药物化学];

学科分类号：

100701 ;

摘要：

Neural networks have generated valuable Quantitative Structure-Activity/Property Relationships (QSAR/QSPR) models for a wide variety of small molecules and materials properties. They have grown in sophistication and many of their initial problems have been overcome by modern mathematical techniques. QSAR studies have almost always used so-called "shallow" neural networks in which there is a single hidden layer between the input and output layers. Recently, a new and potentially paradigm-shifting type of neural network based on Deep Learning has appeared. Deep learning methods have generated impressive improvements in image and voice recognition, and are now being applied to QSAR and QSAR modelling. This paper describes the differences in approach between deep and shallow neural networks, compares their abilities to predict the properties of test sets for 15 large drug data sets (the kaggle set), discusses the results in terms of the Universal Approximation theorem for neural networks, and describes how DNN may ameliorate or remove troublesome "activity cliffs" in QSAR data sets.

引用

页数：6

共 44 条

[1] Beware of R2: Simple, Unambiguous Assessment of the Prediction Accuracy of QSAR and QSPR Models
Alexander, D. L. J.
Tropsha, A.
Winkler, David A.
[J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2015, 55 (07) : 1316 - 1322
[2] Deep learning for computational biology
Angermueller, Christof
Parnamaa, Tanel
Parts, Leopold
Stegle, Oliver
[J]. MOLECULAR SYSTEMS BIOLOGY, 2016, 12 (07)
[3] A renaissance of neural networks in drug discovery
Baskin, Igor I.
Winkler, David
Tetko, Igor V.
[J]. EXPERT OPINION ON DRUG DISCOVERY, 2016, 11 (08) : 785 - 795
[4] Crowdsourcing in pharma: a strategic framework
Bentzien, Joerg
Bharadwaj, Ragu
Thompson, David C.
[J]. DRUG DISCOVERY TODAY, 2015, 20 (07) : 874 - 883
[5] Optimal Sparse Descriptor Selection for QSAR Using Bayesian Methods
Burden, F. R.
Winkler, D. A.
[J]. QSAR & COMBINATORIAL SCIENCE, 2009, 28 (6-7): : 645 - 653
[6] New QSAR methods applied to structure-activity mapping and combinatorial chemistry
Burden, FR
Winkler, DA
[J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1999, 39 (02): : 236 - 242
[7] Robust QSAR models using Bayesian regularized neural networks
Burden, FR
Winkler, DA
[J]. JOURNAL OF MEDICINAL CHEMISTRY, 1999, 42 (16) : 3183 - 3187
[8] Burden Frank, 2008, V458, P25
[9] Relevance Vector Machines: Sparse Classification Methods for QSAR
Burden, Frank R.
Winkler, David A.
[J]. JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2015, 55 (08) : 1529 - 1534
[10] An Optimal Self-Pruning Neural Network and Nonlinear Descriptor Selection in QSAR
Burden, Frank R.
Winkler, David A.
[J]. QSAR & COMBINATORIAL SCIENCE, 2009, 28 (10): : 1092 - 1097

← 1 2 3 4 5 →