Approximating smooth functions by deep neural networks with sigmoid activation function

Cited by: 44

Authors:

Langer, Sophie [1]

Affiliations:

[1] Tech Univ Darmstadt, Fachbereich Math, Schlossgartenstr 7, D-64289 Darmstadt, Germany

Source:

JOURNAL OF MULTIVARIATE ANALYSIS | 2021 / Vol. 182

Keywords:

Deep learning; Full connectivity; Neural networks; Uniform approximation;

DOI:

10.1016/j.jmva.2020.104696

Chinese Library Classification (CLC):

O21 [Probability theory and mathematical statistics]; C8 [Statistics];

Discipline classification codes:

020208 ; 070103 ; 0714 ;

Abstract:

We study the power of deep neural networks (DNNs) with sigmoid activation function. Recently, it was shown that DNNs approximate any d-dimensional, smooth function on a compact set with a rate of order $W^{-p/d}$, where $W$ is the number of nonzero weights in the network and $p$ is the smoothness of the function. Unfortunately, these rates only hold for a special class of sparsely connected DNNs. We ask ourselves if we can show the same approximation rate for a simpler and more general class, i.e., DNNs which are only defined by its width and depth. In this article we show that DNNs with fixed depth and a width of order $M^{d}$ achieve an approximation rate of $M^{-2p}$. As a conclusion we quantitatively characterize the approximation power of DNNs in terms of the overall weights $W_0$ in the network and show an approximation rate of $W_0^{-p/d}$. This more general result finally helps us to understand which network topology guarantees a special target accuracy. © 2020 Elsevier Inc. All rights reserved.
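
The two rates quoted in the abstract can be connected by a short weight count. The sketch below is not taken from the article. It assumes that a fully connected network with fixed depth $L$ and width $r$ has on the order of $L r^{2}$ weights, and it ignores all constants and lower-order terms. The symbols $L$ and $r$ are introduced here for illustration only.

```latex
% Sketch only: constants and lower-order terms are ignored.
% Assumption (not stated in the abstract): a fully connected network with
% fixed depth L and width r has on the order of L r^2 weights.
\begin{align*}
r &\asymp M^{d}, \quad L \text{ fixed}
  &&\Longrightarrow\quad W_0 \asymp L\, r^{2} \asymp M^{2d}, \\
M &\asymp W_0^{1/(2d)}
  &&\Longrightarrow\quad M^{-2p} \asymp \bigl(W_0^{1/(2d)}\bigr)^{-2p} = W_0^{-p/d}.
\end{align*}
```

Read the other way round, under the same assumptions a target accuracy $\varepsilon$ would call for roughly $W_0 \asymp \varepsilon^{-d/p}$ weights.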


Pages: 21

