Universality of deep convolutional neural networks

Cited by: 367

Authors:

Zhou, Ding-Xuan [1,2]

Affiliations:

[1] City Univ Hong Kong, Sch Data Sci, Kowloon, Hong Kong, Peoples R China

[2] City Univ Hong Kong, Dept Math, Kowloon, Hong Kong, Peoples R China

Source:

APPLIED AND COMPUTATIONAL HARMONIC ANALYSIS | 2020 / Volume 48 / Issue 02

Keywords:

Deep learning; Convolutional neural network; Universality; Approximation theory; MULTILAYER FEEDFORWARD NETWORKS; OPTIMAL APPROXIMATION; BOUNDS;

DOI:

10.1016/j.acha.2019.06.004

Chinese Library Classification (CLC) number:

O29 [Applied Mathematics];

Subject classification code:

070104;

Abstract:

Deep learning has been widely applied and has brought breakthroughs in speech recognition, computer vision, and many other domains. Deep neural network architectures and computational issues have been well studied in machine learning. However, a theoretical foundation is lacking for understanding the approximation or generalization ability of deep learning methods generated by network architectures such as deep convolutional neural networks. Here we show that a deep convolutional neural network (CNN) is universal, meaning that it can be used to approximate any continuous function to an arbitrary accuracy when the depth of the neural network is large enough. This answers an open question in learning theory. Our quantitative estimate, given tightly in terms of the number of free parameters to be computed, verifies the efficiency of deep CNNs in dealing with large dimensional data. Our study also demonstrates the role of convolutions in deep CNNs. (C) 2019 Elsevier Inc. All rights reserved.

Pages: 787-794

Page count: 8

