Universality of deep convolutional neural networks

被引:367
作者
Zhou, Ding-Xuan [1 ,2 ]
机构
[1] City Univ Hong Kong, Sch Data Sci, Kowloon, Hong Kong, Peoples R China
[2] City Univ Hong Kong, Dept Math, Kowloon, Hong Kong, Peoples R China
关键词
Deep learning; Convolutional neural network; Universality; Approximation theory; MULTILAYER FEEDFORWARD NETWORKS; OPTIMAL APPROXIMATION; BOUNDS;
D O I
10.1016/j.acha.2019.06.004
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Deep learning has been widely applied and brought breakthroughs in speech recognition, computer vision, and many other domains. Deep neural network architectures and computational issues have been well studied in machine learning. But there lacks a theoretical foundation for understanding the approximation or generalization ability of deep learning methods generated by the network architectures such as deep convolutional neural networks. Here we show that a deep convolutional neural network (CNN) is universal, meaning that it can be used to approximate any continuous function to an arbitrary accuracy when the depth of the neural network is large enough. This answers an open question in learning theory. Our quantitative estimate, given tightly in terms of the number of free parameters to be computed, verifies the efficiency of deep CNNs in dealing with large dimensional data. Our study also demonstrates the role of convolutions in deep CNNs. (C) 2019 Elsevier Inc. All rights reserved.
引用
收藏
页码:787 / 794
页数:8
相关论文
共 30 条
[1]  
[Anonymous], 2015, Deep learn. nat., DOI [10.1038/nature14539, DOI 10.1038/NATURE14539]
[2]  
[Anonymous], 2016, Deep Learning
[3]  
[Anonymous], ARXIV161200824V1
[4]   UNIVERSAL APPROXIMATION BOUNDS FOR SUPERPOSITIONS OF A SIGMOIDAL FUNCTION [J].
BARRON, AR .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1993, 39 (03) :930-945
[5]   Optimal Approximation with Sparsely Connected Deep Neural Networks [J].
Boelcskei, Helmut ;
Grohs, Philipp ;
Kutyniok, Gitta ;
Petersen, Philipp .
SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE, 2019, 1 (01) :8-45
[6]   Invariant Scattering Convolution Networks [J].
Bruna, Joan ;
Mallat, Stephane .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1872-1886
[7]   Limitations of the approximation capabilities of neural networks with one hidden layer [J].
Chui, CK ;
Li, X ;
Mhaskar, HN .
ADVANCES IN COMPUTATIONAL MATHEMATICS, 1996, 5 (2-3) :233-243
[8]  
Cybenko G., 1989, Mathematics of Control, Signals, and Systems, V2, P303, DOI 10.1007/BF02551274
[9]  
Daubechies I., 1992, CBMS NSF REGIONAL C, DOI [DOI 10.1137/1.9781611970104, 10.1137/1.9781611970104]
[10]   Integrating AI into radiology workflow: levels of research, production, and feedback maturity [J].
Dikici, Engin ;
Bigelow, Matthew ;
Prevedello, Luciano M. ;
White, Richard D. ;
Erdal, Barbaros S. .
JOURNAL OF MEDICAL IMAGING, 2020, 7 (01)