Error bounds for approximations with deep ReLU neural networks in Ws,p norms

被引:89
作者
Guehring, Ingo [1 ]
Kutyniok, Gitta [1 ,2 ,3 ]
Petersen, Philipp [4 ]
机构
[1] Tech Univ Berlin, Inst Math, Berlin, Germany
[2] Tech Univ Berlin, Dept Comp Sci & Elect Engn, Berlin, Germany
[3] Univ Tromso, Dept Phys & Technol, Tromso, Norway
[4] Univ Oxford, Math Inst, Oxford, England
关键词
Deep neural networks; approximation rates; Sobolev spaces; PDEs; curse of dimension; ALGORITHM; SMOOTH;
D O I
10.1142/S0219530519410021
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
We analyze to what extent deep Rectified Linear Unit (ReLU) neural networks can efficiently approximate Sobolev regular functions if the approximation error is measured with respect to weaker Sobolev norms. In this context, we first establish upper approximation bounds by ReLU neural networks for Sobolev regular functions by explicitly constructing the approximate ReLU neural networks. Then, we establish lower approximation bounds for the same type of function classes. A trade-off between the regularity used in the approximation norm and the complexity of the neural network can be observed in upper and lower bounds. Our results extend recent advances in the approximation theory of ReLU networks to the regime that is most relevant for applications in the numerical analysis of partial differential equations.
引用
收藏
页码:803 / 859
页数:57
相关论文
共 68 条
  • [1] Adams R.A., 2003, Sobolev Spaces, V140
  • [2] Almira J., ARXIV181010032
  • [3] Amann H., 2008, GRUNDSTUDIUM MATH
  • [4] [Anonymous], ARXIV180600421
  • [5] [Anonymous], ARXIV180703973
  • [6] [Anonymous], ARXIV171208688
  • [7] Arora R., 2018, P 2018 INT C LEARN R
  • [8] Barron A., 1994, MACH LEARN, V14, P115
  • [9] APPROXIMATION AND ESTIMATION BOUNDS FOR ARTIFICIAL NEURAL NETWORKS
    BARRON, AR
    [J]. MACHINE LEARNING, 1994, 14 (01) : 115 - 133
  • [10] Representation Learning: A Review and New Perspectives
    Bengio, Yoshua
    Courville, Aaron
    Vincent, Pascal
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) : 1798 - 1828