RELU DEEP NEURAL NETWORKS AND LINEAR FINITE ELEMENTS

被引:111
作者
He, Juncai [1 ]
Li, Lin [2 ]
Xu, Jinchao [3 ]
Zheng, Chunyue [3 ]
机构
[1] Peking Univ, Sch Math Sci, Beijing 100871, Peoples R China
[2] Peking Univ, Beijing Int Ctr Math Res, Beijing 100871, Peoples R China
[3] Penn State Univ, Dept Math, State Coll, PA 16802 USA
关键词
Finite element method; deep neural network; piecewise linear function; MULTILAYER FEEDFORWARD NETWORKS; ORDINARY DIFFERENTIAL-EQUATIONS; APPROXIMATION; SPACE;
D O I
10.4208/jcm.1901-m2018-0160
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
In this paper, we investigate the relationship between deep neural networks (DNN) with rectified linear unit (ReLU) function as the activation function and continuous piecewise linear (CPWL) functions, especially CPWL functions from the simplicial linear finite element method (FEM). We first consider the special case of FEM. By exploring the DNN representation of its nodal basis functions, we present a ReLU DNN representation of CPWL in FEM. We theoretically establish that at least 2 hidden layers are needed in a ReLU DNN to represent any linear finite element functions in Omega subset of R-d when d >= 2. Consequently, for d = 2, 3 which are often encountered in scientific and engineering computing, the minimal number of two hidden layers are necessary and sufficient for any CPWL function to be represented by a ReLU DNN. Then we include a detailed account on how a general CPWL in R d can be represented by a ReLU DNN with at most left perpendicular log(2) (d + 1) right perpendicular hidden layers and we also give an estimation of the number of neurons in DNN that are needed in such a representation. Furthermore, using the relationship between DNN and FEM, we theoretically argue that a special class of DNN models with low bit-width are still expected to have an adequate representation power in applications. Finally, as a proof of concept, we present some numerical results for using ReLU DNNs to solve a two point boundary problem to demonstrate the potential of applying DNN for numerical solution of partial differential equations.
引用
收藏
页码:502 / 527
页数:26
相关论文
共 44 条
[1]  
[Anonymous], 2009, LEARNING MULTIPLE LA
[2]  
[Anonymous], APPL COMPUTATIONAL H
[3]  
[Anonymous], 1994, Acta Numerica, DOI 10.1017/S0962492900002439
[4]  
[Anonymous], 2017, ARXIV170703351
[5]  
[Anonymous], 2016, arXiv
[6]  
[Anonymous], 2016, ADV NEURAL INFORM PR
[7]  
[Anonymous], 2018, Approximation by Combinations of ReLU and Squared ReLU Ridge Functions with $ ell 1 $ and $ ell 0 $ Controls
[8]  
[Anonymous], 2016, C LEARNING THEORY
[9]  
[Anonymous], 2007, MATH THEORY FINITE E
[10]  
[Anonymous], DEEP NEURAL NETWORKS