MgNet: A unified framework of multigrid and convolutional neural network

被引:82
作者
He, Juncai [1 ]
Xu, Jinchao [2 ]
机构
[1] Peking Univ, Sch Math Sci, Beijing 100871, Peoples R China
[2] Penn State Univ, Dept Math, University Pk, PA 16802 USA
基金
美国国家科学基金会;
关键词
convolutional neural network; multigrid; unified framework; network architecture;
D O I
10.1007/s11425-019-9547-2
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
We develop a unified model, known as MgNet, that simultaneously recovers some convolutional neural networks (CNN) for image classification and multigrid (MG) methods for solving discretized partial differential equations (PDEs). This model is based on close connections that we have observed and uncovered between the CNN and MG methodologies. For example, pooling operation and feature extraction in CNN correspond directly to restriction operation and iterative smoothers in MG, respectively. As the solution space is often the dual of the data space in PDEs, the analogous concept of feature space and data space (which are dual to each other) is introduced in CNN. With such connections and new concept in the unified model, the function of various convolution operations and pooling used in CNN can be better understood. As a result, modified CNN models (with fewer weights and hyperparameters) are developed that exhibit competitive and sometimes better performance in comparison with existing CNN models when applied to both CIFAR-10 and CIFAR-100 data sets.
引用
收藏
页码:1331 / 1354
页数:24
相关论文
共 55 条
[1]  
[Anonymous], DEEP RELU NETWORKS L
[2]  
[Anonymous], LEARNING SCALESA MUL
[3]  
[Anonymous], SCI CHINA MATH
[4]  
[Anonymous], APPROXIMATION PROPER
[5]  
[Anonymous], 2013, MULTIGRID METHODS AP
[6]  
[Anonymous], MULTILEVEL RESIDUAL
[7]  
[Anonymous], FINITE ELEMENT METHO
[8]  
[Anonymous], PROC CVPR IEEE
[9]  
[Anonymous], 2014, VERY DEEP CONVOLUTIO
[10]  
[Anonymous], PDE NET LEARNING PDE