Function-Space Optimality of Neural Architectures with Multivariate Nonlinearities
被引:0
|
作者:
Parhi, Rahul
论文数: 0引用数: 0
h-index: 0
机构:
Univ Calif San Diego, Dept Elect & Comp Engn, La Jolla, CA 92093 USA
Ecole Polytech Fed Lausanne, Biomed Imaging Grp, CH-1015 Lausanne, SwitzerlandUniv Calif San Diego, Dept Elect & Comp Engn, La Jolla, CA 92093 USA
Parhi, Rahul
[1
,2
]
Unser, Michael
论文数: 0引用数: 0
h-index: 0
机构:
Ecole Polytech Fed Lausanne, Biomed Imaging Grp, CH-1015 Lausanne, SwitzerlandUniv Calif San Diego, Dept Elect & Comp Engn, La Jolla, CA 92093 USA
Unser, Michael
[3
]
机构:
[1] Univ Calif San Diego, Dept Elect & Comp Engn, La Jolla, CA 92093 USA
We investigate the function-space optimality (specifically, the Banach-space optimality) of a large class of shallow neural architectures with multivariate nonlinearities/activation functions. To that end, we construct a new family of Banach spaces defined via a regularization operator, the k-plane transform, and a sparsity-promoting norm. We prove a representer theorem that states that the solution sets to learning problems posed over these Banach spaces are completely characterized by neural architectures with multivariate nonlinearities. These optimal architectures have skip connections and are tightly connected to orthogonal weight normalization and multi-index models, both of which have received recent interest in the neural network community. Our framework is compatible with a number of classical nonlinearities including the rectified linear unit activation function, the norm activation function, and the radial basis functions found in the theory of thin-plate/polyharmonic splines. We also show that the underlying spaces are special instances of reproducing kernel Banach spaces and variation spaces. Our results shed light on the regularity of functions learned by neural networks trained on data, particularly with multivariate nonlinearities, and provide new theoretical motivation for several architectural choices found in practice.
机构:
Renmin Univ China, Sch Math, Beijing 100872, Peoples R ChinaRenmin Univ China, Sch Math, Beijing 100872, Peoples R China
Meng, Yan
Ming, Pingbing
论文数: 0引用数: 0
h-index: 0
机构:
Chinese Acad Sci AMSS, Inst Computat Math & Sci Engn Comp LSEC, 55, East Rd Zhong Guan Cun, Beijing 100190, Peoples R China
Univ Chinese Acad Sci, Sch Math Sci, Beijing 100049, Peoples R ChinaRenmin Univ China, Sch Math, Beijing 100872, Peoples R China
机构:
Northwest A&F Univ, Dept Elect Engn, Shaanxi Yangling 712100, Peoples R China
Arizona State Univ, Sch Elect Comp & Energy Engn, Tempe, AZ 85287 USA
Northwest A&F Univ, Inst Efficient Water Use Arid Agr China, Shaanxi Yangling 712100, Peoples R ChinaNorthwest A&F Univ, Dept Elect Engn, Shaanxi Yangling 712100, Peoples R China
Chen, Diyi
Han, Wenting
论文数: 0引用数: 0
h-index: 0
机构:
Northwest A&F Univ, Inst Efficient Water Use Arid Agr China, Shaanxi Yangling 712100, Peoples R China
Chinese Acad Sci, Inst Soil & Water Conservat, Shaanxi Yangling 712100, Peoples R China
Minist Water Resources, Shaanxi Yangling 712100, Peoples R ChinaNorthwest A&F Univ, Dept Elect Engn, Shaanxi Yangling 712100, Peoples R China
机构:
Karlsruhe Institute of Technology, Institute of Experimental Particle Physics, Karlsruhe
CERN, GenevaKarlsruhe Institute of Technology, Institute of Experimental Particle Physics, Karlsruhe
Wunsch S.
Jörger S.
论文数: 0引用数: 0
h-index: 0
机构:
Karlsruhe Institute of Technology, Institute of Experimental Particle Physics, KarlsruheKarlsruhe Institute of Technology, Institute of Experimental Particle Physics, Karlsruhe
Jörger S.
Wolf R.
论文数: 0引用数: 0
h-index: 0
机构:
Karlsruhe Institute of Technology, Institute of Experimental Particle Physics, KarlsruheKarlsruhe Institute of Technology, Institute of Experimental Particle Physics, Karlsruhe
Wolf R.
Quast G.
论文数: 0引用数: 0
h-index: 0
机构:
Karlsruhe Institute of Technology, Institute of Experimental Particle Physics, KarlsruheKarlsruhe Institute of Technology, Institute of Experimental Particle Physics, Karlsruhe