On Transversality of Bent Hyperplane Arrangements and the Topological Expressiveness of ReLU Neural Networks

被引:10
作者
Grigsby, J. Elisenda [1 ]
Lindsey, Kathryn [1 ]
机构
[1] Boston Coll, Dept Math, Chestnut Hill, MA 02467 USA
关键词
ReLU neural networks; polyhedral complexes; topological expressiveness; APPROXIMATION;
D O I
10.1137/20M1368902
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Let F : R-n -> R be a feed forward ReLU neural network. It is well-known that for any choice of parameters, F is continuous and piece wise affine-linear. We lay some foundations for a systematic investigation of how the architecture of F impacts the geometry and topology of its possible decision regions, F-1((-infinity, t)) and F -1((t, infinity)), for binary classification tasks. Following the classical progression for smooth functions in differential topology, we first define the notion of a generic, transversal ReLU neural network and show that almost all ReLU networks are generic and transversal. We then define a partially oriented linear 1-complex in the domain of F and identify properties of this complex that yield an obstruction to the existence of bounded connected components of a decision region. We use this obstruction to prove that a decision region of a generic, transversal ReLU network F : R-n -> R with a single hidden layer of dimension n+ 1 can have no more than one bounded connected component.
引用
收藏
页码:216 / 242
页数:27
相关论文
共 29 条
[1]  
Arora R., 2018, INT C LEARNING REPRE
[2]   On decision regions of narrow deep neural networks [J].
Beise, Hans-Peter ;
Da Cruz, Steve Dias ;
Schroder, Udo .
NEURAL NETWORKS, 2021, 140 :121-129
[3]   Reconciling modern machine-learning practice and the classical bias-variance trade-off [J].
Belkin, Mikhail ;
Hsu, Daniel ;
Ma, Siyuan ;
Mandal, Soumik .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2019, 116 (32) :15849-15854
[4]   On the Complexity of Neural Network Classifiers: A Comparison Between Shallow and Deep Architectures [J].
Bianchini, Monica ;
Scarselli, Franco .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (08) :1553-1565
[5]  
Bland D., 2016, The Theory of Linear Viscoelasticity
[6]  
Cisse M, 2017, PR MACH LEARN RES, V70
[7]  
Cybenko G., 1989, Mathematics of Control, Signals, and Systems, V2, P303, DOI 10.1007/BF02551274
[8]  
Gr?nbaum B., 2003, CONVEX POLYTOPES, DOI [10.1007/978-1-4613-0019-9, DOI 10.1007/978-1-4613-0019-9]
[9]  
GRIGSBY J. E., 2021, LOCAL GLOBAL T UNPUB
[10]  
GRIGSBY J. E., 2021, PREPRINTS