On Transversality of Bent Hyperplane Arrangements and the Topological Expressiveness of ReLU Neural Networks

被引：10

作者：

Grigsby, J. Elisenda ^{[1
]}

Lindsey, Kathryn ^{[1
]}

机构：

[1] Boston Coll, Dept Math, Chestnut Hill, MA 02467 USA

来源：

SIAM JOURNAL ON APPLIED ALGEBRA AND GEOMETRY | 2022年 / 6卷 / 02期

关键词：

ReLU neural networks; polyhedral complexes; topological expressiveness; APPROXIMATION;

D O I：

10.1137/20M1368902

中图分类号：

O29 [应用数学];

学科分类号：

070104 ;

摘要：

Let F : R-n -> R be a feed forward ReLU neural network. It is well-known that for any choice of parameters, F is continuous and piece wise affine-linear. We lay some foundations for a systematic investigation of how the architecture of F impacts the geometry and topology of its possible decision regions, F-1((-infinity, t)) and F -1((t, infinity)), for binary classification tasks. Following the classical progression for smooth functions in differential topology, we first define the notion of a generic, transversal ReLU neural network and show that almost all ReLU networks are generic and transversal. We then define a partially oriented linear 1-complex in the domain of F and identify properties of this complex that yield an obstruction to the existence of bounded connected components of a decision region. We use this obstruction to prove that a decision region of a generic, transversal ReLU network F : R-n -> R with a single hidden layer of dimension n+ 1 can have no more than one bounded connected component.

引用

页码：216 / 242

页数：27

共 29 条

[1]

Arora R., 2018, INT C LEARNING REPRE

[2] On decision regions of narrow deep neural networks [J].

Beise, Hans-Peter ;

Da Cruz, Steve Dias ;

Schroder, Udo .

NEURAL NETWORKS, 2021, 140 :121-129

[3] Reconciling modern machine-learning practice and the classical bias-variance trade-off [J].

Belkin, Mikhail ;

Hsu, Daniel ;

Ma, Siyuan ;

Mandal, Soumik .

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2019, 116 (32) :15849-15854

[4] On the Complexity of Neural Network Classifiers: A Comparison Between Shallow and Deep Architectures [J].

Bianchini, Monica ;

Scarselli, Franco .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (08) :1553-1565

[5]

Bland D., 2016, The Theory of Linear Viscoelasticity

[6]

Cisse M, 2017, PR MACH LEARN RES, V70

[7]

Cybenko G., 1989, Mathematics of Control, Signals, and Systems, V2, P303, DOI 10.1007/BF02551274

[8]

Gr?nbaum B., 2003, CONVEX POLYTOPES, DOI [10.1007/978-1-4613-0019-9, DOI 10.1007/978-1-4613-0019-9]

[9]

GRIGSBY J. E., 2021, LOCAL GLOBAL T UNPUB

[10]

GRIGSBY J. E., 2021, PREPRINTS

← 1 2 3 →