Interplay between depth and width for interpolation in neural ODEs

Cited by: 1

Authors:

Alvarez-Lopez, Antonio [1,3]

Slimane, Arselane Hadj [2]

Zuazua, Enrique [1,3,4]

Affiliations:

[1] Univ Autonoma Madrid, Dept Ingn Quim, C Francisco Tomas & Valiente 7, 28049 Madrid, Spain

[2] ENS Paris Saclay, 4 Ave Sci, F-91190 Gif Sur Yvette, France

[3] Friedrich Alexander Univ Erlangen Nurnberg, Chair Dynam Control Machine Learning & Numer Alexa, Dept Math, Cauerstr 11, D-91058 Erlangen, Germany

[4] Fdn Deusto, Ave Univ 24, Bilbao 48007, Spain

Source:

NEURAL NETWORKS | 2024 / Volume 180

Keywords:

Neural ODEs; Depth; Width; Simultaneous controllability; Transport control; Wasserstein distance;

DOI:

10.1016/j.neunet.2024.106640

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Neural ordinary differential equations have emerged as a natural tool for supervised learning from a control perspective, yet a complete understanding of the role played by their architecture remains elusive. In this work, we examine the interplay between the width $p$ and the number of transitions between layers $L$ (corresponding to a depth of $L+1$). Specifically, we construct explicit controls interpolating either a finite dataset $D$, comprising $N$ pairs of points in $\mathbb{R}^d$, or two probability measures within a Wasserstein error margin $\varepsilon>0$. Our findings reveal a balancing trade-off between $p$ and $L$, with $L$ scaling as $1+O(N/p)$ for data interpolation, and as $1+O\left(p^{-1}+(1+p)^{-1}\varepsilon^{-d}\right)$ for measures. In the high-dimensional and wide setting where $d, p > N$, our result can be refined to achieve $L=0$. This naturally raises the problem of data interpolation in the autonomous regime, characterized by $L=0$. We adopt two alternative approaches: either controlling in a probabilistic sense, or relaxing the target condition. In the first case, when $p = N$, we develop an inductive control strategy based on a separability assumption whose probability increases with $d$. In the second one, we establish an explicit error decay rate with respect to $p$, which results from applying a universal approximation theorem to a custom-built Lipschitz vector field interpolating $D$.
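To make the roles of the width $p$ and the number of layer transitions $L$ concrete, the following is a minimal sketch of a neural ODE driven by piecewise-constant controls. Everything in it is an assumption made for illustration and is not taken from this record: the vector field $\dot{x} = \sum_{i=1}^{p} w_i\,\sigma(a_i \cdot x + b_i)$, the ReLU activation, the explicit Euler integrator, the equal-length time pieces, and the hypothetical names `vector_field` and `flow`. It shows only how $p$ and $L$ enter such a model, not the authors' control construction.

```python
def relu(z):
    return max(z, 0.0)

def vector_field(x, W, A, b):
    """Sum over p neurons of w_i * relu(a_i . x + b_i) (assumed model form)."""
    out = [0.0] * len(x)
    for w_i, a_i, b_i in zip(W, A, b):
        act = relu(sum(a * xi for a, xi in zip(a_i, x)) + b_i)
        for k in range(len(x)):
            out[k] += w_i[k] * act
    return out

def flow(x0, controls, T=1.0, steps_per_piece=100):
    """Integrate with explicit Euler; each (W, A, b) in `controls` is held
    constant on one of len(controls) equal sub-intervals of [0, T]."""
    x = list(x0)
    dt = T / (len(controls) * steps_per_piece)
    for W, A, b in controls:
        for _ in range(steps_per_piece):
            v = vector_field(x, W, A, b)
            x = [xi + dt * vi for xi, vi in zip(x, v)]
    return x

# Width p = 1, two constant pieces, hence L = 1 transition (depth L + 1 = 2).
controls = [
    ([[1.0, 0.0]], [[0.0, 0.0]], [1.0]),  # piece 1: constant velocity (1, 0)
    ([[0.0, 1.0]], [[0.0, 0.0]], [1.0]),  # piece 2: constant velocity (0, 1)
]
L = len(controls) - 1
x_final = flow([0.0, 0.0], controls)
```

In this toy setting, width is the number of `(w_i, a_i, b_i)` triples per piece and $L$ is the number of switches between pieces, so the trade-off described in the abstract corresponds to using fewer switches when each piece has more neurons.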


Pages: 14
