Randomized Sparse Neural Galerkin Schemes for Solving Evolution Equations with Deep Networks

Cited by: 0

Authors:

Berman, Jules [1]

Peherstorfer, Benjamin [1]

Affiliations:

[1] New York Univ, Courant Inst Math Sci, New York, NY 10012 USA

Source:

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023

Funding:

U.S. National Science Foundation;

Keywords:

MODEL-REDUCTION; TIME; APPROXIMATION; DYNAMICS;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Training neural networks sequentially in time to approximate solution fields of time-dependent partial differential equations can be beneficial for preserving causality and other physics properties; however, the sequential-in-time training is numerically challenging because training errors quickly accumulate and amplify over time. This work introduces Neural Galerkin schemes that update randomized sparse subsets of network parameters at each time step. The randomization avoids overfitting locally in time and so helps prevent the error from accumulating quickly over the sequential-in-time training, which is motivated by dropout that addresses a similar issue of overfitting due to neuron co-adaptation. The sparsity of the update reduces the computational costs of training without losing expressiveness because many of the network parameters are redundant locally at each time step. In numerical experiments with a wide range of evolution equations, the proposed scheme with randomized sparse updates is up to two orders of magnitude more accurate at a fixed computational budget and up to two orders of magnitude faster at a fixed accuracy than schemes with dense updates.


Pages: 18
