Uncertainty quantification of graph convolution neural network models of evolving processes

被引：0

作者：

Hauth, Jeremiah ^{[1
]}

Safta, Cosmin ^{[2
]}

Huan, Xun ^{[1
]}

Patel, Ravi G. ^{[3
]}

Jones, Reese E. ^{[2
]}

机构：

[1] Univ Michigan, Ann Arbor, MI USA

[2] Sandia Natl Labs, Livermore, CA 94550 USA

[3] Sandia Natl Labs, Albuquerque, NM USA

来源：

COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING | 2024年 / 429卷

关键词：

Neural networks; Uncertainty quantification; Recurrent networks; Neural ordinary differential equations; Stein variational gradient descent; FRAMEWORK; INFERENCE; LAWS;

D O I：

10.1016/j.cma.2024.117195

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

The application of neural network models to scientific machine learning tasks has proliferated in recent years. In particular, neural networks have proved to be adept at modeling processes with spatial-temporal complexity. Nevertheless, these highly parameterized models have garnered skepticism in their ability to produce outputs with quantified error bounds over the regimes of interest. Hence there is a need to find uncertainty quantification methods that are suitable for neural networks. In this work we present comparisons of the parametric uncertainty quantification of neural networks modeling complex spatial-temporal processes with Hamiltonian Monte Carlo and Stein variational gradient descent and its projected variant. Specifically we apply these methods to graph convolutional neural network models of evolving systems modeled with recurrent neural network and neural ordinary differential equations architectures. We show that Stein variational inference is a viable alternative to Monte Carlo methods with some clear advantages for complex neural network models. For our exemplars, Stein variational interference gave similar pushed forward uncertainty profiles through time compared to Hamiltonian Monte Carlo, albeit with generally more generous variance. Projected Stein variational gradient descent also produced similar uncertainty profiles to the non-projected counterpart, but large reductions in the active weight space were confounded by the stability of the neural network predictions and the convoluted likelihood landscape.

引用

页数：23

共 79 条

[71] Wang DL, 2018, PR MACH LEARN RES, V80
[72] Wang DL, 2019, ADV NEUR IN, V32
[73] Predicting plastic anisotropy using crystal plasticity and Bayesian neural network surrogate models
Zapiain, David Montes de Oca
Lim, Hojun
Park, Taejoon
Pourboghrat, Farhang
[J]. MATERIALS SCIENCE AND ENGINEERING A-STRUCTURAL MATERIALS PROPERTIES MICROSTRUCTURE AND PROCESSING, 2022, 833
[74] Localization models for the plastic response of polycrystalline materials using the material knowledge systems framework
Zapiain, David Montes de Oca
Kalidindi, Surya R.
[J]. MODELLING AND SIMULATION IN MATERIALS SCIENCE AND ENGINEERING, 2019, 27 (07)
[75] Advances in Variational Inference
Zhang, Cheng
Butepage, Judith
Kjellstrom, Hedvig
Mandt, Stephan
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (08) : 2008 - 2026
[76] Zhang C, 2018, BAYESIAN ANAL, V13, P485, DOI [10.1214/17-BA1060, 10.1214/17-ba1060]
[77] Zhu Jun, 2018, 35 INT C MACH LEARN, V13, P9629
[78] Bayesian deep convolutional encoder-decoder networks for surrogate modeling and uncertainty quantification
Zhu, Yinhao
Zabaras, Nicholas
[J]. JOURNAL OF COMPUTATIONAL PHYSICS, 2018, 366 : 415 - 447
[79] Zou Difan, 2019, Adv. Neural Inf. Process. Syst., V32

← 1 2 3 4 5 6 7 8 →