Uncertainty quantification of graph convolution neural network models of evolving processes

Cited by: 0

Authors:

Hauth, Jeremiah [1]

Safta, Cosmin [2]

Huan, Xun [1]

Patel, Ravi G. [3]

Jones, Reese E. [2]

Affiliations:

[1] Univ Michigan, Ann Arbor, MI USA

[2] Sandia Natl Labs, Livermore, CA 94550 USA

[3] Sandia Natl Labs, Albuquerque, NM USA

Source:

COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING | 2024 / Vol. 429

Keywords:

Neural networks; Uncertainty quantification; Recurrent networks; Neural ordinary differential equations; Stein variational gradient descent; FRAMEWORK; INFERENCE; LAWS;

DOI:

10.1016/j.cma.2024.117195

Chinese Library Classification (CLC) number:

T [Industrial Technology];

Discipline classification code:

08 ;

Abstract:

The application of neural network models to scientific machine learning tasks has proliferated in recent years. In particular, neural networks have proved to be adept at modeling processes with spatial-temporal complexity. Nevertheless, these highly parameterized models have garnered skepticism in their ability to produce outputs with quantified error bounds over the regimes of interest. Hence there is a need to find uncertainty quantification methods that are suitable for neural networks. In this work we present comparisons of the parametric uncertainty quantification of neural networks modeling complex spatial-temporal processes with Hamiltonian Monte Carlo and Stein variational gradient descent and its projected variant. Specifically we apply these methods to graph convolutional neural network models of evolving systems modeled with recurrent neural network and neural ordinary differential equations architectures. We show that Stein variational inference is a viable alternative to Monte Carlo methods with some clear advantages for complex neural network models. For our exemplars, Stein variational inference gave similar pushed forward uncertainty profiles through time compared to Hamiltonian Monte Carlo, albeit with generally more generous variance. Projected Stein variational gradient descent also produced similar uncertainty profiles to the non-projected counterpart, but large reductions in the active weight space were confounded by the stability of the neural network predictions and the convoluted likelihood landscape.


Pages: 23
