Uncertainty quantification of graph convolution neural network models of evolving processes

被引:0
作者
Hauth, Jeremiah [1 ]
Safta, Cosmin [2 ]
Huan, Xun [1 ]
Patel, Ravi G. [3 ]
Jones, Reese E. [2 ]
机构
[1] Univ Michigan, Ann Arbor, MI USA
[2] Sandia Natl Labs, Livermore, CA 94550 USA
[3] Sandia Natl Labs, Albuquerque, NM USA
关键词
Neural networks; Uncertainty quantification; Recurrent networks; Neural ordinary differential equations; Stein variational gradient descent; FRAMEWORK; INFERENCE; LAWS;
D O I
10.1016/j.cma.2024.117195
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The application of neural network models to scientific machine learning tasks has proliferated in recent years. In particular, neural networks have proved to be adept at modeling processes with spatial-temporal complexity. Nevertheless, these highly parameterized models have garnered skepticism in their ability to produce outputs with quantified error bounds over the regimes of interest. Hence there is a need to find uncertainty quantification methods that are suitable for neural networks. In this work we present comparisons of the parametric uncertainty quantification of neural networks modeling complex spatial-temporal processes with Hamiltonian Monte Carlo and Stein variational gradient descent and its projected variant. Specifically we apply these methods to graph convolutional neural network models of evolving systems modeled with recurrent neural network and neural ordinary differential equations architectures. We show that Stein variational inference is a viable alternative to Monte Carlo methods with some clear advantages for complex neural network models. For our exemplars, Stein variational interference gave similar pushed forward uncertainty profiles through time compared to Hamiltonian Monte Carlo, albeit with generally more generous variance. Projected Stein variational gradient descent also produced similar uncertainty profiles to the non-projected counterpart, but large reductions in the active weight space were confounded by the stability of the neural network predictions and the convoluted likelihood landscape.
引用
收藏
页数:23
相关论文
共 79 条
  • [71] Wang DL, 2018, PR MACH LEARN RES, V80
  • [72] Wang DL, 2019, ADV NEUR IN, V32
  • [73] Predicting plastic anisotropy using crystal plasticity and Bayesian neural network surrogate models
    Zapiain, David Montes de Oca
    Lim, Hojun
    Park, Taejoon
    Pourboghrat, Farhang
    [J]. MATERIALS SCIENCE AND ENGINEERING A-STRUCTURAL MATERIALS PROPERTIES MICROSTRUCTURE AND PROCESSING, 2022, 833
  • [74] Localization models for the plastic response of polycrystalline materials using the material knowledge systems framework
    Zapiain, David Montes de Oca
    Kalidindi, Surya R.
    [J]. MODELLING AND SIMULATION IN MATERIALS SCIENCE AND ENGINEERING, 2019, 27 (07)
  • [75] Advances in Variational Inference
    Zhang, Cheng
    Butepage, Judith
    Kjellstrom, Hedvig
    Mandt, Stephan
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (08) : 2008 - 2026
  • [76] Zhang C, 2018, BAYESIAN ANAL, V13, P485, DOI [10.1214/17-BA1060, 10.1214/17-ba1060]
  • [77] Zhu Jun, 2018, 35 INT C MACH LEARN, V13, P9629
  • [78] Bayesian deep convolutional encoder-decoder networks for surrogate modeling and uncertainty quantification
    Zhu, Yinhao
    Zabaras, Nicholas
    [J]. JOURNAL OF COMPUTATIONAL PHYSICS, 2018, 366 : 415 - 447
  • [79] Zou Difan, 2019, Adv. Neural Inf. Process. Syst., V32