Multi-level convolutional autoencoder networks for parametric prediction of spatio-temporal dynamics

被引:96
作者
Xu, Jiayang [1 ]
Duraisamy, Karthik [1 ]
机构
[1] Univ Michigan, Dept Aerosp Engn, Ann Arbor, MI 48109 USA
关键词
Reduced order modeling; Data-driven modeling; Autoencoders; Convolutional neural networks; Machine learning; PROPER GENERALIZED DECOMPOSITION; EMPIRICAL INTERPOLATION METHOD; PETROV-GALERKIN PROJECTION; MODEL ORDER REDUCTION; NEURAL-NETWORKS; ORTHOGONAL DECOMPOSITION; SYSTEMS; FLOWS; EQUATIONS; POD;
D O I
10.1016/j.cma.2020.113379
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
A data-driven framework is proposed towards the end of predictive modeling of complex spatio-temporal dynamics, leveraging nested non-linear manifolds. Three levels of neural networks are used, with the goal of predicting the future state of a system of interest in a parametric setting. A convolutional autoencoder is used as the top level to encode the high dimensional input data along spatial dimensions into a sequence of latent variables. A temporal convolutional autoencoder (TCAE) serves as the second level, which further encodes the output sequence from the first level along the temporal dimension, and outputs a set of latent variables that encapsulate the spatio-temporal evolution of the dynamics. The use of dilated temporal convolutions grows the receptive field exponentially with network depth, allowing for efficient processing of long temporal sequences typical of scientific computations. A fully-connected network is used as the third level to learn the mapping between these latent variables and the global parameters from training data, and predict them for new parameters. For future state predictions, the second level uses a temporal convolutional network to predict subsequent steps of the output sequence from the top level. Latent variables at the bottom-most level are decoded to obtain the dynamics in physical space at new global parameters and/or at a future time. Predictive capabilities are evaluated on a range of problems involving discontinuities, wave propagation, strong transients, and coherent structures. The sensitivity of the results to different modeling choices is assessed. The results suggest that given adequate data and careful training, effective data-driven predictive models can be constructed. Perspectives are provided on the present approach and its place in the landscape of model reduction. (c) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:36
相关论文
共 85 条
  • [1] AN ONLINE METHOD FOR INTERPOLATING LINEAR PARAMETRIC REDUCED-ORDER MODELS
    Amsallem, David
    Farhat, Charbel
    [J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2011, 33 (05) : 2169 - 2198
  • [2] [Anonymous], 2014, P SSST EMNLP 2014 8
  • [3] [Anonymous], 2015, IEEE I CONF COMP VIS, DOI DOI 10.1109/ICCV.2015.123
  • [4] [Anonymous], 2018, ARXIV181003455
  • [5] [Anonymous], 1993, Advances in neural information processing systems
  • [6] [Anonymous], 2016, GUID CONVOLUTION ARI
  • [7] [Anonymous], 2018, ARXIV180406076
  • [8] Astrid P., 2004, Reduction of process simulation models: a proper orthogonal decomposition approach
  • [9] Missing Point Estimation in Models Described by Proper Orthogonal Decomposition
    Astrid, Patricia
    Weiland, Siep
    Willcox, Karen
    Backx, Ton
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2008, 53 (10) : 2237 - 2251
  • [10] Bai S., 2018, ARXIV