Multi-level convolutional autoencoder networks for parametric prediction of spatio-temporal dynamics

被引：111

作者：

Xu, Jiayang ^{[1
]}

Duraisamy, Karthik ^{[1
]}

机构：

[1] Univ Michigan, Dept Aerosp Engn, Ann Arbor, MI 48109 USA

来源：

COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING | 2020年 / 372卷

关键词：

Reduced order modeling; Data-driven modeling; Autoencoders; Convolutional neural networks; Machine learning; PROPER GENERALIZED DECOMPOSITION; EMPIRICAL INTERPOLATION METHOD; PETROV-GALERKIN PROJECTION; MODEL ORDER REDUCTION; NEURAL-NETWORKS; ORTHOGONAL DECOMPOSITION; SYSTEMS; FLOWS; EQUATIONS; POD;

D O I：

10.1016/j.cma.2020.113379

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

A data-driven framework is proposed towards the end of predictive modeling of complex spatio-temporal dynamics, leveraging nested non-linear manifolds. Three levels of neural networks are used, with the goal of predicting the future state of a system of interest in a parametric setting. A convolutional autoencoder is used as the top level to encode the high dimensional input data along spatial dimensions into a sequence of latent variables. A temporal convolutional autoencoder (TCAE) serves as the second level, which further encodes the output sequence from the first level along the temporal dimension, and outputs a set of latent variables that encapsulate the spatio-temporal evolution of the dynamics. The use of dilated temporal convolutions grows the receptive field exponentially with network depth, allowing for efficient processing of long temporal sequences typical of scientific computations. A fully-connected network is used as the third level to learn the mapping between these latent variables and the global parameters from training data, and predict them for new parameters. For future state predictions, the second level uses a temporal convolutional network to predict subsequent steps of the output sequence from the top level. Latent variables at the bottom-most level are decoded to obtain the dynamics in physical space at new global parameters and/or at a future time. Predictive capabilities are evaluated on a range of problems involving discontinuities, wave propagation, strong transients, and coherent structures. The sensitivity of the results to different modeling choices is assessed. The results suggest that given adequate data and careful training, effective data-driven predictive models can be constructed. Perspectives are provided on the present approach and its place in the landscape of model reduction. (c) 2020 Elsevier B.V. All rights reserved.

引用

页数：36

共 85 条

[1] AN ONLINE METHOD FOR INTERPOLATING LINEAR PARAMETRIC REDUCED-ORDER MODELS [J].

Amsallem, David ;

Farhat, Charbel .

SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2011, 33 (05) :2169-2198

[2]

[Anonymous], IEEE I CONF COMP VIS, DOI DOI 10.1109/ICCV.2015.123

[3]

[Anonymous], 2014, P SSST8 8 WORKSH SYN

[4]

[Anonymous], 2019, ARXIV190300033

[5]

[Anonymous], INT C LEARNING REPRE

[6]

[Anonymous], 2018, ARXIV181003455

[7]

[Anonymous], 2019, ARXIV190709909

[8]

[Anonymous], 2018, A Guide to Convolution Arithmetic for Deep Learning

[9]

[Anonymous], 2018, ARXIV180406076

[10]

Astrid P., 2004, Reduction of process simulation models: a proper orthogonal decomposition approach

← 1 2 3 4 5 6 7 8 9 →