Data-driven forecasting of high-dimensional chaotic systems with long short-term memory networks

被引:336
作者
Vlachas, Pantelis R. [1 ]
Byeon, Wonmin [1 ]
Wan, Zhong Y. [2 ]
Sapsis, Themistoklis P. [2 ]
Koumoutsakos, Petros [1 ]
机构
[1] ETH, Chair Computat Sci, Clausiusstr 33, CH-8092 Zurich, Switzerland
[2] MIT, Dept Mech Engn, 77 Massachusetts Ave, Cambridge, MA 02139 USA
来源
PROCEEDINGS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES | 2018年 / 474卷 / 2213期
基金
欧洲研究理事会;
关键词
data-driven forecasting; long short-term memory; Gaussian processes; T21 barotropic climate model; Lorenz; 96; SINGULAR SPECTRUM ANALYSIS; TIME-SERIES; MODEL-REDUCTION; PREDICTION; QUANTIFICATION; DECOMPOSITION; EVENTS;
D O I
10.1098/rspa.2017.0844
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
We introduce a data-driven forecasting method for high-dimensional chaotic systems using long shortterm memory (LSTM) recurrent neural networks. The proposed LSTM neural networks perform inference of high-dimensional dynamical systems in their reduced order space and are shown to be an effective set of nonlinear approximators of their attractor. We demonstrate the forecasting performance of the LSTM and compare it with Gaussian processes (GPs) in time series obtained from the Lorenz 96 system, the Kuramoto-Sivashinsky equation and a prototype climate model. The LSTM networks outperform the GPs in short-term forecasting accuracy in all applications considered. A hybrid architecture, extending the LSTM with a mean stochastic model (MSM-LSTM), is proposed to ensure convergence to the invariant measure. This novel hybrid method is fully data-driven and extends the forecasting capabilities of LSTM networks.
引用
收藏
页数:20
相关论文
共 75 条
  • [1] A new hybrid enhanced local linear neuro-fuzzy model based on the optimized singular spectrum analysis and its application for nonlinear and chaotic time series forecasting
    Abdollahzade, Majid
    Miranian, Arash
    Hassani, Hossein
    Iranmanesh, Hossein
    [J]. INFORMATION SCIENCES, 2015, 295 : 107 - 125
  • [2] Empirical correction of a toy climate model
    Allgaier, Nicholas A.
    Harris, Kameron D.
    Danforth, Christopher M.
    [J]. PHYSICAL REVIEW E, 2012, 85 (02):
  • [3] [Anonymous], 1997, Neural Computation
  • [4] [Anonymous], P 19 INT JOINT C ART
  • [5] Ergodic Theory, Dynamic Mode Decomposition, and Computation of Spectral Properties of the Koopman Operator
    Arbabi, Hassan
    Mezic, Igor
    [J]. SIAM JOURNAL ON APPLIED DYNAMICAL SYSTEMS, 2017, 16 (04): : 2096 - 2126
  • [6] Forecast improvement in Lorenz 96 system
    Basnarkov, L.
    Kocarev, L.
    [J]. NONLINEAR PROCESSES IN GEOPHYSICS, 2012, 19 (05) : 569 - 575
  • [7] LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT
    BENGIO, Y
    SIMARD, P
    FRASCONI, P
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02): : 157 - 166
  • [8] Least squares shadowing sensitivity analysis of a modified Kuramoto-Sivashinsky equation
    Blonigan, Patrick J.
    Wang, Qiqi
    [J]. CHAOS SOLITONS & FRACTALS, 2014, 64 : 16 - 25
  • [9] Automated reverse engineering of nonlinear dynamical systems
    Bongard, Josh
    Lipson, Hod
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2007, 104 (24) : 9943 - 9948
  • [10] Broomhead D. S., 1988, Complex Systems, V2, P321