Learning on predictions: Fusing training and autoregressive inference for long-term spatiotemporal forecasts

被引:0
作者
Vlachas, P. R. [1 ]
Koumoutsakos, P. [1 ,2 ]
机构
[1] Swiss Fed Inst Technol, Computat Sci & Engn Lab, Clausiusstr 33, CH-8092 Zurich, Switzerland
[2] Harvard Univ, Harvard John A Paulson Sch Engn & Appl Sci, Computat Sci & Engn Lab, 29 Oxford St, Cambridge, MA 02138 USA
关键词
Autoregressive forecasting; RNN; LSTM; BPTT; Exposure bias; RECURRENT NEURAL-NETWORK; BACKPROPAGATION; TIME; ALGORITHMS;
D O I
10.1016/j.physd.2024.134371
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Predictions of complex systems ranging from natural language processing to weather forecasting have benefited from advances in Recurrent Neural Networks (RNNs). RNNs are typically trained using techniques like Backpropagation Through Time (BPTT) to minimize one-step-ahead prediction loss. During testing, RNNs often operate in an auto-regressive mode, with the output of the network fed back into its input. However, this process can eventually result in exposure bias since the network has been trained to process "ground-truth" data rather than its own predictions. This inconsistency causes errors that compound over time, indicating that the distribution of data used for evaluating losses differs from the actual operating conditions encountered by the model during training. Inspired by the solution to this challenge in language processing networks we propose the Scheduled Autoregressive Truncated Backpropagation Through Time (BPTT-SA) algorithm for predicting complex dynamical systems using RNNs. We find that BPTT-SA effectively reduces iterative error propagation in Convolutional and Convolutional Autoencoder RNNs and demonstrates its capabilities in the long-term prediction of high-dimensional fluid flows.
引用
收藏
页数:13
相关论文
共 82 条
  • [1] Ahmad AM, 2004, IEEE INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES 2004 (ISCIT 2004), PROCEEDINGS, VOLS 1 AND 2, P98
  • [2] Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning
    Alipanahi, Babak
    Delong, Andrew
    Weirauch, Matthew T.
    Frey, Brendan J.
    [J]. NATURE BIOTECHNOLOGY, 2015, 33 (08) : 831 - +
  • [3] Alnaes M.S., 2015, Arch. Numer. Softw., V3, DOI [DOI 10.11588/ANS.2015.100.20553, 10.11588/ans.2015.100.20553]
  • [4] A comparison of recurrent training algorithms for time series analysis and system identification
    Anderson, JS
    Kevrekidis, IG
    RicoMartinez, R
    [J]. COMPUTERS & CHEMICAL ENGINEERING, 1996, 20 : S751 - S756
  • [5] Searching for exotic particles in high-energy physics with deep learning
    Baldi, P.
    Sadowski, P.
    Whiteson, D.
    [J]. NATURE COMMUNICATIONS, 2014, 5
  • [6] Beal J, PREPRINT, DOI DOI 10.1214/20-BA1223
  • [7] Bengio S, 2015, ADV NEUR IN, V28
  • [8] Bengio Y., 2009, P 26 ANN INT C MACH, V382, P41, DOI [DOI 10.1145/1553374.1553380, 10.1145/1553374.155338]
  • [9] Are extreme dissipation events predictable in turbulent fluid flows?
    Blonigan, Patrick J.
    Farazmand, Mohammad
    Sapsis, Themistoklis P.
    [J]. PHYSICAL REVIEW FLUIDS, 2019, 4 (04)
  • [10] Brenner M, 2022, PR MACH LEARN RES