Markovian RNN: An Adaptive Time Series Prediction Network With HMM-Based Switching for Nonstationary Environments

被引:12
作者
Ilhan, Fatih [1 ,2 ]
Karaahmetoglu, Oguzhan [1 ,2 ]
Balaban, Ismail [2 ]
Kozat, Suleyman Serdar [1 ,2 ]
机构
[1] Bilkent Univ, Dept Elect & Elect Engn, TR-06800 Ankara, Turkey
[2] DataBoss AS, ODTU Teknokent, TR-06800 Ankara, Turkey
关键词
Hidden Markov models; Time series analysis; Switches; Predictive models; Task analysis; Adaptation models; Data models; Hidden Markov models (HMMs); nonlinear regression; nonstationarity; recurrent neural networks (RNNs); regime switching; time series prediction; NEURAL-NETWORKS; MIXTURE; MODEL; DEEP;
D O I
10.1109/TNNLS.2021.3100528
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We investigate nonlinear regression for nonstationary sequential data. In most real-life applications such as business domains including finance, retail, energy, and economy, time series data exhibit nonstationarity due to the temporally varying dynamics of the underlying system. We introduce a novel recurrent neural network (RNN) architecture, which adaptively switches between internal regimes in a Markovian way to model the nonstationary nature of the given data. Our model, Markovian RNN employs a hidden Markov model (HMM) for regime transitions, where each regime controls hidden state transitions of the recurrent cell independently. We jointly optimize the whole network in an end-to-end fashion. We demonstrate the significant performance gains compared to conventional methods such as Markov Switching ARIMA, RNN variants and recent statistical and deep learning-based methods through an extensive set of experiments with synthetic and real-life datasets. We also interpret the inferred parameters and regime belief values to analyze the underlying dynamics of the given sequences.
引用
收藏
页码:715 / 728
页数:14
相关论文
共 43 条
[1]  
[Anonymous], 1995, Back-Propagation: Theory, Architectures and Applications
[2]   Adaptive Hidden Markov Model With Anomaly States for Price Manipulation Detection [J].
Cao, Yi ;
Li, Yuhua ;
Coleman, Sonya ;
Belatreche, Ammar ;
McGinnity, Thomas Martin .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (02) :318-330
[3]   Quantized Nonstationary Filtering of Networked Markov Switching RSNSs: A Multiple Hierarchical Structure Strategy [J].
Cheng, Jun ;
Park, Ju H. ;
Zhao, Xudong ;
Karimi, Hamid Reza ;
Cao, Jinde .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2020, 65 (11) :4816-4823
[4]  
Cho K, 2014, Proceedings of the Empiricial Methods in Natural Language Processing, P1724, DOI [10.3115/v1/d14-1179, 10.3115/v1/D14-1179]
[5]   A survey on the application of recurrent neural networks to statistical language modeling [J].
De Mulder, Wim ;
Bethard, Steven ;
Moens, Marie-Francine .
COMPUTER SPEECH AND LANGUAGE, 2015, 30 (01) :61-98
[6]   Mixture of MLP-experts for trend forecasting of time series: A case study of the Tehran stock exchange [J].
Ebrahimpour, Reza ;
Nikoo, Hossein ;
Masoudnia, Saeed ;
Yousefi, Mohammad Reza ;
Ghaemi, Mohammad Sajjad .
INTERNATIONAL JOURNAL OF FORECASTING, 2011, 27 (03) :804-816
[7]   FINDING STRUCTURE IN TIME [J].
ELMAN, JL .
COGNITIVE SCIENCE, 1990, 14 (02) :179-211
[8]   Hidden Markov processes [J].
Ephraim, Y ;
Merhav, N .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2002, 48 (06) :1518-1569
[9]   Unsupervised Anomaly Detection With LSTM Neural Networks [J].
Ergen, Tolga ;
Kozat, Suleyman Serdar .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (08) :3127-3141
[10]   Efficient Online Learning Algorithms Based on LSTM Neural Networks [J].
Ergen, Tolga ;
Kozat, Suleyman Serdar .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (08) :3772-3783