ModAugNet: A new forecasting framework for stock market index value with an overfitting prevention LSTM module and a prediction LSTM module

被引:205
作者
Baek, Yujin [1 ]
Kim, Ha Young [1 ,2 ]
机构
[1] Ajou Univ, Dept Financial Engn, Worldcupro 206, Suwon 16499, South Korea
[2] Ajou Univ, Dept Data Sci, Worldcupro 206, Suwon 16499, South Korea
基金
新加坡国家研究基金会;
关键词
Long short-term memory; Data augmentation; Overfitting; Deep learning; Stock market index; ARTIFICIAL NEURAL-NETWORKS; VOLATILITY; MACHINE;
D O I
10.1016/j.eswa.2018.07.019
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Forecasting a financial asset's price is important as one can lower the risk of investment decision- making with accurate forecasts. Recently, the deep neural network is popularly applied in this area of research; however, it is prone to overfitting owing to limited availability of data points for training. We propose a novel data augmentation approach for stock market index forecasting through our ModAugNet framework, which consists of two modules: an overfitting prevention LSTM module and a prediction LSTM module. The performance of the proposed model is evaluated using two different representative stock market data (S&P500 and Korea Composite Stock Price Index 200 (KOSPI200)). The results confirm the excellent forecasting accuracy of the proposed model. ModAugNet-c yields a lower test error than the comparative model (SingleNet) in which an overfitting prevention LSTM module is absent. The test mean squared error (MSE), mean absolute percentage error (MAPE), and mean absolute error (MAE) for S&P500 decreased to 54.1%, 35.5%, and 32.7%, respectively, of the corresponding S&P500 forecasting errors of SingleNet, while the same for KOSPI200 decreased to 48%, 23.9%, and 32.7%, respectively, of the corresponding KOSPI200 forecasting errors of SingleNet. Furthermore, through the analyses of the trained ModAugNet-c, we found that test performance is entirely dependent on the prediction LSTM module. The contribution of this study is its applicability in various instances where it is challenging to artificially augment data, such as medical data analysis and financial time-series modeling. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:457 / 480
页数:24
相关论文
共 61 条
  • [1] Introduction to financial forecasting
    AbuMostafa, YS
    Atiya, AF
    [J]. APPLIED INTELLIGENCE, 1996, 6 (03) : 205 - 213
  • [2] Stock Price Prediction Using the ARIMA Model
    Adebiyi, Ayodele A.
    Adewumi, Aderemi O.
    Ayo, Charles K.
    [J]. 2014 UKSIM-AMSS 16TH INTERNATIONAL CONFERENCE ON COMPUTER MODELLING AND SIMULATION (UKSIM), 2014, : 106 - 112
  • [3] A mutual association based nonlinear ensemble mechanism for time series forecasting
    Adhikari, Ratnadip
    [J]. APPLIED INTELLIGENCE, 2015, 43 (02) : 233 - 250
  • [4] Akita R., 2016, P IEEE ACIS 15 INT C, P1, DOI [DOI 10.1109/ICIS.2016.7550882, 10.1109/ICIS.2016.7550882]
  • [5] [Anonymous], ONLINE MACHINE LEARN
  • [6] [Anonymous], 2017, CoRRabs/1712.04621
  • [7] [Anonymous], 1997, Neural Computation
  • [8] [Anonymous], 1977, J MARKETING RES
  • [9] [Anonymous], 2000, Pattern Classification, DOI DOI 10.1007/978-3-319-57027-3_4
  • [10] [Anonymous], 2014, ABS14053531 CORR