Stock market prediction with time series data and news headlines: a stacking ensemble approach

被引:14
作者
Corizzo, Roberto [1 ]
Rosen, Jacob [1 ]
机构
[1] Amer Univ, Dept Comp Sci, 4400 Massachusetts Ave NW, Washington, DC 20016 USA
关键词
Ensemble learning; Deep learning; Stock market analysis; Time series; PRICE PREDICTION; LSTM;
D O I
10.1007/s10844-023-00804-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Time series forecasting models are gaining traction in many real-world domains as valuable decision support tools. Stock market analysis is a challenging domain, characterized by a complex multi-variate and time-evolving nature, with high volatility, and multiple correlations with exogenous factors. Autoregressive, machine learning, and deep learning models for temporal data have been adopted thus far to solve this task. However, they are usually limited to the analysis of a single data source or modality, and do not collectively deal with all the inherent challenges and complexities presented by stock market data. In this paper, inspired by the promising learning capabilities of hybrid ensemble methods, we propose a novel stacking ensemble approach for stock market prediction that jointly considers news headlines, multi-variate time series data, and multiple base models as predictors. By taking multiple factors into consideration, our model is able to learn historical patterns leveraging multiple data sources and models. Our experiments showcase the ability of our model to outperform popular baselines on next-day stock market trend prediction. A portfolio analysis reveals that our method is also able to yield potential gains or capital preservation capabilities when its predictions are exploited for trading decisions.
引用
收藏
页码:27 / 56
页数:30
相关论文
共 48 条
[1]   Stock Price Prediction Using the ARIMA Model [J].
Adebiyi, Ayodele A. ;
Adewumi, Aderemi O. ;
Ayo, Charles K. .
2014 UKSIM-AMSS 16TH INTERNATIONAL CONFERENCE ON COMPUTER MODELLING AND SIMULATION (UKSIM), 2014, :106-112
[2]   Forecasting the Risk Factor of Frontier Markets: A Novel Stacking Ensemble of Neural Network Approach [J].
Akter, Mst Shapna ;
Shahriar, Hossain ;
Chowdhury, Reaz ;
Mahdy, M. R. C. .
FUTURE INTERNET, 2022, 14 (09)
[3]   The Predictability of the Amman Stock Exchange using the Univariate Autoregressive Integrated Moving Average (ARIMA) Model [J].
Al-Shiab, Mohammad .
JOURNAL OF ECONOMIC AND ADMINISTRATIVE SCIENCES, 2006, 22 (02) :17-35
[4]  
Althelaya KA, 2018, INT CONF INFORM COMM, P151, DOI 10.1109/IACS.2018.8355458
[5]  
[Anonymous], 2008, Ensemble Learning, P758
[6]   Multi-Horizon Air Pollution Forecasting with Deep Neural Networks [J].
Arsov, Mirche ;
Zdravevski, Eftim ;
Lameski, Petre ;
Corizzo, Roberto ;
Koteli, Nikola ;
Gramatikov, Sasho ;
Mitreski, Kosta ;
Trajkovik, Vladimir .
SENSORS, 2021, 21 (04) :1-18
[7]   LSTM based decision support system for swing trading in stock market [J].
Banik, Shouvik ;
Sharma, Nonita ;
Mangla, Monika ;
Shitharth, S. ;
Mohanty, Sachi Nandan .
KNOWLEDGE-BASED SYSTEMS, 2022, 239
[8]  
Barbaglia Luca, 2021, Mining Data for Financial Applications. 5th ECML PKDD Workshop, MIDAS 2020. Revised Selected Papers. Lecture Notes in Artificial Intelligence (Subseries of LNCS) (LNAI 12591), P135, DOI 10.1007/978-3-030-66981-2_11
[9]   LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT [J].
BENGIO, Y ;
SIMARD, P ;
FRASCONI, P .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02) :157-166
[10]   Predicting stock market index using LSTM [J].
Bhandari, Hum Nath ;
Rimal, Binod ;
Pokhrel, Nawa Raj ;
Rimal, Ramchandra ;
Dahal, Keshab R. ;
Khatri, Rajendra K. C. .
MACHINE LEARNING WITH APPLICATIONS, 2022, 9