Stock Price Prediction Using a Multivariate Multistep LSTM: A Sentiment and Public Engagement Analysis Model

被引:6
作者
Aasi, Bipin [1 ]
Imtiaz, Syeda Aniqa [1 ]
Qadeer, Hamzah Arif [1 ]
Singarajah, Magdalean [1 ]
Kashef, Rasha [1 ]
机构
[1] Ryerson Univ, Elect Comp & Biomed Engn Dept, Toronto, ON, Canada
来源
2021 IEEE INTERNATIONAL IOT, ELECTRONICS AND MECHATRONICS CONFERENCE (IEMTRONICS) | 2021年
关键词
forecasting; stock market; Tweets; Google; sentiment; LSTM; IoT Analytics;
D O I
10.1109/IEMTRONICS52119.2021.9422526
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The impact of many factors on stock price has made the prediction of the stock market a problematic and highly complicated task to achieve. IoT analytics has enabled predictive analysis concerning the stock market, with internet search trends, reactions to current events, Twitter data, and historical stock returns as input data. Although inconsistencies remain as to which data sources are deemed most adequate, data preprocessing techniques have successfully overcome data integrity issues and unstructured data formats in specific applications. Additionally, advancements in computational power and machine learning technologies have led to the ability to handle tremendous amounts of information, accompanied by the growth of interest in this specific domain. In this paper, a Multivariate Multistep Output Long-Short-Term-Memory (MMLSTM) model is proposed to provide a one-week prediction on the stock close value for the technology company, "Apple Inc." with the stock name "AAPL". A large variety of data sources enabled by IoT platforms have been employed to model the impact of public sentiment and engagement on the closing price of this particular stock by looking at Google Search Trends, e-News headlines, and Tweets involving AAPL and its products. The proposed MMLSTM has improved the Mean Square Error (MSE) of up to 65% compared to ARIMA and Random Forest models. In addition, the proposed MMLSTM has outperformed most of the LSTM models introduced in the literature.
引用
收藏
页码:161 / 168
页数:8
相关论文
共 33 条
[1]  
Armstrong J. S., 1993, Long Range Planning, V26, P150, DOI [10.1016/0024-6301(93)90280-s, DOI 10.1016/0024-6301(93)90280-S]
[2]  
Bakar N. A., 2017, INT J ADV ENG RES SC, V4, P130, DOI [DOI 10.22161/IJAERS.4.11.20, 10.22161/ijaers.4.11.20]
[3]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[4]  
Brownlee J., 2019, A Gentle Introduction to Cross-Entropy for Machine Learning
[5]   Comparison of direct and iterative artificial neural network forecast approaches in multi-periodic time series forecasting [J].
Hamzacebi, Coskun ;
Akay, Diyar ;
Kutay, Fevzi .
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (02) :3839-3844
[6]  
Huang M., 2019, EMPIR ECON, V59, P2821
[7]  
Hutto C., 2014, Proceedings of the International AAAI Conference on Web Social Media, V8, P216, DOI DOI 10.1609/ICWSM.V8I1.14550
[9]   Bitcoin Network Mechanics: Forecasting the BTC Closing Price Using Vector Auto-Regression Models Based on Endogenous and Exogenous Feature Variables [J].
Ibrahim, Ahmed ;
Kashef, Rasha ;
Li, Menglu ;
Valencia, Esteban ;
Huang, Eric .
JOURNAL OF RISK AND FINANCIAL MANAGEMENT, 2020, 13 (09)
[10]   Effects of the validation set on stock returns forecasting [J].
Jerez, Tomas ;
Kristjanpoller, Werner .
EXPERT SYSTEMS WITH APPLICATIONS, 2020, 150