Predicting Stock Market Movements with Social Media and Machine Learning

被引:2
作者
Koukaras, Paraskevas [1 ]
Tsichli, Vasiliki [1 ]
Tjortjis, Christos [1 ]
机构
[1] Int Hellen Univ, Sch Sci & Technol, 14th Km Thessaloniki N Moudania, Thessaloniki 57001, Greece
来源
PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS AND TECHNOLOGIES (WEBIST) | 2021年
关键词
Social Media; Prediction; Machine Learning; Data Science; Stocks;
D O I
10.5220/0010712600003058
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Microblogging data analysis and sentiment extraction has become a popular approach for market prediction. However, this kind of data contain noise and it is difficult to distinguish truly valid information. In this work we collected 782.459 tweets starting from 2018/11/01 until 2019/31/07. For each day, we create a graph (271 graphs in total) describing users and their followers. We utilize each graph to obtain a PageRank score which is multiplied with sentiment data. Findings indicate that using an importance-based measure, such as PageRank, can improve the scoring ability of the applied prediction models. This approach is validated utilizing three datasets (PageRank, economic and sentiment). On average, the PageRank dataset achieved a lower mean squared error than the economic dataset and the sentiment dataset. Finally, we tested multiple machine learning models, showing that XGBoost is the best model, with the random forest being the second best and LSTM being the worst.
引用
收藏
页码:436 / 443
页数:8
相关论文
共 31 条
  • [11] Hagberg AA, 2008, EXPLORING NETWORK ST, P11, DOI DOI 10.1016/J.JELECTROCARD.2010.09.003
  • [12] Combining bag-of-words and sentiment features of annual reports to predict abnormal stock returns
    Hajek, Petr
    [J]. NEURAL COMPUTING & APPLICATIONS, 2018, 29 (07) : 343 - 358
  • [13] Hasan AA, 2018, INT CONF ELECTRO INF, P23, DOI 10.1109/EIT.2018.8500292
  • [14] Hochreiter S., 1991, DIPLOMA TU, V91
  • [15] Hutto C, 2014, 8 INT C WEBL SOC MED, DOI DOI 10.1609/ICWSM.V8I1.14550
  • [16] Koukaras P., 2019, Machine Learning Paradigms. Learning and Analytics in Intelligent Systems, P401
  • [17] Koukaras P., 2019, COMPUTING, P1
  • [18] Kuepper J., 2019, TIMING TRADES COMMOD
  • [19] FOUNDATIONS OF PORTFOLIO THEORY
    MARKOWITZ, HM
    [J]. JOURNAL OF FINANCE, 1991, 46 (02) : 469 - 477
  • [20] Mitchell C., 2019, AROON OSCILLATOR DEF