Predicting Stock Market Movements with Social Media and Machine Learning

被引:2
作者
Koukaras, Paraskevas [1 ]
Tsichli, Vasiliki [1 ]
Tjortjis, Christos [1 ]
机构
[1] Int Hellen Univ, Sch Sci & Technol, 14th Km Thessaloniki N Moudania, Thessaloniki 57001, Greece
来源
PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS AND TECHNOLOGIES (WEBIST) | 2021年
关键词
Social Media; Prediction; Machine Learning; Data Science; Stocks;
D O I
10.5220/0010712600003058
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Microblogging data analysis and sentiment extraction has become a popular approach for market prediction. However, this kind of data contain noise and it is difficult to distinguish truly valid information. In this work we collected 782.459 tweets starting from 2018/11/01 until 2019/31/07. For each day, we create a graph (271 graphs in total) describing users and their followers. We utilize each graph to obtain a PageRank score which is multiplied with sentiment data. Findings indicate that using an importance-based measure, such as PageRank, can improve the scoring ability of the applied prediction models. This approach is validated utilizing three datasets (PageRank, economic and sentiment). On average, the PageRank dataset achieved a lower mean squared error than the economic dataset and the sentiment dataset. Finally, we tested multiple machine learning models, showing that XGBoost is the best model, with the random forest being the second best and LSTM being the worst.
引用
收藏
页码:436 / 443
页数:8
相关论文
共 31 条
  • [1] Long short-term memory
    Hochreiter, S
    Schmidhuber, J
    [J]. NEURAL COMPUTATION, 1997, 9 (08) : 1735 - 1780
  • [2] Is all that talk just noise? The information content of Internet stock message boards
    Antweiler, W
    Frank, MZ
    [J]. JOURNAL OF FINANCE, 2004, 59 (03) : 1259 - 1294
  • [3] Belega Daniel, 2019, 2019 IEEE 5th International forum on Research and Technology for Society and Industry (RTSI). Proceedings, P1, DOI 10.1109/RTSI.2019.8895576
  • [4] Blystone D., 2019, OVERBOUGHT OVERSOLD
  • [5] Twitter mood predicts the stock market
    Bollen, Johan
    Mao, Huina
    Zeng, Xiaojun
    [J]. JOURNAL OF COMPUTATIONAL SCIENCE, 2011, 2 (01) : 1 - 8
  • [6] Chakraborty P, 2017, 2017 6TH INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS AND VISION & 2017 7TH INTERNATIONAL SYMPOSIUM IN COMPUTATIONAL MEDICAL AND HEALTH TECHNOLOGY (ICIEV-ISCMHT)
  • [7] Rational herding in financial economics
    Devenow, A
    Welch, I
    [J]. EUROPEAN ECONOMIC REVIEW, 1996, 40 (3-5) : 603 - 615
  • [8] COMMON RISK-FACTORS IN THE RETURNS ON STOCKS AND BONDS
    FAMA, EF
    FRENCH, KR
    [J]. JOURNAL OF FINANCIAL ECONOMICS, 1993, 33 (01) : 3 - 56
  • [9] THE BEHAVIOR OF STOCK-MARKET PRICES
    FAMA, EF
    [J]. JOURNAL OF BUSINESS, 1965, 38 (01) : 34 - 105
  • [10] EFFICIENT CAPITAL MARKETS - REVIEW OF THEORY AND EMPIRICAL WORK
    FAMA, EF
    [J]. JOURNAL OF FINANCE, 1970, 25 (02) : 383 - 423