Tracking Multiple Social Media for Stock Market Event Prediction

被引:21
作者
Jin, Fang [1 ]
Wang, Wei [2 ]
Chakraborty, Prithwish [2 ]
Self, Nathan [2 ]
Chen, Feng [3 ]
Ramakrishnan, Naren [2 ]
机构
[1] Texas Tech Univ, Dept Comp Sci, Lubbock, TX 79409 USA
[2] Virginia Tech, Dept Comp Sci, Discovery Analyt Ctr, Blacksburg, VA USA
[3] SUNY Albany, Dept Comp Sci, Albany, NY 12222 USA
来源
ADVANCES IN DATA MINING: APPLICATIONS AND THEORETICAL ASPECTS, ICDM 2017 | 2017年 / 10357卷
关键词
Market prediction; Multiple social media; Features combination; Google trends; Twitter burst; News sentiment;
D O I
10.1007/978-3-319-62701-4_2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The problem of modeling the continuously changing trends in finance markets and generating real-time, meaningful predictions about significant changes in those markets has drawn considerable interest from economists and data scientists alike. In addition to traditional market indicators, growth of varied social media has enabled economists to leverage micro-and real-time indicators about factors possibly influencing the market, such as public emotion, anticipations and behaviors. We propose several specific market related features that can be mined from varied sources such as news, Google search volumes and Twitter. We further investigate the correlation between these features and financial market fluctuations. In this paper, we present a Delta Naive Bayes (DNB) approach to generate prediction about financial markets. We present a detailed prospective analysis of prediction accuracy generated from multiple, combined sources with those generated from a single source. We find that multi-source predictions consistently outperform single-source predictions, even though with some limitations.
引用
收藏
页码:16 / 30
页数:15
相关论文
共 27 条
[1]  
[Anonymous], SCI REPORTS
[2]  
[Anonymous], 2011, J COMPUT SCI-NETH, DOI DOI 10.1016/j.jocs.2010.12.007
[3]  
[Anonymous], 2000, P KDD 2000 WORKSHOP
[4]   Clustering stock market companies via chaotic map synchronization [J].
Basalto, N ;
Bellotti, R ;
De Carlo, F ;
Facchi, P ;
Pascazio, S .
PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2005, 345 (1-2) :196-206
[5]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[6]   Fast unfolding of communities in large networks [J].
Blondel, Vincent D. ;
Guillaume, Jean-Loup ;
Lambiotte, Renaud ;
Lefebvre, Etienne .
JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2008,
[7]   GENERALIZED AUTOREGRESSIVE CONDITIONAL HETEROSKEDASTICITY [J].
BOLLERSLEV, T .
JOURNAL OF ECONOMETRICS, 1986, 31 (03) :307-327
[8]   Social Media-Based Forecasting: A Case Study of Tweets and Stock Prices in the Financial Services Industry [J].
He, Wu ;
Guo, Lin ;
Shen, Jiancheng ;
Akula, Vasudeva .
JOURNAL OF ORGANIZATIONAL AND END USER COMPUTING, 2016, 28 (02) :74-91
[9]   Modeling Mass Protest Adoption in Social Network Communities using Geometric Brownian Motion [J].
Jin, Fang ;
Khandpur, Rupinder Paul ;
Self, Nathan ;
Dougherty, Edward ;
Guo, Sheng ;
Chen, Feng ;
Prakash, B. Aditya ;
Ramakrishnan, Naren .
PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14), 2014, :1660-1669
[10]  
Jin F, 2013, 19TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'13), P1470