Public Sentiment Analysis in Twitter Data for Prediction of A Company's Stock Price Movements

被引:54
作者
Bing, Li [1 ]
Chan, Keith C. C. [1 ]
Ou, Carol [2 ]
机构
[1] Hong Kong Polytech Univ, Dept Comp, Kowloon, Hong Kong, Peoples R China
[2] Tilburg Univ, Tilburg Sch Econ & Management, Dept Management, NL-5000 LE Tilburg, Netherlands
来源
2014 IEEE 11TH INTERNATIONAL CONFERENCE ON E-BUSINESS ENGINEERING (ICEBE) | 2014年
关键词
social media; Twitter; stock market; data mining;
D O I
10.1109/ICEBE.2014.47
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
There has recently been some effort to mine social media for public sentiment analysis. Studies have suggested that public emotions shown through Tweeter may well be correlated with the Dow Jones Industrial Average. However, can public sentiment be analyzed to predict the movements of the stock price of a particular company? If so, is it possible for the stock price of one company to be more predictable than that of another company? Is there a particular kind of companies whose stock price are more predictable based on analyzing public sentiments as reflected in Twitter data? In this article, we propose a method to mine Twitter data for answers to these questions. Specifically, we propose to use a data mining algorithm to determine if the price of a selection of 30 companies listed in NASDAQ and the New York Stock Exchange can actually be predicted by the given 15 million records of tweets (i.e., Twitter messages). We do so by extracting ambiguous textual tweet data through NLP techniques to define public sentiment, then make use of a data mining technique to discover patterns between public sentiment and real stock price movements. With the proposed algorithm, we manage to discover that it is possible for the stock price of some companies to be predicted with an average accuracy as high as 76.12%. In this paper, we describe the data mining algorithm that we use and discuss the key findings in relation to the questions posed.
引用
收藏
页码:232 / 239
页数:8
相关论文
共 15 条
[1]  
[Anonymous], J AM SOC INFORM SCI
[2]  
[Anonymous], MINING THOUGHT STREA
[3]  
Asur S., 2010, CORR
[4]   Twitter mood predicts the stock market [J].
Bollen, Johan ;
Mao, Huina ;
Zeng, Xiaojun .
JOURNAL OF COMPUTATIONAL SCIENCE, 2011, 2 (01) :1-8
[5]   LEARNING SEQUENTIAL PATTERNS FOR PROBABILISTIC INDUCTIVE PREDICTION [J].
CHAN, KCC ;
WONG, AKC ;
CHIU, DKY .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1994, 24 (10) :1532-1547
[6]  
Gruhl D., 2005, SIGKDD C KNOWL DISC
[7]  
Joshi M., 2010, MOVIE REV REV EXPT T
[8]  
Joshi M., 2010, P NAACL HLT
[9]  
Leskovec J., 2006, P 7 ACM C EL COMM
[10]   Forecasting stock indices: a comparison of classification and level estimation models [J].
Leung, MT ;
Daouk, H ;
Chen, AS .
INTERNATIONAL JOURNAL OF FORECASTING, 2000, 16 (02) :173-190