Using four different online media sources to forecast the crude oil price

Cited by: 50
Authors
Elshendy, Mohammed [1]
Colladon, Andrea Fronzetti [1]
Battistoni, Elisa [1]
Gloor, Peter A. [2]
Affiliations
[1] Univ Roma Tor Vergata, Dept Enterprise Engn, Via Politecn 1, I-00133 Rome, Italy
[2] MIT, MIT Ctr Collect Intelligence, Cambridge, MA 02139 USA
Keywords
Financial forecast; Global Data on Events, Location and Tone; Google Trends; oil price; Twitter; Wikipedia; MARKET; UNEMPLOYMENT; DEMAND; TWEETS
DOI
10.1177/0165551517698298
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This study looks for signals of economic awareness on online social media and tests their significance in economic predictions. The study analyses, over a period of 2 years, the relationship between the West Texas Intermediate daily crude oil price and multiple predictors extracted from Twitter; Google Trends; Wikipedia; and the Global Data on Events, Location and Tone (GDELT) database. Semantic analysis is applied to study the sentiment, emotionality and complexity of the language used. Autoregressive Integrated Moving Average with Explanatory Variable (ARIMAX) models are used to make predictions and to confirm the value of the study variables. Results show that the combined analysis of the four media platforms carries valuable information for financial forecasting. Twitter language complexity, GDELT number of articles and Wikipedia page reads have the highest predictive power. This study also allows a comparison of the foresight abilities of each platform, in terms of how many days ahead a platform can predict a price movement before it happens. In comparison with previous work, more media sources and more dimensions of the interaction and of the language used are combined in a joint analysis.
Pages: 408-421
Page count: 14
Related papers
83 items in total
[1] Ahmed, Rana Abdullah, 2014, American Journal of Applied Sciences, V11, P425, DOI 10.3844/ajassp.2014.425.432
[2] Andrews B., 2013, Building ARIMA and ARIMAX Models for Predicting Long-Term Disability Benefit Application Rates in the Public/Private Sectors. Society of Actuaries
[3] [Anonymous], LECT NOTES COMPUTER
[4] [Anonymous], 2014, ANAL VERBREITUNG INN
[5] [Anonymous], 2016, DESIGNING NETWORKS I
[6] [Anonymous], 2005, International Journal of Knowledge and Systems Sciences
[7] [Anonymous], 2011, Proc. Int. AAAI Conf. Web Soc. Media, DOI 10.1609/ICWSM.V5I1.14171
[8] [Anonymous], 2012, Discourse of Twitter and Social Media: How we Use Language to Create Affiliation on the Web
[9] [Anonymous], 2008, Proceedings of the 17th ACM conference on Information and knowledge management
[10] [Anonymous], 2013, SCI REPORTS