Forecasting crude oil price with a new hybrid approach and multi-source data

被引:46
作者
Yang, Yifan [1 ]
Guo, Ju'e [1 ]
Sun, Shaolong [1 ]
Li, Yixin [1 ]
机构
[1] Xi An Jiao Tong Univ, Sch Management, Xian 710049, Peoples R China
基金
中国国家自然科学基金;
关键词
Crude oil price forecasting; GSVI data; Kernel extreme learning machine; Herd behavior; Divide and conquer; COMPONENT ANALYSIS; MIDAS TOUCH; GOOGLE DATA; SEARCH; PREDICTION; MODEL; VOLATILITY; NETWORK; MACHINE;
D O I
10.1016/j.engappai.2021.104217
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Faced with the growing research toward crude oil price fluctuations influential factors following the accelerated development of Internet technology, accessible data such as Google search volume index (GSVI) are increasingly quantified and incorporated into forecasting approaches. In this study, we apply multi-scale data that including both traditional economic data and GSVI data reflecting macro and micro mechanisms affecting crude oil price respectively, so as to reduce the forecasting deviation and improve the forecasting accuracy at source. In addition, a new hybrid approach: K-means+KPCA+KELM based on "divide and conquer'' strategy is proposed for deeply exploring the information of above multi-data so that improve monthly crude oil price forecasting accuracy. Empirical results can be analyzed from data and method levels. At the data level, GSVI data perform better than economic data in level forecasting accuracy but with opposite performance in directional forecasting accuracy because of "Herd Behavior'', while hybrid data combined their advantages and obtain best forecasting performance in both level and directional accuracy. At the method level, the approaches with "divide and conquer'' strategy gain a better forecasting performance, which demonstrates that "divide and conquer'' strategy can effectively improve the forecasting performance.
引用
收藏
页数:10
相关论文
共 58 条
[11]   Market efficiency, long-term returns, and behavioral finance [J].
Fama, EF .
JOURNAL OF FINANCIAL ECONOMICS, 1998, 49 (03) :283-306
[12]   A hybrid Bayesian-network proposition for forecasting the crude oil price [J].
Fazelabdolabadi, Babak .
FINANCIAL INNOVATION, 2019, 5 (01)
[13]   Tutorial on practical tips of the most influential data preprocessing algorithms in data mining [J].
Garcia, Salvador ;
Luengo, Julian ;
Herrera, Francisco .
KNOWLEDGE-BASED SYSTEMS, 2016, 98 :1-29
[14]   Machine learning in energy economics and finance: A review [J].
Ghoddusi, Hamed ;
Creamer, German G. ;
Rafizadeh, Nima .
ENERGY ECONOMICS, 2019, 81 :709-727
[15]   Detecting influenza epidemics using search engine query data [J].
Ginsberg, Jeremy ;
Mohebbi, Matthew H. ;
Patel, Rajan S. ;
Brammer, Lynnette ;
Smolinski, Mark S. ;
Brilliant, Larry .
NATURE, 2009, 457 (7232) :1012-U4
[16]   Google data in bridge equation models for German GDP [J].
Goetz, Thomas B. ;
Knetsch, Thomas A. .
INTERNATIONAL JOURNAL OF FORECASTING, 2019, 35 (01) :45-66
[17]   How does market concern derived from the Internet affect oil prices? [J].
Guo, Jian-Feng ;
Ji, Qiang .
APPLIED ENERGY, 2013, 112 :1536-1543
[18]   Hybrid structures in time series modeling and forecasting: A review [J].
Hajirahimi, Zahra ;
Khashei, Mehdi .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2019, 86 :83-106
[19]   Can investor attention predict oil prices? [J].
Han, Liyan ;
Lv, Qiuna ;
Yin, Libo .
ENERGY ECONOMICS, 2017, 66 :547-558
[20]   A nonparametric GARCH model of crude oil price return volatility [J].
Hou, Aijun ;
Suardi, Sandy .
ENERGY ECONOMICS, 2012, 34 (02) :618-626