Machine learning and oil price point and density forecasting

被引:28
作者
Costa, Alexandre Bonnet R. [1 ]
Ferreira, Pedro Cavalcanti G. [2 ]
Gaglianone, Wagner P. [3 ]
Guillen, Osmani Teixeira C. [4 ,5 ]
Issler, Joao Victor [2 ]
Lin, Yihao [2 ]
机构
[1] Petrobras SA, Corp Risks, Rio De Janeiro, Brazil
[2] Getulio Vargas Fdn, Brazilian Sch Econ & Finance FGV EPGE, Praia Botafogo 190,S1104, BR-22253900 Rio De Janeiro, RJ, Brazil
[3] Banco Cent Brasil, Res Dept, Brasilia, DF, Brazil
[4] Banco Cent Brasil, Open Market Operat Dept, Brasilia, DF, Brazil
[5] Ibmec RJ, Rio De Janeiro, RJ, Brazil
关键词
Machine learning; Commodity prices; Forecasting; FUTURES PRICES; REAL PRICE; BIG DATA; COINTEGRATION; REGRESSION; MODEL; SPOT; US; PREDICTION; SELECTION;
D O I
10.1016/j.eneco.2021.105494
中图分类号
F [经济];
学科分类号
02 ;
摘要
The purpose of this paper is to explore machine learning techniques to forecast the oil price. In the era of big data, we investigate whether new automated tools can improve over traditional approaches in terms of forecast accuracy. Oil price point and density forecasts are built from 23 methods, including regression trees (random forest, quantile regression forest, xgboost), regularization procedures (elastic net, lasso, ridge), standard econometric models and forecast combinations, besides the structural factor model of Schwartz and Smith (2000). The database contains 315 macroeconomic and financial variables, used to build high-dimensional models. To evaluate the predictive power of each method, an extensive pseudo out-of-sample forecasting exercise is built, in monthly and quarterly frequencies, with horizons from one month up to five years. Overall, the results indicate a good performance of the machine learning methods in the short-run. Up to six months, lasso-based models, oil future prices, VECM and the Schwartz-Smith model provide the best forecasts. At longer horizons, forecast combinations also become relevant. In several cases, the accuracy gains in respect to the random walk forecast are statistically significant and reach two-digit figures, in percentage terms, using the R2 out-of-sample statistic; an expressive achievement compared to the previous literature.
引用
收藏
页数:21
相关论文
共 85 条
[1]   WHAT DRIVES OIL PRICES? EMERGING VERSUS DEVELOPED ECONOMIES [J].
Aastveit, Knut Are ;
Bjornland, Hilde C. ;
Thorsrud, Leif Anders .
JOURNAL OF APPLIED ECONOMETRICS, 2015, 30 (07) :1013-1028
[2]  
Adolfson M., 2005, FORECASTING PERFORMA
[3]  
Alquist R, 2013, HBK ECON, P427, DOI 10.1016/B978-0-444-53683-9.00008-6
[4]   Permutation importance: a corrected feature importance measure [J].
Altmann, Andre ;
Tolosi, Laura ;
Sander, Oliver ;
Lengauer, Thomas .
BIOINFORMATICS, 2010, 26 (10) :1340-1347
[5]  
[Anonymous], MACH LEARN
[6]  
Araujo G.S., 2020, MACHINE LEARNING MET
[7]   Determining the number of factors in approximate factor models [J].
Bai, JS ;
Ng, S .
ECONOMETRICA, 2002, 70 (01) :191-221
[8]   Forecasting economic time series using targeted predictors [J].
Bai, Jushan ;
Ng, Serena .
JOURNAL OF ECONOMETRICS, 2008, 146 (02) :304-317
[9]  
Baker ScottR., 2015, MEASURING EC POLICY
[10]  
Banbura M, 2013, HBK ECON, P195, DOI 10.1016/B978-0-444-53683-9.00004-9