An ARIMA-LSTM model for predicting volatile agricultural price series with random forest technique

被引:59
作者
Ray, Soumik [1 ]
Lama, Achal [2 ]
Mishra, Pradeep [3 ]
Biswas, Tufleuddin [1 ]
Das, Soumitra Sankar [4 ]
Gurung, Bishal [5 ]
机构
[1] Centurion Univ Technol & Management, Dept Agr Econ & Stat, Paralakhemundi, Odisha, India
[2] ICAR Indian Agr Stat Res Inst, Div Forecasting & Agr Syst Modelling, New Delhi, India
[3] Jawaharlal Nehru Krishi Vishwa Vidyalaya, Coll Agr, Rewa, MP, India
[4] ICFAI Univ, Fac Management & Commerce, Agartala, Tripura, India
[5] North Eastern Hill Univ, Dept Stat, Shillong, India
关键词
Random forest; ARIMA; GARCH; LSTM; Hybrid; Volatile; TIME-SERIES; NEURAL-NETWORK; SELECTION;
D O I
10.1016/j.asoc.2023.110939
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Machine learning mechanism is establishing itself as a promising area for modelling and forecasting complex time series over conventional statistical models. In this article, focus has been made on presenting a machine learning algorithm with special attention to deep learning model in form of a potential alternative to statistical models such as Autoregressive Integrated Moving Average (ARIMA) and ARIMA-Generalised Autoregressive Conditional Heteroscedasticity (GARCH) models. Further, an improved hybrid ARIMA-Long Short-Term Memory (LSTM) model based on the random forest lag selection criterion has been introduced. ARIMA model has been used to estimate the mean effect and the GARCH model is employed with the residuals obtained from the ARIMA model to estimate the volatile behaviour of the series. ARIMA-GARCH models act as superior statistical models over ARIMA models based on the lowest AIC and BIC values. LSTM model is employed on all normalised training data series. After which we built a comparison scenario independently between ARIMA, ARIMA-GARCH, LSTM and ARIMA-LSTM models on forecasting accuracy in terms of the lowest RMSE, MAPE and MASE values. The proposed random forest-based ARIMA-LSTM model proved its superiority over the conventional statistical models with an improvement to the tune of 8-25% for RMSE, 2-28% for MAPE and 2-29% for MASE. The proposed hybrid model has been successfully applied to volatile monthly price indices of pulses namely gram, moong and urad. This piece of work will enrich the literature on machine learning and further intrigue re-searchers to apply it to various other volatile data sets.
引用
收藏
页数:20
相关论文
共 52 条
[1]  
Achal Lama Achal Lama, 2015, Agricultural Economics Research Review, V28, P73, DOI 10.5958/0974-0279.2015.00005.1
[2]   NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION [J].
AKAIKE, H .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) :716-723
[3]   Asymmetric volatility in cryptocurrency markets: New evidence from smooth transition GARCH models [J].
Ben Cheikh, Nidhaleddine ;
Ben Zaied, Younes ;
Chevallier, Julien .
FINANCE RESEARCH LETTERS, 2020, 35
[4]  
Bhardwaj P., 2014, Economic Affairs, V59, P415
[5]   GENERALIZED AUTOREGRESSIVE CONDITIONAL HETEROSKEDASTICITY [J].
BOLLERSLEV, T .
JOURNAL OF ECONOMETRICS, 1986, 31 (03) :307-327
[6]  
Box G.E., 1976, Time series analysis: forecasting and control, V2
[7]  
Cortez P, 2010, IEEE IJCNN
[8]   DISTRIBUTION OF THE ESTIMATORS FOR AUTOREGRESSIVE TIME-SERIES WITH A UNIT ROOT [J].
DICKEY, DA ;
FULLER, WA .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1979, 74 (366) :427-431
[9]   COMPARING PREDICTIVE ACCURACY [J].
DIEBOLD, FX ;
MARIANO, RS .
JOURNAL OF BUSINESS & ECONOMIC STATISTICS, 1995, 13 (03) :253-263
[10]  
Dritsaki C., 2018, International Journal of Energy Economics and Policy (IJEEP), V8, P14