A hybrid model based on bidirectional long short-term memory neural network and Catboost for short-term electricity spot price forecasting

被引:40
作者
Zhang, Fan [1 ,2 ]
Fleyeh, Hasan [3 ]
Bales, Chris [2 ]
机构
[1] Dalarna Univ, Dept Microdata Anal, S-79188 Falun, Sweden
[2] Dalarna Univ, Dept Energy Technol, Falun, Sweden
[3] Dalarna Univ, Dept Comp Engn, Falun, Sweden
关键词
Bidirectional long short-term memory neural network; deep learning; electricity price forecasting; machine learning; boosting algorithms; energy market; LSTM; MARKET;
D O I
10.1080/01605682.2020.1843976
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
Electricity price forecasting plays a crucial role in a liberalised electricity market. Generally speaking, long-term electricity price is widely utilised for investment profitability analysis, grid or transmission expansion planning, while medium-term forecasting is important to markets that involve medium-term contracts. Typical applications of medium-term forecasting are risk management, balance sheet calculation, derivative pricing, and bilateral contracting. Short-term electricity price forecasting is essential for market providers to adjust the schedule of production, i.e., balancing consumers' demands and electricity generation. Results from short-term forecasting are utilised by market players to decide the timing of purchasing or selling to maximise profits. Among existing forecasting approaches, neural networks are regarded as the state of art method due to their capability of modelling high non-linearity and complex patterns inside time series data. However, deep neural networks are not studied comprehensively in this field, which represents a good motivation to fill this research gap. In this article, a deep neural network-based hybrid approach is proposed for short-term electricity price forecasting. To be more specific, categorical boosting (Catboost) algorithm is used for feature selection and a bidirectional long short-term memory neural network (BDLSTM) will serve as the main forecasting engine in the proposed method. To evaluate the effectiveness of the proposed approach, 2018 hourly electricity price data from the Nord Pool market are invoked as a case study. Moreover, the performance of the proposed approach is compared with those of multi-layer perception (MLP) neural network, support vector regression (SVR), ensemble tree, ARIMA as well as two recent deep learning-based models, gated recurrent units (GRU) and LSTM models. A real-world dataset of Nord Pool market is used in this study to validate the proposed approach. Mean percentage error (MAPE), root mean square error (RMSE), and mean absolute error (MAE) are used to measure the model performance. Experiment results show that the proposed model achieves lower forecasting errors than other models considered in this study although the proposed model is more time consuming in terms of training and forecasting.
引用
收藏
页码:301 / 325
页数:25
相关论文
共 54 条
[1]   A new prediction strategy for price spike forecasting of day-ahead electricity markets [J].
Amjady, Nima ;
Keynia, Farshid .
APPLIED SOFT COMPUTING, 2011, 11 (06) :4246-4256
[2]   Day-Ahead Deregulated Electricity Market Price Forecasting Using Recurrent Neural Network [J].
Anbazhagan, S. ;
Kumarappan, N. .
IEEE SYSTEMS JOURNAL, 2013, 7 (04) :866-872
[3]  
[Anonymous], 2017, POLITICS NUCL ENERGY
[4]  
[Anonymous], 2015, Understanding LSTM networks
[5]   Exploiting the past and the future in protein secondary structure prediction [J].
Baldi, P ;
Brunak, S ;
Frasconi, P ;
Soda, G ;
Pollastri, G .
BIOINFORMATICS, 1999, 15 (11) :937-946
[6]   LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT [J].
BENGIO, Y ;
SIMARD, P ;
FRASCONI, P .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02) :157-166
[7]  
Berglund S., 2009, PUTTING POLITICS PER
[8]   A medium-term integrated risk management model for a hydrothermal generation company [J].
Cabero, J ;
Baíllo, A ;
Cerisola, S ;
Ventosa, M ;
García-Alcalde, A ;
Perán, F ;
Relaño, G .
IEEE TRANSACTIONS ON POWER SYSTEMS, 2005, 20 (03) :1379-1388
[9]  
Cestnik B., 1990, ECAI 90. Proceedings of the 9th European Conference on Artificial Intelligence, P147
[10]  
Chen T, 2016, ery and Data Mining, V785, P785, DOI [DOI 10.1145/2939672.2939785, 10.1145/2939672.2939785]