Improving demand-response scheme in smart grids using reinforcement learning

被引:7
作者
Bagherpour, Reza [1 ]
Mozayani, Nasser [1 ]
Badnava, Babak [2 ]
机构
[1] Iran Univ Sci & Technol, Dept Comp Engn, Univ Rd,Hengam St, Tehran 1684613114, Iran
[2] Univ Kansas, Dept Mech Engn, Lawrence, KS 66045 USA
关键词
deep contextual bandit algorithm; demand response; dynamic pricing; smart grid; ELECTRICITY; ALGORITHM;
D O I
10.1002/er.7165
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
With efficient load management, in peak times, customers can reduce their consumption, shift the shiftable loads to proper times or even store the electrical energy in the low prices range. To achieve these effects, the dynamic pricing policy is one of the most effective ways to encourage users to change their consumption patterns. However, because of the uncertainty of electricity consumption, it is a complicated problem to determine an optimal pricing policy. For this purpose, in this study, a deep contextual bandit algorithm is considered to solve this problem, which uses a deep neural network to learns the context and the associated reward. Existing works on dynamic pricing policy used some limited datasets and assumptions, which may be contrary to the nature of the real-world markets. Due to the different demands of the consumers, which change throughout the year, and the existence of multi pricing agents in future markets, it is important to have a good market structure, method, and especially datasets which representative of all possible demands. The simulations were performed in different system models. Results show that the proposed algorithm can improve the system's reliability, reduce energy cost, and control the power system's ramp rate. Especially in multiagent markets with multiobjective costs, we can adjust the pricing model based on the market strategy to satisfy customers or reduce energy consumption.
引用
收藏
页码:21082 / 21095
页数:14
相关论文
共 37 条
[1]   A multi-agent based energy management solution for integrated buildings and microgrid system [J].
Anvari-Moghaddam, Amjad ;
Rahimi-Kian, Ashkan ;
Mirian, Maryam S. ;
Guerrero, Josep M. .
APPLIED ENERGY, 2017, 203 :41-56
[2]  
Auer P., 2002, J. Mach. Learn. Res., DOI DOI 10.4271/610369
[3]   Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems [J].
Bubeck, Sebastien ;
Cesa-Bianchi, Nicolo .
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2012, 5 (01) :1-122
[4]  
Denholm P., 2015, Overgeneration from solar energy in california: a field guide to the duck chart, DOI DOI 10.2172/1226167
[5]  
Dong Y., 2015, J. Power Energy Eng., V3, P240
[6]  
Freud S., 1958, STANDARD EDITION COM, V12, P145
[7]  
Ghasemkhani A, 2018, IEEE INT CONF COMM
[8]  
H. R.-B. of the A. M. Society and undefined, 1952, SOME ASPECTS SEQUENT
[9]   An exploratory analysis of California residential customer response to critical peak pricing of electricity [J].
Herter, Karen ;
McAuliffe, Patrick ;
Rosenfeld, Arthur .
ENERGY, 2007, 32 (01) :25-34
[10]   A genetic algorithm based dynamic pricing for improving bi-directional interactions with reduced power imbalance [J].
Huang, Pei ;
Xu, Tao ;
Sun, Yongjun .
ENERGY AND BUILDINGS, 2019, 199 :275-286