Optimal Management of the Peak Power Penalty for Smart Grids Using MPC-based Reinforcement Learning

被引:5
|
作者
Cai, Wenqi [1 ]
Esfahani, Hossein N. [1 ]
Kordabad, Arash B. [1 ]
Gros, Sebastien [1 ]
机构
[1] Norwegian Univ Sci & Technol NTNU, Dept Engn Cybernet, Trondheim, Norway
来源
2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC) | 2021年
关键词
DEMAND RESPONSE;
D O I
10.1109/CDC45484.2021.9683333
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The cost of the power distribution infrastructures is driven by the peak power encountered in the system. Therefore, the distribution network operators consider billing consumers behind a common transformer in the function of their peak demand and leave it to the consumers to manage their collective costs. This management problem is, however, not trivial. In this paper, we consider a multi-agent residential smart grid system, where each agent has local renewable energy production and energy storage, and all agents are connected to a local transformer. The objective is to develop an optimal policy that minimizes the economic cost consisting of both the spot-market cost for each consumer and their collective peak-power cost. We propose to use a parametric Model Predictive Control (MPC)-scheme to approximate the optimal policy. The optimality of this policy is limited by its finite horizon and inaccurate forecasts of the local power production-consumption. A Deterministic Policy Gradient (DPG) method is deployed to adjust the MPC parameters and improve the policy. Our simulations show that the proposed MPC-based Reinforcement Learning (RL) method can effectively decrease the long-term economic cost for this smart grid problem.
引用
收藏
页码:6365 / 6370
页数:6
相关论文
共 42 条