Energy management in residential microgrid using model predictive control-based reinforcement learning and Shapley value

被引:34
作者
Cai, Wenqi [1 ]
Kordabad, Arash Bahari [1 ]
Gros, Sebastien [1 ]
机构
[1] Norwegian Univ Sci & Technol NTNU, Dept Engn Cybernet, Hogskoleringen 1, N-7491 Trondheim, Norway
关键词
Reinforcement learning (RL); Model predictive control (MPC); Microgrid (MG); Energy management(EM); Cooperative coalition game (CCG); Shapley value; Profit distribution; MPC;
D O I
10.1016/j.engappai.2022.105793
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents an Energy Management (EM) strategy for residential microgrid systems using Model Predictive Control (MPC)-based Reinforcement Learning (RL) and Shapley value. We construct a typical residential microgrid system that considers fluctuating spot-market prices, highly uncertain user demand and renewable generation, and collective peak power penalties. To optimize the benefits for all residential prosumers, the EM problem is formulated as a Cooperative Coalition Game (CCG). The objective is to first find an energy trading policy that reduces the collective economic cost (including spot-market cost and peak power cost) of the residential coalition, and then to distribute the profits obtained through cooperation to all residents. An MPC-based RL approach, which compensates for the shortcomings of MPC and RL and benefits from the advantages of both, is proposed to reduce the monthly collective cost despite the system uncertainties. To determine the amount of monthly electricity bill each resident should pay, we transfer the cost distribution problem into a profit distribution problem. Then, the Shapley value approach is applied to equitably distribute the profits (i.e., cost savings) gained through cooperation to all residents based on the weighted average of their respective marginal contributions. Finally, simulations are performed on a three-household microgrid system located in Oslo, Norway, to validate the proposed strategy, where a real-world dataset of April 2020 is used. Simulation results show that the proposed MPC-based RL approach could effectively reduce the long-term economic cost by about 17.5%, and the Shapley value method provides a solution for allocating the collective bills fairly.
引用
收藏
页数:13
相关论文
共 55 条
  • [1] Distributed reinforcement learning energy management approach in multiple residential energy hubs
    Ahrarinouri, Mehdi
    Rastegar, Mohammad
    Karami, Kiana
    Seifi, Ali Reza
    [J]. SUSTAINABLE ENERGY GRIDS & NETWORKS, 2022, 32
  • [2] Efficient energy management for a grid-tied residential microgrid
    Anvari-Moghaddam, Amjad
    Guerrero, Josep M.
    Vasquez, Juan C.
    Monsef, Hassan
    Rahimi-Kian, Ashkan
    [J]. IET GENERATION TRANSMISSION & DISTRIBUTION, 2017, 11 (11) : 2752 - 2761
  • [3] Energy Management of a Residential Heating System Through Deep Reinforcement Learning
    Brandi, Silvio
    Coraci, Davide
    Borello, Davide
    Capozzoli, Alfonso
    [J]. SUSTAINABILITY IN ENERGY AND BUILDINGS 2021, 2022, 263 : 329 - 339
  • [4] Reinforcement learning for control: Performance, stability, and deep approximators
    Busoniu, Lucian
    de Bruin, Tim
    Tolic, Domagoj
    Kober, Jens
    Palunko, Ivana
    [J]. ANNUAL REVIEWS IN CONTROL, 2018, 46 : 8 - 28
  • [5] Cai W., 2021, 2021 60 IEEE C DECIS, P6365
  • [6] MPC-based Reinforcement Learning for a Simplified Freight Mission of Autonomous Surface Vehicles
    Cai, Wenqi
    Kordabad, Arash B.
    Esfahani, Hossein N.
    Lekkas, Anastasios M.
    Gros, Sebastien
    [J]. 2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 2990 - 2995
  • [7] Calculating direct and indirect contributions of players in cooperative games via the multi-linear extension
    Casajus, Andre
    Huettner, Frank
    [J]. ECONOMICS LETTERS, 2018, 164 : 27 - 30
  • [8] Polynomial calculation of the Shapley value based on sampling
    Castro, Javier
    Gomez, Daniel
    Tejada, Juan
    [J]. COMPUTERS & OPERATIONS RESEARCH, 2009, 36 (05) : 1726 - 1730
  • [9] Model-predictive control and reinforcement learning in multi-energy system case studies
    Ceusters, Glenn
    Rodriguez, Roman Cantu
    Garcia, Alberte Bouso
    Franke, Rudiger
    Deconinck, Geert
    Helsen, Lieve
    Nowe, Ann
    Messagie, Maarten
    Camargo, Luis Ramirez
    [J]. APPLIED ENERGY, 2021, 303
  • [10] Chen Y., 2018, ENERGY, V2, pL3