A Learning-Based Model Predictive Control Strategy for Home Energy Management Systems

被引:2
作者
Cai, Wenqi [1 ]
Sawant, Shambhuraj [1 ]
Reinhardt, Dirk [1 ]
Rastegarpour, Soroush [2 ]
Gros, Sebastien [1 ]
机构
[1] Norwegian Univ Sci & Technol NTNU, Dept Engn Cybernet, N-7013 Trondheim, Norway
[2] ABB Corp Res, Dept Automat & Control, S-72226 Vasteras, Sweden
关键词
Model predictive control (MPC); reinforcement learning (RL); home energy management system (HEMS); inaccurate model; system uncertainties; HEAT-PUMP;
D O I
10.1109/ACCESS.2023.3346324
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a model predictive control (MPC)-based reinforcement learning (RL) approach for a home energy management system (HEMS). The house consists of an air-to-water heat pump connected to a hot water tank that supplies thermal energy to a water-based floor heating system. Additionally, it includes a photovoltaic (PV) array and a battery storage system. The HEMS is supposed to exploit the house thermal inertia and battery storage to shift demand from peak hours to off-peak periods and earn benefits by selling excess energy to the utility grid during periods of high electricity prices. However, designing such a HEMS is challenging because the discrepancies due to model mismatch make erroneous predictions of the system dynamics, leading to a non-optimal decision making. Besides, uncertainties in the house thermodynamics, misprediction in the forecasting of PV generation, outdoor temperature, and user load demand make the problem more challenging. We solve this issue by approximating the optimal policy by a parameterized MPC scheme and updating the parameters via a compatible delayed deterministic actor-critic (with gradient Q-learning critic, i.e., CDDAC-GQ) algorithm. Simulation results show that the proposed MPC-based RL HEMS can effectively deliver a policy that satisfies both indoor thermal comfort and economic costs even in the case of inaccurate model and system uncertainties. Furthermore, we conduct a thorough comparison between the CDDAC-GQ algorithm and the conventional twin delayed deep deterministic policy gradient (TD3) algorithm, the results of which affirm the efficacy of our proposed method in addressing complex HEMS problems.
引用
收藏
页码:145264 / 145280
页数:17
相关论文
共 60 条
  • [1] Artificial neural network (ANN) based model predictive control (MPC) and optimization of HVAC systems: A state of the art review and case study of a residential HVAC system
    Afram, Abdul
    Janabi-Sharifi, Farrokh
    Fung, Alan S.
    Raahemifar, Kaamran
    [J]. ENERGY AND BUILDINGS, 2017, 141 : 96 - 113
  • [2] Alrumayh O, 2015, IEEE ELECTR POW ENER, P152, DOI 10.1109/EPEC.2015.7379942
  • [3] Automated Demand Response From Home Energy Management System Under Dynamic Pricing and Power and Comfort Constraints
    Althaher, Sereen
    Mancarella, Pierluigi
    Mutale, Joseph
    [J]. IEEE TRANSACTIONS ON SMART GRID, 2015, 6 (04) : 1874 - 1883
  • [4] DRL-HEMS: Deep Reinforcement Learning Agent for Demand Response in Home Energy Management Systems Considering Customers and Operators Perspectives
    Amer, Aya A.
    Shaban, Khaled
    Massoud, Ahmed M.
    [J]. IEEE TRANSACTIONS ON SMART GRID, 2023, 14 (01) : 239 - 250
  • [5] CasADi: a software framework for nonlinear optimization and optimal control
    Andersson, Joel A. E.
    Gillis, Joris
    Horn, Greg
    Rawlings, James B.
    Diehl, Moritz
    [J]. MATHEMATICAL PROGRAMMING COMPUTATION, 2019, 11 (01) : 1 - 36
  • [6] Home energy management under realistic and uncertain conditions: A comparison of heuristic, deterministic, and stochastic control methods
    Blonsky, Michael
    McKenna, Killian
    Maguire, Jeff
    Vincent, Tyrone
    [J]. APPLIED ENERGY, 2022, 325
  • [7] Design of Ensemble Forecasting Models for Home Energy Management Systems
    Bot, Karol
    Santos, Samira
    Laouali, Inoussa
    Ruano, Antonio
    Ruano, Maria da Graca
    [J]. ENERGIES, 2021, 14 (22)
  • [8] Energy management in residential microgrid using model predictive control-based reinforcement learning and Shapley value
    Cai, Wenqi
    Kordabad, Arash Bahari
    Gros, Sebastien
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 119
  • [9] Optimal Management of the Peak Power Penalty for Smart Grids Using MPC-based Reinforcement Learning
    Cai, Wenqi
    Esfahani, Hossein N.
    Kordabad, Arash B.
    Gros, Sebastien
    [J]. 2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 6365 - 6370
  • [10] MPC-based Reinforcement Learning for a Simplified Freight Mission of Autonomous Surface Vehicles
    Cai, Wenqi
    Kordabad, Arash B.
    Esfahani, Hossein N.
    Lekkas, Anastasios M.
    Gros, Sebastien
    [J]. 2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 2990 - 2995