A Learning-Based Model Predictive Control Strategy for Home Energy Management Systems

被引：2

作者：

Cai, Wenqi ^{[1
]}

Sawant, Shambhuraj ^{[1
]}

Reinhardt, Dirk ^{[1
]}

Rastegarpour, Soroush ^{[2
]}

Gros, Sebastien ^{[1
]}

机构：

[1] Norwegian Univ Sci & Technol NTNU, Dept Engn Cybernet, N-7013 Trondheim, Norway

[2] ABB Corp Res, Dept Automat & Control, S-72226 Vasteras, Sweden

来源：

IEEE ACCESS | 2023年 / 11卷

关键词：

Model predictive control (MPC); reinforcement learning (RL); home energy management system (HEMS); inaccurate model; system uncertainties; HEAT-PUMP;

D O I：

10.1109/ACCESS.2023.3346324

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper presents a model predictive control (MPC)-based reinforcement learning (RL) approach for a home energy management system (HEMS). The house consists of an air-to-water heat pump connected to a hot water tank that supplies thermal energy to a water-based floor heating system. Additionally, it includes a photovoltaic (PV) array and a battery storage system. The HEMS is supposed to exploit the house thermal inertia and battery storage to shift demand from peak hours to off-peak periods and earn benefits by selling excess energy to the utility grid during periods of high electricity prices. However, designing such a HEMS is challenging because the discrepancies due to model mismatch make erroneous predictions of the system dynamics, leading to a non-optimal decision making. Besides, uncertainties in the house thermodynamics, misprediction in the forecasting of PV generation, outdoor temperature, and user load demand make the problem more challenging. We solve this issue by approximating the optimal policy by a parameterized MPC scheme and updating the parameters via a compatible delayed deterministic actor-critic (with gradient Q-learning critic, i.e., CDDAC-GQ) algorithm. Simulation results show that the proposed MPC-based RL HEMS can effectively deliver a policy that satisfies both indoor thermal comfort and economic costs even in the case of inaccurate model and system uncertainties. Furthermore, we conduct a thorough comparison between the CDDAC-GQ algorithm and the conventional twin delayed deep deterministic policy gradient (TD3) algorithm, the results of which affirm the efficacy of our proposed method in addressing complex HEMS problems.

引用

页码：145264 / 145280

页数：17

共 60 条

[1] Artificial neural network (ANN) based model predictive control (MPC) and optimization of HVAC systems: A state of the art review and case study of a residential HVAC system
Afram, Abdul
Janabi-Sharifi, Farrokh
Fung, Alan S.
Raahemifar, Kaamran
[J]. ENERGY AND BUILDINGS, 2017, 141 : 96 - 113
[2] Alrumayh O, 2015, IEEE ELECTR POW ENER, P152, DOI 10.1109/EPEC.2015.7379942
[3] Automated Demand Response From Home Energy Management System Under Dynamic Pricing and Power and Comfort Constraints
Althaher, Sereen
Mancarella, Pierluigi
Mutale, Joseph
[J]. IEEE TRANSACTIONS ON SMART GRID, 2015, 6 (04) : 1874 - 1883
[4] DRL-HEMS: Deep Reinforcement Learning Agent for Demand Response in Home Energy Management Systems Considering Customers and Operators Perspectives
Amer, Aya A.
Shaban, Khaled
Massoud, Ahmed M.
[J]. IEEE TRANSACTIONS ON SMART GRID, 2023, 14 (01) : 239 - 250
[5] CasADi: a software framework for nonlinear optimization and optimal control
Andersson, Joel A. E.
Gillis, Joris
Horn, Greg
Rawlings, James B.
Diehl, Moritz
[J]. MATHEMATICAL PROGRAMMING COMPUTATION, 2019, 11 (01) : 1 - 36
[6] Home energy management under realistic and uncertain conditions: A comparison of heuristic, deterministic, and stochastic control methods
Blonsky, Michael
McKenna, Killian
Maguire, Jeff
Vincent, Tyrone
[J]. APPLIED ENERGY, 2022, 325
[7] Design of Ensemble Forecasting Models for Home Energy Management Systems
Bot, Karol
Santos, Samira
Laouali, Inoussa
Ruano, Antonio
Ruano, Maria da Graca
[J]. ENERGIES, 2021, 14 (22)
[8] Energy management in residential microgrid using model predictive control-based reinforcement learning and Shapley value
Cai, Wenqi
Kordabad, Arash Bahari
Gros, Sebastien
[J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 119
[9] Optimal Management of the Peak Power Penalty for Smart Grids Using MPC-based Reinforcement Learning
Cai, Wenqi
Esfahani, Hossein N.
Kordabad, Arash B.
Gros, Sebastien
[J]. 2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 6365 - 6370
[10] MPC-based Reinforcement Learning for a Simplified Freight Mission of Autonomous Surface Vehicles
Cai, Wenqi
Kordabad, Arash B.
Esfahani, Hossein N.
Lekkas, Anastasios M.
Gros, Sebastien
[J]. 2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 2990 - 2995

← 1 2 3 4 5 6 →