A Multi-Agent Deep Constrained Q-Learning Method for Smart Building Energy Management Under Uncertainties

被引：0

作者：

Saberi, Hossein ^{[1
,2
]}

Zhang, Cuo ^{[3
]}

Dong, Zhao Yang ^{[4
]}

机构：

[1] Univ New South Wales, Sch Photovolta & Renewable Energy Engn, Sydney, NSW 2052, Australia

[2] Univ New South Wales, Sch Elect Engn & Telecommun, Sydney, NSW 2052, Australia

[3] Univ Sydney, Sch Elect & Comp Engn, Sydney, NSW 2006, Australia

[4] City Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China

来源：

IEEE TRANSACTIONS ON SMART GRID | 2024年 / 15卷 / 05期

基金：

澳大利亚研究理事会;

关键词：

Water heating; Uncertainty; Costs; Smart buildings; HVAC; Resistance heating; Q-learning; Data-driven optimization; deep reinforcement learning; constrained Q-learning; building energy management system; uncertainty; DEMAND RESPONSE; REINFORCEMENT; COORDINATION; LOAD;

D O I：

10.1109/TSG.2024.3386896

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Data-driven energy management with flexible appliances in smart buildings is a key towards power system operational intelligence. However, the low efficiency of existing deep reinforcement learning (DRL) methods in terms of optimization and computational performance, caused by reward shaping, large neural networks, system-wide constraints and reward allocation of photovoltaic power generation, signifies the need for new system-specific DRL methods. To address these challenges, this paper proposes a multi-agent deep constrained Q-learning method to obtain online optimal solutions for smart building energy management in presence of various uncertainties. The proposed method minimizes daily energy cost via real-time adjustment of flexible appliances, and addressing impacts of the uncertainties. A deep constrained Q-learning algorithm is developed to effectively avoid reward shaping. By adopting multi-layer perception to estimate thermodynamics and electric vehicle charging states, and developing appliance-specific logic, it is novel to calculate the joint safe action space of all appliances during the training process. A multi-agent approach is developed to address the system-wide constraints and the reward allocation, directly in the Q-update, where hyper-parameters of individual agents are tuned separately. Numerical simulation results verify the high efficiency of the proposed method in daily energy cost minimization and online energy management.

引用

页码：4649 / 4661

页数：13

共 50 条

[1] Multi-agent deep reinforcement learning for Smart building energy management with chance constraints
Deng, Jingchuan
Wang, Xinsheng
Meng, Fangang
ENERGY AND BUILDINGS, 2025, 331
[2] Fuzzy Q-Learning for multi-agent decentralized energy management in microgrids
Kofinas, P.
Dounis, A., I
Vouros, G. A.
APPLIED ENERGY, 2018, 219 : 53 - 67
[3] Regularized Softmax Deep Multi-Agent Q-Learning
Pan, Ling
Rashid, Tabish
Peng, Bei
Huang, Longbo
Whiteson, Shimon
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[4] Modular Production Control with Multi-Agent Deep Q-Learning
Gankin, Dennis
Mayer, Sebastian
Zinn, Jonas
Vogel-Heuser, Birgit
Endisch, Christian
2021 26TH IEEE INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA), 2021,
[5] Q-learning in Multi-Agent Cooperation
Hwang, Kao-Shing
Chen, Yu-Jen
Lin, Tzung-Feng
2008 IEEE WORKSHOP ON ADVANCED ROBOTICS AND ITS SOCIAL IMPACTS, 2008, : 239 - 244
[6] Multi-Agent Advisor Q-Learning
Subramanian S.G.
Taylor M.E.
Larson K.
Crowley M.
Journal of Artificial Intelligence Research, 2022, 74 : 1 - 74
[7] Multi-Agent Advisor Q-Learning
Subramanian, Sriram Ganapathi
Taylor, Matthew E.
Larson, Kate
Crowley, Mark
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2022, 74 : 1 - 74
[8] Multi-Agent Advisor Q-Learning
Subramanian, Sriram Ganapathi
Taylor, Matthew E.
Larson, Kate
Crowley, Mark
PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 6884 - 6889
[9] Multi-Agent Coordination Method Based on Fuzzy Q-Learning
Peng, Jun
Liu, Miao
Wu, Min
Zhang, Xiaoyong
Lin, Kuo-Chi
2008 7TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-23, 2008, : 5411 - +
[10] Deep Q-Learning for Decentralized Multi-Agent Inspection of a Tumbling Target
Aurand, Joshua
Cutlip, Steven
Lei, Henry
Lang, Kendra
Phillips, Sean
JOURNAL OF SPACECRAFT AND ROCKETS, 2024, 61 (02) : 341 - 354

← 1 2 3 4 5 →