A Multi-Agent Deep Constrained Q-Learning Method for Smart Building Energy Management Under Uncertainties

被引:0
|
作者
Saberi, Hossein [1 ,2 ]
Zhang, Cuo [3 ]
Dong, Zhao Yang [4 ]
机构
[1] Univ New South Wales, Sch Photovolta & Renewable Energy Engn, Sydney, NSW 2052, Australia
[2] Univ New South Wales, Sch Elect Engn & Telecommun, Sydney, NSW 2052, Australia
[3] Univ Sydney, Sch Elect & Comp Engn, Sydney, NSW 2006, Australia
[4] City Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China
基金
澳大利亚研究理事会;
关键词
Water heating; Uncertainty; Costs; Smart buildings; HVAC; Resistance heating; Q-learning; Data-driven optimization; deep reinforcement learning; constrained Q-learning; building energy management system; uncertainty; DEMAND RESPONSE; REINFORCEMENT; COORDINATION; LOAD;
D O I
10.1109/TSG.2024.3386896
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Data-driven energy management with flexible appliances in smart buildings is a key towards power system operational intelligence. However, the low efficiency of existing deep reinforcement learning (DRL) methods in terms of optimization and computational performance, caused by reward shaping, large neural networks, system-wide constraints and reward allocation of photovoltaic power generation, signifies the need for new system-specific DRL methods. To address these challenges, this paper proposes a multi-agent deep constrained Q-learning method to obtain online optimal solutions for smart building energy management in presence of various uncertainties. The proposed method minimizes daily energy cost via real-time adjustment of flexible appliances, and addressing impacts of the uncertainties. A deep constrained Q-learning algorithm is developed to effectively avoid reward shaping. By adopting multi-layer perception to estimate thermodynamics and electric vehicle charging states, and developing appliance-specific logic, it is novel to calculate the joint safe action space of all appliances during the training process. A multi-agent approach is developed to address the system-wide constraints and the reward allocation, directly in the Q-update, where hyper-parameters of individual agents are tuned separately. Numerical simulation results verify the high efficiency of the proposed method in daily energy cost minimization and online energy management.
引用
收藏
页码:4649 / 4661
页数:13
相关论文
共 50 条
  • [1] Multi-agent deep reinforcement learning for Smart building energy management with chance constraints
    Deng, Jingchuan
    Wang, Xinsheng
    Meng, Fangang
    ENERGY AND BUILDINGS, 2025, 331
  • [2] Fuzzy Q-Learning for multi-agent decentralized energy management in microgrids
    Kofinas, P.
    Dounis, A., I
    Vouros, G. A.
    APPLIED ENERGY, 2018, 219 : 53 - 67
  • [3] Regularized Softmax Deep Multi-Agent Q-Learning
    Pan, Ling
    Rashid, Tabish
    Peng, Bei
    Huang, Longbo
    Whiteson, Shimon
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [4] Modular Production Control with Multi-Agent Deep Q-Learning
    Gankin, Dennis
    Mayer, Sebastian
    Zinn, Jonas
    Vogel-Heuser, Birgit
    Endisch, Christian
    2021 26TH IEEE INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA), 2021,
  • [5] Q-learning in Multi-Agent Cooperation
    Hwang, Kao-Shing
    Chen, Yu-Jen
    Lin, Tzung-Feng
    2008 IEEE WORKSHOP ON ADVANCED ROBOTICS AND ITS SOCIAL IMPACTS, 2008, : 239 - 244
  • [6] Multi-Agent Advisor Q-Learning
    Subramanian S.G.
    Taylor M.E.
    Larson K.
    Crowley M.
    Journal of Artificial Intelligence Research, 2022, 74 : 1 - 74
  • [7] Multi-Agent Advisor Q-Learning
    Subramanian, Sriram Ganapathi
    Taylor, Matthew E.
    Larson, Kate
    Crowley, Mark
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2022, 74 : 1 - 74
  • [8] Multi-Agent Advisor Q-Learning
    Subramanian, Sriram Ganapathi
    Taylor, Matthew E.
    Larson, Kate
    Crowley, Mark
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 6884 - 6889
  • [9] Multi-Agent Coordination Method Based on Fuzzy Q-Learning
    Peng, Jun
    Liu, Miao
    Wu, Min
    Zhang, Xiaoyong
    Lin, Kuo-Chi
    2008 7TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-23, 2008, : 5411 - +
  • [10] Deep Q-Learning for Decentralized Multi-Agent Inspection of a Tumbling Target
    Aurand, Joshua
    Cutlip, Steven
    Lei, Henry
    Lang, Kendra
    Phillips, Sean
    JOURNAL OF SPACECRAFT AND ROCKETS, 2024, 61 (02) : 341 - 354