A Multi-Agent Deep Constrained Q-Learning Method for Smart Building Energy Management Under Uncertainties

被引:0
作者
Saberi, Hossein [1 ,2 ]
Zhang, Cuo [3 ]
Dong, Zhao Yang [4 ]
机构
[1] Univ New South Wales, Sch Photovolta & Renewable Energy Engn, Sydney, NSW 2052, Australia
[2] Univ New South Wales, Sch Elect Engn & Telecommun, Sydney, NSW 2052, Australia
[3] Univ Sydney, Sch Elect & Comp Engn, Sydney, NSW 2006, Australia
[4] City Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China
基金
澳大利亚研究理事会;
关键词
Water heating; Uncertainty; Costs; Smart buildings; HVAC; Resistance heating; Q-learning; Data-driven optimization; deep reinforcement learning; constrained Q-learning; building energy management system; uncertainty; DEMAND RESPONSE; REINFORCEMENT; COORDINATION; LOAD;
D O I
10.1109/TSG.2024.3386896
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Data-driven energy management with flexible appliances in smart buildings is a key towards power system operational intelligence. However, the low efficiency of existing deep reinforcement learning (DRL) methods in terms of optimization and computational performance, caused by reward shaping, large neural networks, system-wide constraints and reward allocation of photovoltaic power generation, signifies the need for new system-specific DRL methods. To address these challenges, this paper proposes a multi-agent deep constrained Q-learning method to obtain online optimal solutions for smart building energy management in presence of various uncertainties. The proposed method minimizes daily energy cost via real-time adjustment of flexible appliances, and addressing impacts of the uncertainties. A deep constrained Q-learning algorithm is developed to effectively avoid reward shaping. By adopting multi-layer perception to estimate thermodynamics and electric vehicle charging states, and developing appliance-specific logic, it is novel to calculate the joint safe action space of all appliances during the training process. A multi-agent approach is developed to address the system-wide constraints and the reward allocation, directly in the Q-update, where hyper-parameters of individual agents are tuned separately. Numerical simulation results verify the high efficiency of the proposed method in daily energy cost minimization and online energy management.
引用
收藏
页码:4649 / 4661
页数:13
相关论文
共 50 条
  • [31] Consensus of discrete-time multi-agent system based on Q-learning
    Zhu Z.-B.
    Wang F.-Y.
    Yin Y.-H.
    Liu Z.-X.
    Chen Z.-Q.
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2021, 38 (07): : 997 - 1005
  • [32] Multi-agent Q-Learning control of spacecraft formation flying reconfiguration trajectories
    Kankashvar, Mohammadrasoul
    Bolandi, Hossein
    Mozayani, Nasser
    ADVANCES IN SPACE RESEARCH, 2023, 71 (03) : 1627 - 1643
  • [33] Q-LEARNING BY THE nth STEP STATE AND MULTI-AGENT NEGOTIATION IN UNKNOWN ENVIRONMENT
    Job, Josip
    Jovic, Franjo
    Livada, Caslav
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2012, 19 (03): : 529 - 534
  • [34] Distributed dynamic spectrum access through multi-agent deep recurrent Q-learning in cognitive radio network
    Giri, Manish Kumar
    Majumder, Saikat
    PHYSICAL COMMUNICATION, 2023, 58
  • [35] Robust H∞ Output Consensus in Heterogeneous Multi-agent Discrete-Time Systems Using Q-Learning Algorithm
    Valadbeigi, Amir Parviz
    Soltanian, Farzad
    Shasadeghi, Mokhtar
    IRANIAN JOURNAL OF SCIENCE AND TECHNOLOGY-TRANSACTIONS OF ELECTRICAL ENGINEERING, 2025, : 251 - 263
  • [36] Deep Recurrent Learning versus Q-Learning for Energy Management Systems in Next Generation Network
    Dridi, Aicha
    Boucetta, Cherifa
    Moungla, Hassine
    Afifi, Hossam
    2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,
  • [37] A Dynamic Building Method of Mobile Agent Migrating Path Based on Q-Learning
    Cheng, Yuan-cai
    Wang, Xiao-lin
    2014 IEEE 7TH JOINT INTERNATIONAL INFORMATION TECHNOLOGY AND ARTIFICIAL INTELLIGENCE CONFERENCE (ITAIC), 2014, : 270 - 275
  • [38] Decentralized Multi Agent Deep Reinforcement Q-Learning for Intelligent Traffic Controller
    Thamilselvam, B.
    Kalyanasundaram, Subrahmanyam
    Rao, M. V. Panduranga
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2023, PT I, 2023, 675 : 45 - 56
  • [39] Multi-agent deep reinforcement learning based demand response for discrete manufacturing systems energy management
    Lu, Renzhi
    Li, Yi-Chang
    Li, Yuting
    Jiang, Junhui
    Ding, Yuemin
    APPLIED ENERGY, 2020, 276
  • [40] A Real-Time Energy Management Strategy of Flexible Smart Traction Power Supply System Based on Deep Q-Learning
    Ying, Yichen
    Tian, Zhongbei
    Wu, Mingli
    Liu, Qiujiang
    Tricoli, Pietro
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (08) : 8938 - 8948