Efficient and assured reinforcement learning-based building HVAC control with heterogeneous expert-guided training

被引:0
|
作者
Xu, Shichao [1 ]
Fu, Yangyang [2 ]
Wang, Yixuan [1 ]
Yang, Zhuoran [3 ]
Huang, Chao [4 ]
O'Neill, Zheng [2 ]
Wang, Zhaoran [1 ]
Zhu, Qi [1 ]
机构
[1] Northwestern Univ, McCormick Sch Engn, Evanston, IL 60208 USA
[2] Texas A&M Univ, Dept Mech Engn, College Stn, TX 77843 USA
[3] Yale Univ, Dept Operat Res & Financial Engn, New Haven, CT 06520 USA
[4] UNIV LIVERPOOL, Dept Comp Sci, LIVERPOOL L69 3BX, England
来源
SCIENTIFIC REPORTS | 2025年 / 15卷 / 01期
基金
美国国家科学基金会;
关键词
HVAC control; Reinforcement learning; Deep learning; MODEL-PREDICTIVE CONTROL; ENERGY; SYSTEM;
D O I
10.1038/s41598-025-91326-z
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Building heating, ventilation, and air conditioning (HVAC) systems account for nearly half of building energy consumption and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$20\%$$\end{document} of total energy consumption in the US. Their operation is also crucial for ensuring the physical and mental health of building occupants. Compared with traditional model-based HVAC control methods, the recent model-free deep reinforcement learning (DRL) based methods have shown good performance while do not require the development of detailed and costly physical models. However, these model-free DRL approaches often suffer from long training time to reach a good performance, which is a major obstacle for their practical deployment. In this work, we present a systematic approach to accelerate online reinforcement learning for HVAC control by taking full advantage of the knowledge from domain experts in various forms. Specifically, the algorithm stages include learning expert functions from existing abstract physical models and from historical data via offline reinforcement learning, integrating the expert functions with rule-based guidelines, conducting training guided by the integrated expert function and performing policy initialization from distilled expert function. Moreover, to ensure that the learned DRL-based HVAC controller can effectively keep room temperature within the comfortable range for occupants, we design a runtime shielding framework to reduce the temperature violation rate and incorporate the learned controller into it. Experimental results demonstrate up to 8.8X speedup in DRL training from our approach over previous methods, with low temperature violation rate.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Whole building energy model for HVAC optimal control: A practical framework based on deep reinforcement learning
    Zhang, Zhiang
    Chong, Adrian
    Pan, Yuqi
    Zhang, Chenlu
    Lam, Khee Poh
    ENERGY AND BUILDINGS, 2019, 199 : 472 - 490
  • [22] Prospects and challenges of reinforcement learning- based HVAC control
    Ajifowowe, Iyanu
    Chang, Hojong
    Lee, Chae Seok
    Chang, Seongju
    JOURNAL OF BUILDING ENGINEERING, 2024, 98
  • [23] Development of Reinforcement Learning-based Energy Management Agent for HVAC Facilities and ESS
    Kwon K.-B.
    Hong S.-M.
    Heo J.-H.
    Jung H.
    Park J.-Y.
    Transactions of the Korean Institute of Electrical Engineers, 2022, 71 (10): : 1434 - 1440
  • [24] Satellite fault tolerant attitude control based on expert guided exploration of reinforcement learning agent
    Henna, Hicham
    Toubakh, Houari
    Kafi, Mohamed Redouane
    Gursoy, Oemer
    Sayed-Mouchaweh, Moamar
    Djemai, Mohamed
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2024,
  • [25] Poster Abstract: Data Efficient HVAC Control using Gaussian Process-based Reinforcement Learning
    An, Zhiyu
    Ding, Xianzhong
    Du, Wan
    PROCEEDINGS OF THE 21ST ACM CONFERENCE ON EMBEDDED NETWORKED SENSOR SYSTEMS, SENSYS 2023, 2023, : 538 - 539
  • [26] Bargaining solutions in heterogeneous networks: A reinforcement learning-based approach
    Ebrahimkhani, Atena
    Akhbari, Bahareh
    IET COMMUNICATIONS, 2021, 15 (18) : 2315 - 2329
  • [27] An efficient reinforcement learning-based Botnet detection approach
    Alauthman, Mohammad
    Aslam, Nauman
    Al-kasassbeh, Mouhammd
    Khan, Suleman
    Al-Qerem, Ahmad
    Choo, Kim-Kwang Raymond
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2020, 150 (150)
  • [28] A Reinforcement Learning-based Multipath Scheduling for Heterogeneous Wireless Networks
    Nguyen, Thanh Trung
    Vu, Minh Hai
    Le Nguyen, Phi
    Do, Phan Thuan
    Nguyen, Kien
    2022 IEEE 8TH WORLD FORUM ON INTERNET OF THINGS, WF-IOT, 2022,
  • [29] An Enhanced Adaptivity of Reinforcement Learning-Based Temperature Control in Buildings Using Generalized Training
    Taboga, Vincent
    Bellahsen, Amine
    Dagdougui, Hanane
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2022, 6 (02): : 255 - 266
  • [30] Deep reinforcement learning-based influence maximization for heterogeneous hypergraphs
    Sun, Yanhao
    Wu, Jie
    Song, Nuan
    Lin, Tianwei
    Li, Longxiang
    Li, Dong
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2025, 660