Efficient and assured reinforcement learning-based building HVAC control with heterogeneous expert-guided training

Cited by: 0
Authors
Xu, Shichao [1]
Fu, Yangyang [2]
Wang, Yixuan [1]
Yang, Zhuoran [3]
Huang, Chao [4]
O'Neill, Zheng [2]
Wang, Zhaoran [1]
Zhu, Qi [1]
Affiliations
[1] Northwestern Univ, McCormick Sch Engn, Evanston, IL 60208 USA
[2] Texas A&M Univ, Dept Mech Engn, College Stn, TX 77843 USA
[3] Yale Univ, Dept Operat Res & Financial Engn, New Haven, CT 06520 USA
[4] Univ Liverpool, Dept Comp Sci, Liverpool L69 3BX, England
Source
Scientific Reports | 2025 / Vol. 15 / Issue 01
Funding
National Science Foundation (US);
Keywords
HVAC control; Reinforcement learning; Deep learning; Model-predictive control; Energy; System;
DOI
10.1038/s41598-025-91326-z
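Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code
07 ; 0710 ; 09 ;
Abstract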
Building heating, ventilation, and air conditioning (HVAC) systems account for nearly half of building energy consumption and 20% of total energy consumption in the US. Their operation is also crucial for ensuring the physical and mental health of building occupants. Compared with traditional model-based HVAC control methods, recent model-free deep reinforcement learning (DRL) based methods have shown good performance without requiring the development of detailed and costly physical models. However, these model-free DRL approaches often need long training times to reach good performance, which is a major obstacle to their practical deployment. In this work, we present a systematic approach to accelerate online reinforcement learning for HVAC control by taking full advantage of knowledge from domain experts in various forms. Specifically, the algorithm stages include learning expert functions from existing abstract physical models and from historical data via offline reinforcement learning, integrating the expert functions with rule-based guidelines, conducting training guided by the integrated expert function, and performing policy initialization from the distilled expert function. Moreover, to ensure that the learned DRL-based HVAC controller can effectively keep room temperature within the comfortable range for occupants, we design a runtime shielding framework to reduce the temperature violation rate and incorporate the learned controller into it. Experimental results demonstrate up to an 8.8x speedup in DRL training with our approach over previous methods, with a low temperature violation rate.
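The runtime shielding idea mentioned in the abstract can be illustrated with a minimal Python sketch. This is not the authors' framework: the comfort range of 20 to 24 °C, the one-step thermal model `toy_predict`, and the rule-based backup `toy_fallback` are all hypothetical stand-ins chosen only for illustration. The shield accepts the learned controller's action when the predicted next temperature stays inside the comfort range and otherwise substitutes the backup action.

```python
from typing import Callable


def shielded_action(
    temp: float,
    proposed: float,
    predict_next: Callable[[float, float], float],
    fallback: Callable[[float], float],
    low: float = 20.0,   # assumed comfort bounds, not taken from the paper
    high: float = 24.0,
) -> float:
    """Return the proposed action if the predicted next temperature is
    within [low, high]; otherwise return the fallback action."""
    predicted = predict_next(temp, proposed)
    if low <= predicted <= high:
        return proposed
    return fallback(temp)


def toy_predict(temp: float, heat: float) -> float:
    # Hypothetical one-step model: the room drifts toward a 10 °C
    # outdoor temperature, and the heating input adds warmth.
    return temp + 0.1 * (10.0 - temp) + 2.0 * heat


def toy_fallback(temp: float) -> float:
    # Hypothetical rule-based backup: full heat below 22 °C, else off.
    return 1.0 if temp < 22.0 else 0.0


if __name__ == "__main__":
    print(shielded_action(22.0, 0.5, toy_predict, toy_fallback))  # learned action kept
    print(shielded_action(20.5, 0.0, toy_predict, toy_fallback))  # shield overrides
```

In this sketch the fallback is not itself checked against the model, so it reduces violations only to the extent that the backup rule is sound; a fuller design would also verify the fallback action.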
Pages: 17
Related papers
50 in total
  • [1] Accelerate Online Reinforcement Learning for Building HVAC Control with Heterogeneous Expert Guidances
    Xu, Shichao
    Fu, Yangyang
    Wang, Yixuan
    Yang, Zhuoran
    O'Neill, Zheng
    Wang, Zhaoran
    Zhu, Qi
    PROCEEDINGS OF THE 2022 THE 9TH ACM INTERNATIONAL CONFERENCE ON SYSTEMS FOR ENERGY-EFFICIENT BUILDINGS, CITIES, AND TRANSPORTATION, BUILDSYS 2022, 2022, : 89 - 98
  • [2] Explaining Deep Reinforcement Learning-Based Methods for Control of Building HVAC Systems
    Jimenez-Raboso, Javier
    Manjavacas, Antonio
    Campoy-Nieves, Alejandro
    Molina-Solana, Miguel
    Gomez-Romero, Juan
    EXPLAINABLE ARTIFICIAL INTELLIGENCE, XAI 2023, PT II, 2023, 1902 : 237 - 255
  • [3] Transfer Learning Applied to Reinforcement Learning-Based HVAC Control
    Lissa P.
    Schukat M.
    Barrett E.
    SN Computer Science, 2020, 1 (3)
  • [4] Reinforcement Learning for Control of Building HVAC Systems
    Raman, Naren Srivaths
    Devraj, Adithya M.
    Barooah, Prabir
    Meyn, Sean P.
    2020 AMERICAN CONTROL CONFERENCE (ACC), 2020, : 2326 - 2332
  • [5] Deep Reinforcement Learning for Building HVAC Control
    Wei, Tianshu
    Wang, Yanzhi
    Zhu, Qi
    PROCEEDINGS OF THE 2017 54TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2017,
  • [6] Expert-Guided Deep Reinforcement Learning for Flexible Job Shop Scheduling Problem
    Zhang, Wenqiang
    Geng, Huili
    Bao, Xuan
    Gen, Mitsuo
    Zhang, Guohui
    Deng, Miaolei
    BIO-INSPIRED COMPUTING: THEORIES AND APPLICATIONS, PT 2, BIC-TA 2023, 2024, 2062 : 50 - 60
  • [7] Expert-guided imitation learning for energy management: Evaluating GAIL's performance in building control applications
    Liu, Mingzhe
    Guo, Mingyue
    Fu, Yangyang
    O'Neill, Zheng
    Gao, Yuan
    APPLIED ENERGY, 2024, 372
  • [8] Dynamic scheduling in modern processing systems using expert-guided distributed reinforcement learning
    Qu, Shuhui
    Wang, Jie
    Jasperneite, Juergen
    2019 24TH IEEE INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA), 2019, : 459 - 466
  • [9] Evaluating Reduced Order Models for Training Reinforcement Learning Agents for Building HVAC Control
    Kerr, Matthew
    Howard, Bianca
    PROCEEDINGS OF ASME 2024 18TH INTERNATIONAL CONFERENCE ON ENERGY SUSTAINABILITY, ES2024, 2024,
  • [10] Reinforcement learning-based HVAC control agent for optimal control of particulate matter in railway stations
    Kwon K.-B.
    Hong S.
    Heo J.-H.
    Jung H.
    Park J.-Y.
    Transactions of the Korean Institute of Electrical Engineers, 2021, 70 (10): : 1594 - 1600