Efficient and assured reinforcement learning-based building HVAC control with heterogeneous expert-guided training

被引:0
|
作者
Xu, Shichao [1 ]
Fu, Yangyang [2 ]
Wang, Yixuan [1 ]
Yang, Zhuoran [3 ]
Huang, Chao [4 ]
O'Neill, Zheng [2 ]
Wang, Zhaoran [1 ]
Zhu, Qi [1 ]
机构
[1] Northwestern Univ, McCormick Sch Engn, Evanston, IL 60208 USA
[2] Texas A&M Univ, Dept Mech Engn, College Stn, TX 77843 USA
[3] Yale Univ, Dept Operat Res & Financial Engn, New Haven, CT 06520 USA
[4] UNIV LIVERPOOL, Dept Comp Sci, LIVERPOOL L69 3BX, England
来源
SCIENTIFIC REPORTS | 2025年 / 15卷 / 01期
基金
美国国家科学基金会;
关键词
HVAC control; Reinforcement learning; Deep learning; MODEL-PREDICTIVE CONTROL; ENERGY; SYSTEM;
D O I
10.1038/s41598-025-91326-z
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Building heating, ventilation, and air conditioning (HVAC) systems account for nearly half of building energy consumption and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$20\%$$\end{document} of total energy consumption in the US. Their operation is also crucial for ensuring the physical and mental health of building occupants. Compared with traditional model-based HVAC control methods, the recent model-free deep reinforcement learning (DRL) based methods have shown good performance while do not require the development of detailed and costly physical models. However, these model-free DRL approaches often suffer from long training time to reach a good performance, which is a major obstacle for their practical deployment. In this work, we present a systematic approach to accelerate online reinforcement learning for HVAC control by taking full advantage of the knowledge from domain experts in various forms. Specifically, the algorithm stages include learning expert functions from existing abstract physical models and from historical data via offline reinforcement learning, integrating the expert functions with rule-based guidelines, conducting training guided by the integrated expert function and performing policy initialization from distilled expert function. Moreover, to ensure that the learned DRL-based HVAC controller can effectively keep room temperature within the comfortable range for occupants, we design a runtime shielding framework to reduce the temperature violation rate and incorporate the learned controller into it. Experimental results demonstrate up to 8.8X speedup in DRL training from our approach over previous methods, with low temperature violation rate.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Deep Reinforcement Learning-based Traffic Signal Control
    Ruan, Junyun
    Tang, Jinzhuo
    Gao, Ge
    Shi, Tianyu
    Khamis, Alaa
    2023 IEEE INTERNATIONAL CONFERENCE ON SMART MOBILITY, SM, 2023, : 21 - 26
  • [42] Reinforcement Learning-based Product Delivery Frequency Control
    Liu, Yang
    Chen, Zhengxing
    Virochsiri, Kittipat
    Wang, Juan
    Wu, Jiahao
    Liang, Feng
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 15355 - 15361
  • [43] Reinforcement Learning-Based Control of a Power Electronic Converter
    Alfred, Dajr
    Czarkowski, Dariusz
    Teng, Jiaxin
    MATHEMATICS, 2024, 12 (05)
  • [44] Synchronous Reinforcement Learning-Based Control for Cognitive Autonomy
    Vamvoudakis, Kyriakos G.
    Kokolakis, Nick-Marios T.
    FOUNDATIONS AND TRENDS IN SYSTEMS AND CONTROL, 2020, 8 (1-2): : 1 - 175
  • [45] A DEEP REINFORCEMENT LEARNING APPROACH TO USING WHOLE BUILDING ENERGY MODEL FOR HVAC OPTIMAL CONTROL
    Zhang, Zhiang
    Chong, Adrian
    Pan, Yuqi
    Zhang, Chenlu
    Lu, Siliang
    Lam, Khee Poh
    2018 BUILDING PERFORMANCE ANALYSIS CONFERENCE AND SIMBUILD, 2018, : 675 - 682
  • [46] Deep Reinforcement Learning-Based Task Scheduling in Heterogeneous MEC Networks
    Shang, Ying
    Li, Jinglei
    Qin, Meng
    Yang, Qinghai
    2022 IEEE 95TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2022-SPRING), 2022,
  • [47] Successful application of predictive information in deep reinforcement learning control: A case study based on an office building HVAC system
    Gao, Yuan
    Shi, Shanrui
    Miyata, Shohei
    Akashi, Yasunori
    ENERGY, 2024, 291
  • [48] Reinforcement Learning Control Algorithm for HVAC Retrofitting: Application to a Supermarket Building Model by Dynamic Simulation
    Mastropietro, Antonio
    Castiglione, Fabio
    Ballesio, Stefano
    Fabrizio, Enrico
    PROCEEDINGS OF BUILDING SIMULATION 2019: 16TH CONFERENCE OF IBPSA, 2020, : 1412 - 1419
  • [49] Enhancing vehicle ride comfort through deep reinforcement learning with expert-guided soft-hard constraints and system characteristic considerations
    Wang, Cheng
    Cui, Xiaoxian
    Zhao, Shijie
    Zhou, Xinran
    Song, Yaqi
    Wang, Yang
    Guo, Konghui
    ADVANCED ENGINEERING INFORMATICS, 2024, 59
  • [50] Data-Efficient Deep Reinforcement Learning-Based Optimal Generation Control in DC Microgrids
    Fan, Zhen
    Zhang, Wei
    Liu, Wenxin
    IEEE SYSTEMS JOURNAL, 2024, 18 (01): : 426 - 437