Enhancing HVAC Control Efficiency: A Hybrid Approach Using Imitation and Reinforcement Learning

被引:1
|
作者
Kadamala, Kevlyn [1 ]
Chambers, Des [1 ]
Barrett, Enda [1 ]
机构
[1] Univ Galway, Galway, Ireland
基金
爱尔兰科学基金会;
关键词
Imitation learning; Reinforcement learning; Continuous HVAC control;
D O I
10.1007/978-3-031-70378-2_16
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper explores the application of imitation learning (IL) and reinforcement learning (RL) in HVAC control. IL learns to perform tasks by imitating a demonstrator, utilising a dataset of demonstrations. However, the performance of IL is highly dependent on the quality of the expert demonstration data. On the other hand, RL can adapt control policies based on different objectives, but for larger problems, it can be sample inefficient, requiring significant time and resources for training. To overcome the limitations of both RL and IL, we propose a combined methodology where IL is used for pre-training and RL for fine-tuning. We introduce a fine-tuning methodology to HVAC control inspired by a robot navigation task. Using the 5-Zone residential building environment provided by Sinergym, we collect state-action pairs from interactions with the environment using a rule-based policy to create a dataset of expert demonstrations. Our experiments show that this combined methodology improves the efficiency and performance of the RL agent by 1% to 11.35% compared to existing literature. This study contributes to the ongoing discourse on how imitation learning can enhance the performance of reinforcement learning in building control systems.
引用
收藏
页码:256 / 270
页数:15
相关论文
共 50 条
  • [41] Transfer learning for occupancy-based HVAC control: A data-driven approach using unsupervised learning of occupancy profiles and deep reinforcement learning
    Esrafilian-Najafabadi, Mohammad
    Haghighat, Fariborz
    ENERGY AND BUILDINGS, 2023, 300
  • [42] Generating stable molecules using imitation and reinforcement learning
    Meldgaard, Soren Ager
    Koehler, Jonas
    Mortensen, Henrik Lund
    Christiansen, Mads-Peter, V
    Noe, Frank
    Hammer, Bjork
    MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2022, 3 (01):
  • [43] A Policy-Guided Imitation Approach for Offline Reinforcement Learning
    Xu, Haoran
    Jiang, Li
    Li, Jianxiong
    Zhan, Xianyuan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [44] A HYBRID MULTIAGENT REINFORCEMENT LEARNING APPROACH USING STRATEGIES AND FUSION
    Partalas, Ioannis
    Feneris, Ioannis
    Vlahavas, Ioannis
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2008, 17 (05) : 945 - 962
  • [45] A Human-Like Agent Based on a Hybrid of Reinforcement and Imitation Learning
    Dossa, Rousslan Fernand Julien
    Lian, Xinyu
    Nomoto, Hirokazu
    Matsubara, Takashi
    Uehara, Kuniaki
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [46] Energy management for hybrid electric vehicles based on imitation reinforcement learning
    Liu, Yonggang
    Wu, Yitao
    Wang, Xiangyu
    Li, Liang
    Zhang, Yuanjian
    Chen, Zheng
    ENERGY, 2023, 263
  • [47] Hybrid Reinforcement Learning- based Approach for Agent Motion Control
    Albers, Albert
    Obando, Hermann Sommer
    Gudematsch, Christoph
    2012 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY (ICIT), 2012, : 160 - 165
  • [48] Enhancing UAV Aerial Docking: A Hybrid Approach Combining Offline and Online Reinforcement Learning
    Feng, Yuting
    Yang, Tao
    Yu, Yushu
    DRONES, 2024, 8 (05)
  • [49] Surrogate Models for Enhancing the Efficiency of Neuroevolution in Reinforcement Learning
    Stork, Joerg
    Zaefferer, Martin
    Bartz-Beielstein, Thomas
    Eiben, A. E.
    PROCEEDINGS OF THE 2019 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'19), 2019, : 934 - 942
  • [50] Multi-source transfer learning method for enhancing the deployment of deep reinforcement learning in multi-zone building HVAC control
    Hou, Fangli
    Cheng, Jack C. P.
    Kwok, Helen H. L.
    Ma, Jun
    ENERGY AND BUILDINGS, 2024, 322