Enhancing HVAC Control Efficiency: A Hybrid Approach Using Imitation and Reinforcement Learning

Cited by: 1

Authors:

Kadamala, Kevlyn [1]

Chambers, Des [1]

Barrett, Enda [1]

Affiliations:

[1] Univ Galway, Galway, Ireland

Source:

MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES-APPLIED DATA SCIENCE TRACK, PT IX, ECML PKDD 2024 | 2024 / Vol. 14949

Funding:

Science Foundation Ireland;

Keywords:

Imitation learning; Reinforcement learning; Continuous HVAC control;

DOI:

10.1007/978-3-031-70378-2_16

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper explores the application of imitation learning (IL) and reinforcement learning (RL) in HVAC control. IL learns to perform tasks by imitating a demonstrator, utilising a dataset of demonstrations. However, the performance of IL is highly dependent on the quality of the expert demonstration data. On the other hand, RL can adapt control policies based on different objectives, but for larger problems, it can be sample inefficient, requiring significant time and resources for training. To overcome the limitations of both RL and IL, we propose a combined methodology where IL is used for pre-training and RL for fine-tuning. We introduce a fine-tuning methodology to HVAC control inspired by a robot navigation task. Using the 5-Zone residential building environment provided by Sinergym, we collect state-action pairs from interactions with the environment using a rule-based policy to create a dataset of expert demonstrations. Our experiments show that this combined methodology improves the efficiency and performance of the RL agent by 1% to 11.35% compared to existing literature. This study contributes to the ongoing discourse on how imitation learning can enhance the performance of reinforcement learning in building control systems.
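The pipeline outlined in the abstract (collect demonstrations from a rule-based policy, pre-train by imitation, then fine-tune against a reward) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the one-zone thermal model, the proportional rule-based controller, the linear policy, the reward weights, and the hill-climbing step that stands in for an RL algorithm. It does not use Sinergym and is not the authors' implementation.

```python
import random

# Toy stand-in for a building zone (NOT Sinergym): indoor temperature drifts
# toward the outdoor temperature and is raised by a heating action in [0, 1].
def step(temp, action, outdoor=5.0):
    return temp + 0.1 * (outdoor - temp) + 2.0 * action

def rule_based_policy(temp, setpoint=21.0):
    # Hypothetical expert: proportional heating, clipped to [0, 1].
    return min(1.0, max(0.0, 0.5 * (setpoint - temp)))

def collect_demonstrations(n_steps=200, seed=0):
    # Roll out the expert and record (state, action) pairs.
    rng = random.Random(seed)
    temp, data = 15.0, []
    for _ in range(n_steps):
        action = rule_based_policy(temp)
        data.append((temp, action))
        temp = step(temp, action) + rng.uniform(-0.2, 0.2)
    return data

def behaviour_clone(data):
    # Imitation pre-training: least-squares fit of action ~= w * temp + b.
    n = len(data)
    mean_s = sum(s for s, _ in data) / n
    mean_a = sum(a for _, a in data) / n
    cov = sum((s - mean_s) * (a - mean_a) for s, a in data)
    var = sum((s - mean_s) ** 2 for s, _ in data)
    w = cov / var
    return w, mean_a - w * mean_s

def episode_return(params, n_steps=100, setpoint=21.0):
    # Assumed reward: negative comfort deviation minus a small energy penalty.
    w, b = params
    temp, total = 15.0, 0.0
    for _ in range(n_steps):
        action = min(1.0, max(0.0, w * temp + b))
        temp = step(temp, action)
        total -= abs(temp - setpoint) + 0.1 * action
    return total

def fine_tune(params, iters=200, sigma=0.02, seed=1):
    # Hill climbing on the episode return, used here only as a simple
    # placeholder for an RL fine-tuning algorithm.
    rng = random.Random(seed)
    best, best_ret = params, episode_return(params)
    for _ in range(iters):
        cand = (best[0] + rng.gauss(0, sigma), best[1] + rng.gauss(0, sigma))
        ret = episode_return(cand)
        if ret > best_ret:
            best, best_ret = cand, ret
    return best, best_ret

demos = collect_demonstrations()
bc_params = behaviour_clone(demos)
tuned_params, tuned_return = fine_tune(bc_params)
```

Starting the search from the cloned parameters rather than from random ones mirrors the motivation given in the abstract: the imitation step supplies a reasonable initial policy, so the reward-driven step only has to refine it.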


Pages: 256-270

Page count: 15

50 items in total

[41] Transfer learning for occupancy-based HVAC control: A data-driven approach using unsupervised learning of occupancy profiles and deep reinforcement learning
Esrafilian-Najafabadi, Mohammad
Haghighat, Fariborz
ENERGY AND BUILDINGS, 2023, 300
[42] Generating stable molecules using imitation and reinforcement learning
Meldgaard, Soren Ager
Koehler, Jonas
Mortensen, Henrik Lund
Christiansen, Mads-Peter, V
Noe, Frank
Hammer, Bjork
MACHINE LEARNING-SCIENCE AND TECHNOLOGY, 2022, 3 (01):
[43] A Policy-Guided Imitation Approach for Offline Reinforcement Learning
Xu, Haoran
Jiang, Li
Li, Jianxiong
Zhan, Xianyuan
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[44] A HYBRID MULTIAGENT REINFORCEMENT LEARNING APPROACH USING STRATEGIES AND FUSION
Partalas, Ioannis
Feneris, Ioannis
Vlahavas, Ioannis
INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2008, 17 (05) : 945 - 962
[45] A Human-Like Agent Based on a Hybrid of Reinforcement and Imitation Learning
Dossa, Rousslan Fernand Julien
Lian, Xinyu
Nomoto, Hirokazu
Matsubara, Takashi
Uehara, Kuniaki
2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
[46] Energy management for hybrid electric vehicles based on imitation reinforcement learning
Liu, Yonggang
Wu, Yitao
Wang, Xiangyu
Li, Liang
Zhang, Yuanjian
Chen, Zheng
ENERGY, 2023, 263
[47] Hybrid Reinforcement Learning-based Approach for Agent Motion Control
Albers, Albert
Obando, Hermann Sommer
Gudematsch, Christoph
2012 IEEE INTERNATIONAL CONFERENCE ON INDUSTRIAL TECHNOLOGY (ICIT), 2012, : 160 - 165
[48] Enhancing UAV Aerial Docking: A Hybrid Approach Combining Offline and Online Reinforcement Learning
Feng, Yuting
Yang, Tao
Yu, Yushu
DRONES, 2024, 8 (05)
[49] Surrogate Models for Enhancing the Efficiency of Neuroevolution in Reinforcement Learning
Stork, Joerg
Zaefferer, Martin
Bartz-Beielstein, Thomas
Eiben, A. E.
PROCEEDINGS OF THE 2019 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE (GECCO'19), 2019, : 934 - 942
[50] Multi-source transfer learning method for enhancing the deployment of deep reinforcement learning in multi-zone building HVAC control
Hou, Fangli
Cheng, Jack C. P.
Kwok, Helen H. L.
Ma, Jun
ENERGY AND BUILDINGS, 2024, 322
