Reinforcement learning building control approach harnessing imitation learning

Cited by: 27
Authors
Dey, Sourav [1]
Marzullo, Thibault [2]
Zhang, Xiangyu [2]
Henze, Gregor [1, 2, 3]
Affiliations
[1] Univ Colorado, Dept Civil Environm & Architectural Engn, Boulder, CO 80309 USA
[2] Natl Renewable Energy Lab, Golden, CO USA
[3] Renewable & Sustainable Energy Inst, Boulder, CO 80303 USA
Keywords
Reinforcement learning; Building controls; Imitation learning; Artificial intelligence
DOI
10.1016/j.egyai.2023.100255
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Reinforcement learning (RL) has shown significant success in sequential decision making in fields such as autonomous vehicles, robotics, marketing, and gaming. This success has drawn attention to RL as a control approach for building energy systems, which are becoming more complicated because they must optimize for multiple, potentially conflicting goals such as occupant comfort, energy use, and grid interactivity. For real-world applications, however, RL has several drawbacks: it requires large amounts of training data and time, and its control behavior is unstable during early exploration, which makes it infeasible to apply directly to building control tasks. To address these issues, an imitation learning approach is used herein, in which the RL agent starts with a policy transferred from accepted rule-based and heuristic policies. This approach succeeds in reducing training time, preventing unstable early exploration behavior, and improving upon an accepted rule-based policy, all of which make RL a more practical control approach for real-world building control applications.
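The idea of starting from a policy transferred from a rule-based controller can be illustrated with a minimal behavior-cloning sketch. This is not the authors' implementation: the abstract does not specify the state, the action space, the policy class, or the RL algorithm. The thermostat rule, the setpoint, the logistic policy, and all function names below are hypothetical and chosen only for illustration. The later RL fine-tuning stage is omitted.

```python
import numpy as np

def rule_based_policy(temp_c, setpoint_c=22.0):
    """Hypothetical expert rule: heat (1) below the setpoint, otherwise off (0)."""
    return (np.asarray(temp_c) < setpoint_c).astype(int)

def pretrain_policy(temps, actions, lr=0.5, epochs=2000):
    """Behavior cloning: fit a logistic policy P(heat | temp) to the expert's actions."""
    mu, sigma = temps.mean(), temps.std()
    x = (temps - mu) / sigma              # standardize the single input feature
    w, b = 0.0, 0.0
    for _ in range(epochs):
        p = 1.0 / (1.0 + np.exp(-(w * x + b)))
        grad = p - actions                # gradient of the cross-entropy loss w.r.t. the logit
        w -= lr * np.mean(grad * x)
        b -= lr * np.mean(grad)
    return w, b, mu, sigma

def cloned_policy(temp_c, params):
    """Greedy action of the pretrained policy; an RL agent could start from these weights."""
    w, b, mu, sigma = params
    x = (np.asarray(temp_c) - mu) / sigma
    p = 1.0 / (1.0 + np.exp(-(w * x + b)))
    return (p > 0.5).astype(int)

# Collect demonstrations from the rule-based controller on synthetic zone temperatures.
rng = np.random.default_rng(0)
temps = rng.uniform(15.0, 30.0, size=500)
expert_actions = rule_based_policy(temps)

# Pretrain, then measure how often the cloned policy matches the expert.
params = pretrain_policy(temps, expert_actions)
agreement = np.mean(cloned_policy(temps, params) == expert_actions)
print(f"agreement with rule-based policy: {agreement:.3f}")
```

Under these assumptions, the pretrained parameters would serve as the initial policy for RL training, so that early behavior resembles the accepted rule rather than random exploration.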
Pages: 13