A Long-Short Term Memory Recurrent Neural Network Based Reinforcement Learning Controller for Office Heating Ventilation and Air Conditioning Systems

被引:127
作者
Wang, Yuan [1 ]
Velswamy, Kirubakaran [1 ]
Huang, Biao [1 ]
机构
[1] Univ Alberta, Dept Chem Engn, Edmonton, AB T6G 2R3, Canada
关键词
HVAC; reinforcement learning; artificial neural networks; BUILDING THERMAL STORAGE; PI-CONTROLLER; HVAC;
D O I
10.3390/pr5030046
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
Energy optimization in buildings by controlling the Heating Ventilation and Air Conditioning (HVAC) system is being researched extensively. In this paper, a model-free actor-critic Reinforcement Learning (RL) controller is designed using a variant of artificial recurrent neural networks called Long-Short-Term Memory (LSTM) networks. Optimization of thermal comfort alongside energy consumption is the goal in tuning this RL controller. The test platform, our office space, is designed using SketchUp. Using OpenStudio, the HVAC system is installed in the office. The control schemes (ideal thermal comfort, a traditional control and the RL control) are implemented in MATLAB. Using the Building Control Virtual Test Bed (BCVTB), the control of the thermostat schedule during each sample time is implemented for the office in EnergyPlus alongside local weather data. Results from training and validation indicate that the RL controller improves thermal comfort by an average of 15% and energy efficiency by an average of 2.5% as compared to other strategies mentioned.
引用
收藏
页数:18
相关论文
共 42 条
[1]   Synthesis of reinforcement learning, neural networks and PI control applied to a simulated heating coil [J].
Anderson, CW ;
Hittle, DC ;
Katz, AD ;
Kretchmar, RM .
ARTIFICIAL INTELLIGENCE IN ENGINEERING, 1997, 11 (04) :421-429
[2]   MIMO robust control for HVAC systems [J].
Anderson, Michael ;
Buehner, Michael ;
Young, Peter ;
Hittle, Douglas ;
Anderson, Charles ;
Tu, Jilin ;
Hodgson, David .
IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2008, 16 (03) :475-483
[3]  
[Anonymous], 2008, INT J COMPUTATIONAL
[4]  
[Anonymous], 2000, P 16 C UNC ART INT S
[5]  
[Anonymous], 2014, P ANN C NEUR INF PRO, DOI [DOI 10.1021/acs.analchem.7b05329, DOI 10.48550/ARXIV.1409.3215]
[6]   Development of an adaptive Smith predictor-based self-tuning PI controller for an HVAC system in a test room [J].
Bai, Jianbo ;
Wang, Shengwei ;
Zhang, Xiaosong .
ENERGY AND BUILDINGS, 2008, 40 (12) :2244-2252
[7]   A new adaptive PI controller and its application in HVAC systems [J].
Bai, Jianbo ;
Zhang, Xiaosong .
ENERGY CONVERSION AND MANAGEMENT, 2007, 48 (04) :1043-1054
[8]  
Barrett E., 2015, LECT NOTES COMPUTER, V3
[9]   A Real-Time Optimization Framework for the Iterative Controller Tuning Problem [J].
Bunin, Gene A. ;
Francois, Gregory ;
Bonvin, Dominique .
PROCESSES, 2013, 1 (02) :203-237
[10]   DeepDriving: Learning Affordance for Direct Perception in Autonomous Driving [J].
Chen, Chenyi ;
Seff, Ari ;
Kornhauser, Alain ;
Xiao, Jianxiong .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2722-2730