Application of deep Q-networks for model-free optimal control balancing between different HVAC systems

被引:78
作者
Ahn, Ki Uhn [1 ]
Park, Cheol Soo [2 ]
机构
[1] Seoul Natl Univ, Inst Engn Res, Seoul, South Korea
[2] Seoul Natl Univ, Coll Engn, Inst Engn Res, Dept Architecture & Architectural Engn,Inst Const, 1 Gwanak Ro, Seoul 08826, South Korea
关键词
BUILDING ENERGY; PART; OPTIMIZATION; STRATEGIES; VENTILATION; PREDICTION;
D O I
10.1080/23744731.2019.1680234
中图分类号
O414.1 [热力学];
学科分类号
摘要
A deep Q-network (DQN) was applied for model-free optimal control balancing between different HVAC systems. The DQN was coupled to a reference office building: an EnergyPlus simulation model provided by the U.S. Department of Energy. The building was air-conditioned with four air-handling units (AHUs), two electric chillers, a cooling tower, and two pumps. EnergyPlus simulation results for eleven days (July 1-11) and three subsequent days (July 12-14) were used to improve the DQN policy and test the optimal control. The optimization goal was to minimize the building's energy use while maintaining the indoor CO2 concentration below 1,000 ppm. It was revealed that the DQN-a reinforcement learning method-can improve its control policy based on prior actions, states, and rewards. The DQN lowered the total energy usage by 15.7% in comparison with the baseline operation while maintaining the indoor CO2 concentration below 1,000 ppm. Compared to model predictive control, the DQN does not require a simulation model, or a predetermined prediction horizon, thus delivering model-free optimal control. Furthermore, it was demonstrated that the DQN can find balanced control actions between different energy consumers in the building, such as chillers, pumps, and AHUs.
引用
收藏
页码:61 / 74
页数:14
相关论文
共 39 条
[1]   Theory and applications of HVAC control systems - A review of model predictive control (MPC) [J].
Afram, Abdul ;
Janabi-Sharifi, Farrokh .
BUILDING AND ENVIRONMENT, 2014, 72 :343-355
[2]   Predictability of occupant presence and performance gap in building energy simulation [J].
Ahn, Ki-Uhn ;
Kim, Deuk-Woo ;
Park, Cheol-Soo ;
de Wilde, Pieter .
APPLIED ENERGY, 2017, 208 :1639-1652
[3]   Correlation between occupants and energy consumption [J].
Ahn, Ki-Uhn ;
Park, Cheol-Soo .
ENERGY AND BUILDINGS, 2016, 116 :420-433
[4]   A review on simulation-based optimization methods applied to building performance analysis [J].
Anh-Tuan Nguyen ;
Reiter, Sigrid ;
Rigo, Philippe .
APPLIED ENERGY, 2014, 113 :1043-1058
[5]   Review of Control Techniques for HVAC Systems-Nonlinearity Approaches Based on Fuzzy Cognitive Maps [J].
Behrooz, Farinaz ;
Mariun, Norman ;
Marhaban, Mohammad Hamiruce ;
Radzi, Mohd Amran Mohd ;
Ramli, Abdul Rahman .
ENERGIES, 2018, 11 (03)
[6]  
Busoniu L, 2010, AUTOM CONTROL ENG SE, P1, DOI 10.1201/9781439821091-f
[7]   Optimal control of HVAC and window systems for natural ventilation through reinforcement learning [J].
Chen, Yujiao ;
Norford, Leslie K. ;
Samuelson, Holly W. ;
Malkawi, Ali .
ENERGY AND BUILDINGS, 2018, 169 :195-205
[8]   Satisfaction based Q-learning for integrated lighting and blind control [J].
Cheng, Zhijin ;
Zhao, Qianchuan ;
Wang, Fulin ;
Jiang, Yi ;
Xia, Li ;
Ding, Jinlei .
ENERGY AND BUILDINGS, 2016, 127 :43-55
[9]   A model predictive control optimization environment for real-time commercial building application [J].
Corbin, Charles D. ;
Henze, Gregor P. ;
May-Ostendorp, Peter .
JOURNAL OF BUILDING PERFORMANCE SIMULATION, 2013, 6 (03) :159-174
[10]  
Dawson DarrenM., 2003, Robot manipulator control: theory and practice