Experimental analysis of data-driven control for a building heating system

被引:99
作者
Costanzo, G. T. [1 ]
Iacovella, S. [2 ,4 ]
Ruelens, F. [2 ,4 ]
Leurs, T. [2 ]
Claessens, B. J. [3 ,4 ]
机构
[1] Danish Tech Univ, Dept Elect Engn, Frederiksborgvej 399, DK-4000 Roskilde, Denmark
[2] Katholieke Univ Leuven, Elect Engn, Kasteelpk Arenberg 10,Bus 2445, B-3001 Leuven, Belgium
[3] Flemish Inst Technol Res VITO, Boeretang 200, B-2400 Mol, Belgium
[4] EnergyVille, Thor Pk Poort Genk 8130, B-3600 Genk, Belgium
关键词
Thermostatically controlled load; Batch reinforcement learning; Demand response; Data-driven modeling; Fitted Q-iteration; MODEL-PREDICTIVE CONTROL;
D O I
10.1016/j.segan.2016.02.002
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
Driven by the opportunity to harvest the flexibility related to building climate control for demand response applications, this work presents a data-driven control approach building upon recent advancements in reinforcement learning. More specifically, model-assisted batch reinforcement learning is applied to the setting of building climate control subjected to dynamic pricing. The underlying sequential decision making problem is cast into a Markov decision problem, after which the control algorithm is detailed. In this work, fitted Q-iteration is used to construct a policy from a batch of experimental tuples. In those regions of the state space where the experimental sample density is low, virtual support tuples are added using an artificial neural network. Finally, the resulting policy is shaped using domain knowledge. The control approach has been evaluated quantitatively using a simulation and qualitatively in a living lab. From the quantitative analysis it has been found that the control approach converges in approximately 20 days to obtain a control policy with a performance within 90% of the mathematical optimum. The experimental analysis confirms that within 10 to 20 days sensible policies are obtained that can be used for different outside temperature regimes. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:81 / 90
页数:10
相关论文
共 56 条
[1]   Theory and applications of HVAC control systems - A review of model predictive control (MPC) [J].
Afram, Abdul ;
Janabi-Sharifi, Farrokh .
BUILDING AND ENVIRONMENT, 2014, 72 :343-355
[2]  
[Anonymous], GUR OPT REF MAN
[3]  
[Anonymous], IFAC S POW PLANTS PO
[4]  
[Anonymous], MODEL BASED REINFORC
[5]  
[Anonymous], 1963, DYNAMIC PROGRAMMING
[6]  
[Anonymous], 2013, Advanced Textbooks in Control and Signal Processing
[7]  
[Anonymous], THESIS TU DENMARK LY
[8]  
[Anonymous], BELG EL TRANSM SYST
[9]  
[Anonymous], 2012, CLIN DEV IMMUNOL
[10]  
[Anonymous], TREE BASED BATCH MOD