Actor-critic learning for optimal building energy management with phase change materials

被引:17
作者
Rahimpour, Zahra [1 ]
Verbic, Gregor [1 ]
Chapman, Archie C. [2 ]
机构
[1] Univ Sydney, Sch Elect & Informat Engn, Sydney, NSW, Australia
[2] Univ Queensland, Sch Informat Technol & Elect Engn, Brisbane, Qld, Australia
关键词
Actor-critic; Approximate dynamic programming; Deep deterministic policy gradient; Home energy management; Phase change materials;
D O I
10.1016/j.epsr.2020.106543
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Energy management in buildings using phase change materials (PCM) to improve thermal performance is challenging due to the nonlinear thermal capacity of the PCM. To address this problem, this paper adopts a model-free actor-critic on-policy reinforcement learning method based on deep deterministic policy gradient (DDPG). The proposed approach overcomes the major weakness of model-based approaches, such as approximate dynamic programming (ADP), which require an explicit thermal model of the building under control. This requirement makes a plug-and-play implementation of the energy management algorithm in an existing smart meter difficult due to the wide variety of building design and construction types. To overcome this difficulty, we use a DDPG algorithm that can learn policies in continuous action spaces without access to the full dynamics of the building. We demonstrate the competitive performance of DDPG by benchmarking it against an ADP-based approach with access to the full thermal dynamics of the building.
引用
收藏
页数:7
相关论文
共 23 条
[1]   Energy saving potential of phase change materials in major Australian cities [J].
Alam, Morshed ;
Jamil, Hasnat ;
Sanjayan, Jay ;
Wilson, John .
ENERGY AND BUILDINGS, 2014, 78 :192-201
[2]  
American Society of Heating, 2009, REFR AIR COND ENG HD
[3]  
[Anonymous], 2011, INNOVATIVE SMART GRI
[4]   Energy refurbishment of existing buildings through the use of phase change materials: Energy savings and indoor comfort in the cooling season [J].
Ascione, Fabrizio ;
Bianco, Nicola ;
De Masi, Rosa Francesca ;
de' Rossi, Filippo ;
Vanoli, Giuseppe Peter .
APPLIED ENERGY, 2014, 113 :990-1007
[5]  
Bellman R. E., 2015, Applied dynamic programming
[6]   Experimental Study of PCM Inclusion in Different Building Envelopes [J].
Castellon, C. ;
Castell, A. ;
Medrano, M. ;
Martorell, I. ;
Cabeza, L. F. .
JOURNAL OF SOLAR ENERGY ENGINEERING-TRANSACTIONS OF THE ASME, 2009, 131 (04) :0410061-0410066
[7]  
Chapman A.C., 2019, 13 POW 2019 IEEE
[8]  
Evola G., 2011, P BULD SIM 12 C INT
[9]  
Gao G., 2019, ARXIV1901046932019
[10]   Low-order model for the simulation of a building and its heating system [J].
School of Engineering, University of Northumbria, Newcastle upon Tyne NE1 8ST, United Kingdom ;
不详 .
Building Services Engineering Research and Technology, 2000, 21 (03) :199-208