Data-driven control of room temperature and bidirectional EV charging using deep reinforcement learning: Simulations and experiments

被引:33
作者
Svetozarevic, B. [1 ]
Baumann, C. [1 ,2 ]
Muntwiler, S. [2 ]
Di Natale, L. [1 ]
Zeilinger, M. N. [2 ]
Heer, P. [1 ]
机构
[1] Empa, Urban Energy Syst Lab, Dubendorf, Switzerland
[2] Swiss Fed Inst Technol, Inst Dynam Syst & Control, Zurich, Switzerland
基金
瑞士国家科学基金会;
关键词
Data-driven building control; Deep reinforcement learning; Room temperature control; Thermal comfort; EV charging; Recurrent neural networks; MODEL-PREDICTIVE CONTROL; BUILDING ENERGY; DEMAND RESPONSE; HVAC-SYSTEMS; OPTIMIZATION; COMFORT; ALGORITHMS; MANAGEMENT; LOAD; MPC;
D O I
10.1016/j.apenergy.2021.118127
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
The control of modern buildings is a complex multi-loop problem due to the integration of renewable energy generation, storage devices, and electric vehicles (EVs). Additionally, it is a complex multi-criteria problem due to the need to optimize overall energy use while satisfying users' comfort. Both conventional rule-based (RB) controllers, which are difficult to apply in multi-loop settings, and advanced model-based controllers, which require an accurate building model, cannot fulfil the requirements of the building automation industry to solve this problem optimally at low development and commissioning costs. This work presents a fully data-driven pipeline to obtain an optimal control policy from historical building and weather data, thus avoiding the need for complex physics-based modelling. We demonstrate the potential of this method by jointly controlling a room temperature and an EV to minimize the cost of electricity while retaining the comfort of the occupants. We model the room temperature with a recurrent neural network and use it as a simulation environment to learn a deep reinforcement learning (DRL) control policy. It achieves on average 17% energy savings and 19% better comfort satisfaction than a standard RB room temperature controller. When a bidirectional EV is connected additionally and a two-tariff electricity pricing is applied, it successfully leverages the battery and decreases the overall cost of electricity. Finally, we deployed it on a real building, where it achieved up to 30% energy savings while maintaining similar comfort levels compared to a conventional RB room temperature controller.
引用
收藏
页数:16
相关论文
共 89 条
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]   Artificial neural network (ANN) based model predictive control (MPC) and optimization of HVAC systems: A state of the art review and case study of a residential HVAC system [J].
Afram, Abdul ;
Janabi-Sharifi, Farrokh ;
Fung, Alan S. ;
Raahemifar, Kaamran .
ENERGY AND BUILDINGS, 2017, 141 :96-113
[3]  
[Anonymous], 2020, 2020 IEEE S SER, DOI DOI 10.1109/ssci47803.2020.9308468
[4]  
[Anonymous], 2013, IEEE PES ISGT EUROPE, DOI DOI 10.1109/ISGTEUROPE.2013.6695263
[5]  
[Anonymous], 2016, Openai gym
[6]  
[Anonymous], 2021, EMPA NEST-Exploring the Future of Buildings
[7]   Deep Reinforcement Learning A brief survey [J].
Arulkumaran, Kai ;
Deisenroth, Marc Peter ;
Brundage, Miles ;
Bharath, Anil Anthony .
IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (06) :26-38
[8]   Provably safe and robust learning-based model predictive control [J].
Aswani, Anil ;
Gonzalez, Humberto ;
Sastry, S. Shankar ;
Tomlin, Claire .
AUTOMATICA, 2013, 49 (05) :1216-1226
[9]   Reducing Transient and Steady State Electricity Consumption in HVAC Using Learning-Based Model-Predictive Control [J].
Aswani, Anil ;
Master, Neal ;
Taneja, Jay ;
Culler, David ;
Tomlin, Claire .
PROCEEDINGS OF THE IEEE, 2012, 100 (01) :240-253
[10]   Hyperopt: A Python library for model selection and hyperparameter optimization [J].
Bergstra, James ;
Komer, Brent ;
Eliasmith, Chris ;
Yamins, Dan ;
Cox, David D .
Computational Science and Discovery, 2015, 8 (01)