Controlling distributed energy resources via deep reinforcement learning for load flexibility and energy efficiency

被引:69
作者
Touzani, Samir [1 ]
Prakash, Anand Krishnan [1 ]
Wang, Zhe [1 ]
Agarwal, Shreya [1 ]
Pritoni, Marco [1 ]
Kiran, Mariam [1 ]
Brown, Richard [1 ]
Granderson, Jessica [1 ]
机构
[1] Lawrence Berkeley Natl Lab, Berkeley, CA 94720 USA
关键词
Deep reinforcement learning; Deep deterministic policy gradient algorithm; Smart buildings; Control systems; Distributed energy resources; Load flexibility; Energy efficiency; DEMAND RESPONSE; MANAGEMENT; ALGORITHM; LEVEL; MODEL;
D O I
10.1016/j.apenergy.2021.117733
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
Behind-the-meter distributed energy resources (DERs), including building solar photovoltaic (PV) technology and electric battery storage, are increasingly being considered as solutions to support carbon reduction goals and increase grid reliability and resiliency. However, dynamic control of these resources in concert with traditional building loads, to effect efficiency and demand flexibility, is not yet commonplace in commercial control products. Traditional rule-based control algorithms do not offer integrated closed-loop control to optimize across systems, and most often, PV and battery systems are operated for energy arbitrage and demand charge management, and not for the provision of grid services. More advanced control approaches, such as MPC control have not been widely adopted in industry because they require significant expertise to develop and deploy. Recent advances in deep reinforcement learning (DRL) offer a promising option to optimize the operation of DER systems and building loads with reduced setup effort. However, there are limited studies that evaluate the efficacy of these methods to control multiple building subsystems simultaneously. Additionally, most of the research has been conducted in simulated environments as opposed to real buildings. This paper proposes a DRL approach that uses a deep deterministic policy gradient algorithm for integrated control of HVAC and electric battery storage systems in the presence of on-site PV generation. The DRL algorithm, trained on synthetic data, was deployed in a physical test building and evaluated against a baseline that uses the current best-in-class rule-based control strategies. Performance in delivering energy efficiency, load shift, and load shed was tested using price-based signals. The results showed that the DRL-based controller can produce cost savings of up to 39.6% as compared to the baseline controller, while maintaining similar thermal comfort in the building. The project team has also integrated the simulation components developed during this work as an OpenAIGym environment and made it publicly available so that prospective DRL researchers can leverage this environment to evaluate alternate DRL algorithms.
引用
收藏
页数:18
相关论文
共 66 条
[1]   Demand Response Strategy Based on Reinforcement Learning and Fuzzy Reasoning for Home Energy Management [J].
Alfaverh, Fayiz ;
Denai, M. ;
Sun, Yichuang .
IEEE ACCESS, 2020, 8 :39310-39321
[2]  
Andersson C., 2016, PYFMI PYTHON PACKAGE
[3]  
[Anonymous], 2015, ACS SYM SER
[4]  
[Anonymous], 2008, IFAC Proc. Vol, DOI [DOI 10.3182/20080706-5-KR-1001.01934, 10.3182/20080706-5-KR-1001.01934]
[5]   Optimal use of incentive and price based demand response to reduce costs and price volatility [J].
Asadinejad, Ailin ;
Tomsovic, Kevin .
ELECTRIC POWER SYSTEMS RESEARCH, 2017, 144 :215-223
[6]  
ASHRAE, 2018, NEW GUID STAND ADV S
[7]   Reinforcement learning for whole-building HVAC control and demand response [J].
Azuatalam, Donald ;
Lee, Wee-Lih ;
de Nijs, Frits ;
Liebman, Ariel .
ENERGY AND AI, 2020, 2
[8]   A survey on behind the meter energy management systems in smart grid [J].
Bayram, Islam Safak ;
Ustun, Taha Selim .
RENEWABLE & SUSTAINABLE ENERGY REVIEWS, 2017, 72 :1208-1232
[9]  
Bergstra J, 2012, J MACH LEARN RES, V13, P281
[10]  
BL Energy Technologies Area, 2020, FLEXLAB ADV INT BUIL