Operational optimization for off-grid renewable building energy system using deep reinforcement learning

被引：54

作者：

Gao, Yuan ^{[1
]}

Matsunami, Yuki ^{[2
]}

Miyata, Shohei ^{[1
]}

Akashi, Yasunori ^{[1
]}

机构：

[1] Univ Tokyo, Grad Sch Engn, Dept Architecture, Tokyo, Japan

[2] Takasago Thermal Engn Co Ltd, Tokyo, Japan

来源：

APPLIED ENERGY | 2022年 / 325卷

关键词：

Reinforcement learning; Off-grid operation; Operational optimization; Deep learning; MODEL; SOLAR; WIND; GAME; GO;

D O I：

10.1016/j.apenergy.2022.119783

中图分类号：

TE [石油、天然气工业]; TK [能源与动力工程];

学科分类号：

0807 ; 0820 ;

摘要：

With the application of renewable energy in single office buildings, an increasing number of power grids require building systems coupled with renewable energy to realize off-grid operation. However, the uncertainty of renewable energy sources and the safety of the corresponding energy storage equipment have become major challenges for these systems. Reinforcement learning has made considerable progress in the field of building control as an advanced control algorithm; however, research on its application to the off-grid operation of renewable energy systems, particularly the specific reward function design is limited. Therefore, this study considered the off-grid operation of a renewable building energy system and the safety (prevention of deterioration) of its battery as optimization goals. This study is based on a real building energy system, through the operation control of generators, solar photovoltaics, and batteries to achieve optimization purposes. Aiming at these two optimization goals, this study introduces a detailed reward function design and complete reinforcement learning workflow through a Gaussian distribution. Two deep reinforcement learning (DRL) algorithms were trained and verified by offline reinforcement learning based on the measured data-sets of actual existing buildings in Japan. The results show that the proposed reinforcement learning design can better achieve the two optimization goals of off-grid operation and battery safety under ordinary and extreme conditions. On off-grid operation tasks, the best DRL algorithm can achieve a mean hourly grid power purchase error of less than 2 kWh for the entire optimization window. Furthermore, the proposed deep reinforcement learning algorithms can simultaneously maintain a maximum average 7.72 h of battery unsafe state over the entire 168-hour optimization window.

引用

页数：19

共 57 条

[1] A review on recent size optimization methodologies for standalone solar and wind hybrid renewable energy system [J].

Al-Falahi, Monaaf D. A. ;

Jayasinghe, S. D. G. ;

Enshaei, H. .

ENERGY CONVERSION AND MANAGEMENT, 2017, 143 :252-274

[2] Theories and methodology of complementary hydro/photovoltaic operation: Applications to short-term scheduling [J].

An, Yuan ;

Fang, Wei ;

Ming, Bo ;

Huang, Qiang .

JOURNAL OF RENEWABLE AND SUSTAINABLE ENERGY, 2015, 7 (06)

[3]

Balakrishnan S., 2020, PROC NEURIP, P4187

[4] Practical factors of envelope model setup and their effects on the performance of model predictive control for building heating, ventilating, and air conditioning systems [J].

Blum, D. H. ;

Arendt, K. ;

Rivalin, L. ;

Piette, M. A. ;

Wetter, M. ;

Veje, C. T. .

APPLIED ENERGY, 2019, 236 :410-425

[5] Multi-armed bandit problem with known trend [J].

Bouneffouf, Djallel ;

Feraud, Raphael .

NEUROCOMPUTING, 2016, 205 :16-21

[6]

Brockman Greg, 2016, arXiv

[7] The complementary nature between wind and photovoltaic generation in Brazil and the role of energy storage in utility-scale hybrid power plants [J].

Campos, Rafael Antunes ;

do Nascimento, Lucas Rafael ;

Ruther, Ricardo .

ENERGY CONVERSION AND MANAGEMENT, 2020, 221

[8]

Ceusters G, 2021, ARXIV

[9] Gnu-RL: A Precocial Reinforcement Learning Solution for Building HVAC Control Using a Differentiable MPC Policy [J].

Chen, Bingqing ;

Cai, Zicheng ;

Berges, Mario .

BUILDSYS'19: PROCEEDINGS OF THE 6TH ACM INTERNATIONAL CONFERENCE ON SYSTEMS FOR ENERGY-EFFICIENT BUILDINGS, CITIES, AND TRANSPORTATION, 2019, :316-325

[10] Experimental analysis of data-driven control for a building heating system [J].

Costanzo, G. T. ;

Iacovella, S. ;

Ruelens, F. ;

Leurs, T. ;

Claessens, B. J. .

SUSTAINABLE ENERGY GRIDS & NETWORKS, 2016, 6 :81-90

← 1 2 3 4 5 6 →