Innovative energy solutions: Evaluating reinforcement learning algorithms for battery storage optimization in residential settings

被引：0

作者：

Dou, Zhenlan ^{[1
]}

Zhang, Chunyan ^{[1
]}

Li, Junqiang ^{[2
]}

Li, Dezhi ^{[3
]}

Wang, Miao ^{[3
]}

Sun, Lue ^{[3
]}

Wang, Yong ^{[2
]}

机构：

[1] State Grid Shanghai Municipal Elect Power Co, Shanghai 200122, Peoples R China

[2] Nanchang Univ, Sch Informat Engn, Nanchang 330031, Peoples R China

[3] China Elect Power Res Inst, Beijing Key Lab Demand Side Multienergy Carriers O, Beijing 100192, Peoples R China

来源：

PROCESS SAFETY AND ENVIRONMENTAL PROTECTION | 2024年 / 191卷

关键词：

Reinforcement learning; Optimal controlling; Operation scheduling; Building energy Management; Energy storage; Solar PV system; SYSTEM; MANAGEMENT; OPERATION; BEHAVIOR; BIOMASS;

D O I：

10.1016/j.psep.2024.09.123

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

The implementation of BESS (battery energy storage systems) and the efficient optimization of their scheduling are crucial research challenges in effectively managing the intermittency and volatility of solar-PV (photovoltaic) systems. Nevertheless, an examination of the existing body of knowledge uncovers notable deficiencies in the ideal arrangement of energy systems' timetables. Most models primarily concentrate on a single aim, whereas only a few tackle the intricacies of multi-objective scenarios. This study examines homes connected to the power grid equipped with a BESS and a solar PV system. It leverages four distinct reinforcement learning (RL) algorithms, selected for their unique training methodologies, to develop effective scheduling models. The findings demonstrate that the RL model using Trust Region Policy Optimization (TRPO) effectively manages the BESS and PV system despite real-world uncertainties. This case study confirms the suitability and effectiveness of this approach. The TRPO-based RL framework surpasses previous models in decision-making by choosing the most optimal BESS scheduling strategies. The TRPO model exhibited the highest mean self-sufficiency rates compared to the A3C (Asynchronous Advantage Actor-Critic), DDPG (Deep Deterministic Policy Gradient), and TAC (Twin Actor Cretic) models, surpassing them by similar to 3%, 0.72%, and 3.5%, correspondingly. This results in enhanced autonomy and economic benefits by adapting to dynamic real-world conditions. Consequently, our approach was strategically designed to deliver an optimized outcome. This framework is primarily intended for seamless integration into an automated energy plant environment, facilitating regular electricity trading among multiple buildings. Backed by initiatives like the Renewable Energy Certificate weight, this technology is expected to play a crucial role in maintaining a balance between power generation and consumption. The MILP (Mixed Integer Linear Programming) architecture achieved a self-sufficiency rate of 29.12%, surpassing the rates of A3C, TRPO, DDPG, and TAC by 2.48%, 0.64%, 2%, and 3.04%, correspondingly.

引用

页码：2203 / 2221

页数：19

共 50 条

[41] Reinforcement learning-based optimal scheduling model of battery energy storage system at the building level
Kang, Hyuna
Jung, Seunghoon
Kim, Hakpyeong
Jeoung, Jaewon
Hong, Taehoon
RENEWABLE & SUSTAINABLE ENERGY REVIEWS, 2024, 190
[42] An Optimal Day-ahead Bidding Strategy and Operation for Battery Energy Storage System by Reinforcement Learning
Dong, Yi
Zhao, Tianqiao
Ding, Zhengtao
IFAC PAPERSONLINE, 2020, 53 (02): : 13190 - 13195
[43] Innovative Application of Computer Game Algorithms of Surakarta Based on Reinforcement Learning
Tao, Jun
Wu, Gui
Zeng, Peng
2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 2312 - 2315
[44] A Strategic Day-ahead bidding strategy and operation for battery energy storage system by reinforcement learning
Dong, Yi
Dong, Zhen
Zhao, Tianqiao
Ding, Zhengtao
ELECTRIC POWER SYSTEMS RESEARCH, 2021, 196
[45] Optimally sizing of battery energy storage capacity by operational optimization of residential PV-Battery systems: An Australian household case study
Mulleriyawage, U. G. K.
Shen, W. X.
RENEWABLE ENERGY, 2020, 160 (160) : 852 - 864
[46] Battery Scheduling Optimization and Potential Revenue for Residential Storage Price Arbitrage
Paulauskas, Nerijus
Kapustin, Vsevolod
BATTERIES-BASEL, 2024, 10 (07):
[47] Reinforcement Learning Algorithms For Navigating Multiagent In Auto Storage System
Hieu The Pham
Thang Quoc Nguyen
Thai-Minh Truong
Thanh-Binh Tran
Thinh Ba Vuong
PROCEEDINGS OF THE 2024 9TH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION TECHNOLOGY, ICIIT 2024, 2024, : 264 - 271
[48] Community energy storage operation via reinforcement learning with eligibility traces
Duque, Edgar Mauricio Salazar
Giraldo, Juan S.
Vergara, Pedro P.
Nguyen, Phuong
van der Molen, Anne
Slootweg, Han
ELECTRIC POWER SYSTEMS RESEARCH, 2022, 212
[49] Reinforcement learning-based scheduling strategy for energy storage in microgrid
Zhou, Kunshu
Zhou, Kaile
Yang, Shanlin
JOURNAL OF ENERGY STORAGE, 2022, 51
[50] A GaN-Based Battery Energy Storage System for Residential Application
Moradpour, Milad
Ghani, Pooya
Gatto, Gianluca
7TH INTERNATIONAL CONFERENCE ON CLEAN ELECTRICAL POWER (ICCEP 2019): RENEWABLE ENERGY RESOURCES IMPACT, 2019, : 427 - 432

← 1 2 3 4 5 →