Innovative energy solutions: Evaluating reinforcement learning algorithms for battery storage optimization in residential settings

Authors
Dou, Zhenlan [1]
Zhang, Chunyan [1]
Li, Junqiang [2]
Li, Dezhi [3]
Wang, Miao [3]
Sun, Lue [3]
Wang, Yong [2]
Affiliations
[1] State Grid Shanghai Municipal Elect Power Co, Shanghai 200122, Peoples R China
[2] Nanchang Univ, Sch Informat Engn, Nanchang 330031, Peoples R China
[3] China Elect Power Res Inst, Beijing Key Lab Demand Side Multienergy Carriers O, Beijing 100192, Peoples R China
Keywords
Reinforcement learning; Optimal controlling; Operation scheduling; Building energy Management; Energy storage; Solar PV system; SYSTEM; MANAGEMENT; OPERATION; BEHAVIOR; BIOMASS;
DOI
10.1016/j.psep.2024.09.123
Chinese Library Classification (CLC) number
X [Environmental Science, Safety Science];
Discipline classification code
08 ; 0830 ;
Abstract
The deployment of battery energy storage systems (BESS) and the optimization of their scheduling are key research challenges in managing the intermittency and volatility of solar photovoltaic (PV) systems. However, a review of the existing literature reveals notable gaps in the optimal scheduling of energy systems: most models focus on a single objective, and only a few address multi-objective scenarios. This study examines grid-connected homes equipped with a BESS and a solar PV system. It applies four reinforcement learning (RL) algorithms, selected for their distinct training methodologies, to develop scheduling models. The findings show that the RL model based on Trust Region Policy Optimization (TRPO) manages the BESS and PV system effectively despite real-world uncertainties, and the case study confirms the suitability and effectiveness of this approach. The TRPO-based RL framework surpasses previous models in decision-making by selecting better BESS scheduling strategies. The TRPO model achieved the highest mean self-sufficiency rate, exceeding the A3C (Asynchronous Advantage Actor-Critic), DDPG (Deep Deterministic Policy Gradient), and TAC (Twin Actor Critic) models by approximately 3%, 0.72%, and 3.5%, respectively. This yields greater autonomy and economic benefits by adapting to dynamic real-world conditions. The approach was accordingly designed to deliver an optimized outcome. The framework is primarily intended for integration into an automated energy plant environment, facilitating regular electricity trading among multiple buildings. Backed by initiatives like the Renewable Energy Certificate weight, this technology is expected to play a crucial role in maintaining a balance between power generation and consumption.
The MILP (Mixed Integer Linear Programming) architecture achieved a self-sufficiency rate of 29.12%, exceeding the rates of A3C, TRPO, DDPG, and TAC by 2.48%, 0.64%, 2%, and 3.04%, respectively.
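The abstract compares models by self-sufficiency rate but does not define the metric. The sketch below assumes one common definition, the share of household demand met without grid imports, and pairs it with a simple greedy charge/discharge rule so the effect of a battery can be seen. The function names, the greedy rule, and the toy numbers are hypothetical illustrations; this is not the paper's RL or MILP method, and it ignores efficiency losses and power limits.

```python
from typing import List, Sequence


def self_sufficiency_rate(load_kwh: Sequence[float],
                          grid_import_kwh: Sequence[float]) -> float:
    """Assumed definition: 1 - (total grid import / total load)."""
    total_load = sum(load_kwh)
    if total_load <= 0:
        raise ValueError("total load must be positive")
    return 1.0 - sum(grid_import_kwh) / total_load


def greedy_dispatch(load_kwh: Sequence[float],
                    pv_kwh: Sequence[float],
                    capacity_kwh: float,
                    soc_kwh: float = 0.0) -> List[float]:
    """Hypothetical baseline: store PV surplus, discharge to cover deficits.

    Returns the grid import at each time step. Surplus beyond the battery
    capacity is simply discarded (no export is modelled).
    """
    imports = []
    for load, pv in zip(load_kwh, pv_kwh):
        net = pv - load
        if net >= 0:
            # Surplus: charge the battery up to its capacity.
            soc_kwh = min(capacity_kwh, soc_kwh + net)
            imports.append(0.0)
        else:
            # Deficit: discharge what is available, import the rest.
            discharge = min(soc_kwh, -net)
            soc_kwh -= discharge
            imports.append(-net - discharge)
    return imports


# Toy four-step profile (made-up values, in kWh).
load = [1.0, 1.0, 1.0, 1.0]
pv = [0.0, 3.0, 0.0, 0.0]

ssr_no_battery = self_sufficiency_rate(load, greedy_dispatch(load, pv, 0.0))
ssr_with_battery = self_sufficiency_rate(load, greedy_dispatch(load, pv, 1.0))
print(ssr_no_battery, ssr_with_battery)
```

In this toy profile the 1 kWh battery shifts midday PV surplus to the following step, which raises the rate from 0.25 to 0.5. A learned or optimized scheduler would replace `greedy_dispatch` while the metric stays the same.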
Pages: 2203-2221
Page count: 19