Power flow control is critical for preventing overloads in electrical networks, which can lead to severe consequences such as disconnections, cascading outages, and system blackouts. The congestion management problem is well studied, and several techniques have been proposed to solve it. However, the dynamic and evolving nature of modern power systems, marked by increased renewable energy integration, grid interconnection, and evolving market structures, necessitates innovative solutions that can adapt to changing conditions in real time. This research addresses the challenges of power flow control and congestion management using a cost-effective technical measure: network reconfiguration via busbar splitting. Busbar splitting presents a complex optimization problem due to the vast number of possible splitting configurations. To address this challenge, we turn to Reinforcement Learning (RL), a dynamic and adaptive approach known for its real-time decision-making capabilities, its ability to learn complex patterns, and its flexibility in handling uncertainties. Several studies have investigated RL-based power flow control via network reconfiguration and proposed solutions utilizing various techniques. The reward signal, however, despite being a central component of RL methods, has not evolved at the same pace. In this paper, we propose a novel Multi-Objective Reward (MOR) design focused on reliable power system control and the prevention and reduction of overloads. Our results indicate that the proposed approach outperforms competing approaches from the literature in both the reliability and the optimality of power flow control. Training involved eight different reward strategies: our MOR approach and seven rewards from the existing literature. The best designs from the literature achieved a mean control duration of 1250 time-steps, while our approach extended this to nearly 5000 time-steps. In tests across 100 unseen scenarios, the MOR reward enabled the agent to maintain reliable network control for 21 days, outperforming the best agent from the literature by 10 days. Additionally, the agent trained with MOR significantly reduced both the frequency and severity of overloads.
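To illustrate the general idea of a multi-objective reward for grid control, the sketch below combines a line-loading margin term, an overload penalty, and a survival bonus into a single weighted signal. All function names, terms, and weights here are hypothetical illustrations of the concept; they are not the paper's actual MOR formulation.

```python
import numpy as np

def multi_objective_reward(line_loadings, survived, weights=(0.5, 0.3, 0.2)):
    """Hypothetical multi-objective reward sketch for power grid control.

    line_loadings: per-line loading as a fraction of thermal limit (1.0 = at limit).
    survived: whether the network survived the current time-step (no blackout).
    weights: assumed relative importance of the margin, overload, and survival terms.
    """
    w_margin, w_overload, w_survival = weights

    # Term 1: reward headroom on the most loaded line.
    margin = 1.0 - np.max(line_loadings)

    # Term 2: penalize the number and severity of overloaded lines.
    overloads = np.clip(line_loadings - 1.0, 0.0, None)
    overload_penalty = -np.sum(overloads)

    # Term 3: a bonus for keeping the network alive at all.
    survival_bonus = 1.0 if survived else -1.0

    return (w_margin * margin
            + w_overload * overload_penalty
            + w_survival * survival_bonus)

# Example: one line overloaded at 110% of its thermal limit.
r = multi_objective_reward(np.array([0.4, 0.95, 1.1]), survived=True)
```

Weighting separate objectives in this fashion lets the designer trade off overload prevention against overall network survival; how the paper's MOR actually balances its objectives is detailed in the main text.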