A Two-Stage Multi-Agent Deep Reinforcement Learning Method for Urban Distribution Network Reconfiguration Considering Switch Contribution

被引：10

作者：

Gao, Hongjun ^{[1
]}

Jiang, Siyuan ^{[1
]}

Li, Zhengmao ^{[2
]}

Wang, Renjun ^{[1
]}

Liu, Youbo ^{[1
]}

Liu, Junyong ^{[1
]}

机构：

[1] Sichuan Univ, Coll Elect Engn, Chengdu 610065, Peoples R China

[2] Aalto Univ, Sch Elect Engn, Espoo 02150, Finland

来源：

IEEE TRANSACTIONS ON POWER SYSTEMS | 2024年 / 39卷 / 06期

基金：

中国国家自然科学基金;

关键词：

Switches; Control systems; Substations; Aerospace electronics; Deep reinforcement learning; Distribution networks; Voltage; Urban distribution network (UDN); reconfiguration; switch contribution; multi-agent deep reinforcement learning (MADRL); enhanced QMIX algorithm; two-stage learning structure;

D O I：

10.1109/TPWRS.2024.3371093

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

With the ever-escalating scale of urban distribution networks (UDNs), the traditional model-based reconfiguration methods are becoming inadequate for smart system control. On the contrary, the data-driven deep reinforcement learning method can facilitate the swift decision-making but the large action space would adversely affect the learning performance of its agents. Consequently, this paper presents a novel multi-agent deep reinforcement learning method for the reconfiguration of UDNs by introducing the concept of "switch contribution". First, a quantification method is proposed based on the mathematical UDN reconfiguration model. The contributions of controllable switches are effective quantified. By excluding the controllable switches with low contributions during network reconfiguration, the dimensionality of action space can be significantly reduced. Then, an improved QMIX algorithm is introduced to improve the policy of multiple agents by assigning the weights. Besides, a novel two-stage learning structure based on a reward-sharing mechanism is presented to further decompose tasks and enhance the learning efficiency of multiple agents. In the first stage, agents control the switches with higher contributions while switches with lower contributions will be controlled in the second stage. During the two-stage process, the proposed reward-sharing mechanism could guarantee a reliable UND reconfiguration and the convergence of our learning method. Finally, numerical results based on a practical 297-node system are performed to validate our method's effectiveness.

引用

页码：7064 / 7076

页数：13

共 50 条

[31] Cooperative Multi-Agent Deep Reinforcement Learning in Soccer Domains
Ocana, Jim Martin Catacora
Riccio, Francesco
Capobianco, Roberto
Nardi, Daniele
AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 1865 - 1867
[32] Important Scientific Problems of Multi-Agent Deep Reinforcement Learning
Sun C.-Y.
Mu C.-X.
Zidonghua Xuebao/Acta Automatica Sinica, 2020, 46 (07): : 1301 - 1312
[33] Inter-Cell Network Slicing With Transfer Learning Empowered Multi-Agent Deep Reinforcement Learning
Hu, Tianlun
Liao, Qi
Liu, Qiang
Carle, Georg
IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2023, 4 : 1141 - 1155
[34] Intelligent multicast routing method based on multi-agent deep reinforcement learning in SDWN
Hu, Hongwen
Ye, Miao
Zhao, Chenwei
Jiang, Qiuxiang
Xue, Xingsi
MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (09) : 17158 - 17196
[35] Consensus Multi-Agent Reinforcement Learning for Volt-VAR Control in Power Distribution Networks
Gao, Yuanqi
Wang, Wei
Yu, Nanpeng
IEEE TRANSACTIONS ON SMART GRID, 2021, 12 (04) : 3594 - 3604
[36] A Multi-Agent Reinforcement Learning Architecture for Network Slicing Orchestration
Mason, Federico
Nencioni, Gianfranco
Zanella, Andrea
2021 19TH MEDITERRANEAN COMMUNICATION AND COMPUTER NETWORKING CONFERENCE (MEDCOMNET), 2021,
[37] Two-stage deep reinforcement learning method for agile optical satellite scheduling problem
Liu, Zheng
Xiong, Wei
Jia, Zhuoya
Han, Chi
COMPLEX & INTELLIGENT SYSTEMS, 2025, 11 (01)
[38] Multi-agent deep reinforcement learning for resilience optimization of building structures considering utility interactions for functionality
Anwar, Ghazanfar Ali
Akber, Muhammad Zeshan
COMPUTERS & STRUCTURES, 2025, 310
[39] A Two-Stage Target Search and Tracking Method for UAV Based on Deep Reinforcement Learning
Liu, Mei
Wei, Jingbo
Liu, Kun
DRONES, 2024, 8 (10)
[40] Multi-Agent Deep Reinforcement Learning with Graph Attention Network for Traffic Signal Control in Multiple-Intersection Urban Areas
Yang, Guoqing
Wen, Xin
Chen, Fuqiang
TRANSPORTATION RESEARCH RECORD, 2025,

← 1 2 3 4 5 →