A Two-Stage Multi-Agent Deep Reinforcement Learning Method for Urban Distribution Network Reconfiguration Considering Switch Contribution

Cited by: 10

Authors:

Gao, Hongjun [1]

Jiang, Siyuan [1]

Li, Zhengmao [2]

Wang, Renjun [1]

Liu, Youbo [1]

Liu, Junyong [1]

Affiliations:

[1] Sichuan Univ, Coll Elect Engn, Chengdu 610065, Peoples R China

[2] Aalto Univ, Sch Elect Engn, Espoo 02150, Finland

Source:

IEEE TRANSACTIONS ON POWER SYSTEMS | 2024 / Vol. 39 / Issue 06

Funding:

National Natural Science Foundation of China;

Keywords:

Switches; Control systems; Substations; Aerospace electronics; Deep reinforcement learning; Distribution networks; Voltage; Urban distribution network (UDN); reconfiguration; switch contribution; multi-agent deep reinforcement learning (MADRL); enhanced QMIX algorithm; two-stage learning structure;

DOI:

10.1109/TPWRS.2024.3371093

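Chinese Library Classification (CLC) number:

TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];

Discipline classification code: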

0808 ; 0809 ;

Abstract:

With the ever-escalating scale of urban distribution networks (UDNs), traditional model-based reconfiguration methods are becoming inadequate for smart system control. In contrast, data-driven deep reinforcement learning can support swift decision-making, but a large action space adversely affects the learning performance of its agents. Consequently, this paper presents a novel multi-agent deep reinforcement learning method for the reconfiguration of UDNs by introducing the concept of "switch contribution". First, a quantification method is proposed based on the mathematical UDN reconfiguration model, so that the contributions of controllable switches are effectively quantified. By excluding the controllable switches with low contributions during network reconfiguration, the dimensionality of the action space can be significantly reduced. Then, an improved QMIX algorithm is introduced to improve the policy of multiple agents by assigning weights. In addition, a novel two-stage learning structure based on a reward-sharing mechanism is presented to further decompose tasks and enhance the learning efficiency of multiple agents. In the first stage, agents control the switches with higher contributions, while switches with lower contributions are controlled in the second stage. During the two-stage process, the proposed reward-sharing mechanism could guarantee a reliable UDN reconfiguration and the convergence of the learning method. Finally, numerical studies on a practical 297-node system are performed to validate the method's effectiveness.
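The action-space reduction described in the abstract can be illustrated with a minimal sketch. It assumes a simple threshold rule for separating high- and low-contribution switches and treats every switch as a binary open/closed device; the abstract does not state the actual quantification or selection rule, so the scores, the threshold, and the function names below are hypothetical.

```python
from typing import Dict, List, Tuple


def split_switches_by_contribution(
    contributions: Dict[str, float], threshold: float
) -> Tuple[List[str], List[str]]:
    """Partition switches into a high-contribution group (stage one)
    and a low-contribution group (stage two).

    The threshold rule is an assumption made for illustration only.
    """
    stage_one = sorted(s for s, c in contributions.items() if c >= threshold)
    stage_two = sorted(s for s, c in contributions.items() if c < threshold)
    return stage_one, stage_two


def joint_action_count(num_switches: int) -> int:
    """Number of open/closed combinations for binary switches."""
    return 2 ** num_switches


# Hypothetical contribution scores; not taken from the paper.
contributions = {"S1": 0.42, "S2": 0.03, "S3": 0.31, "S4": 0.01, "S5": 0.18}

stage_one, stage_two = split_switches_by_contribution(contributions, threshold=0.10)

print("stage one:", stage_one)
print("stage two:", stage_two)
print("joint actions, all switches:", joint_action_count(len(contributions)))
print("joint actions, stage one only:", joint_action_count(len(stage_one)))
```

In this toy case, restricting the first stage to three of five switches shrinks the joint action count from 32 to 8, which is the kind of dimensionality reduction the abstract refers to.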

Pages: 7064 - 7076

Page count: 13

