A Two-Stage Multi-Agent Deep Reinforcement Learning Method for Urban Distribution Network Reconfiguration Considering Switch Contribution

Cited by: 10
Authors
Gao, Hongjun [1]
Jiang, Siyuan [1]
Li, Zhengmao [2]
Wang, Renjun [1]
Liu, Youbo [1]
Liu, Junyong [1]
Affiliations
[1] Sichuan Univ, Coll Elect Engn, Chengdu 610065, Peoples R China
[2] Aalto Univ, Sch Elect Engn, Espoo 02150, Finland
Funding
National Natural Science Foundation of China
Keywords
Switches; Control systems; Substations; Aerospace electronics; Deep reinforcement learning; Distribution networks; Voltage; Urban distribution network (UDN); reconfiguration; switch contribution; multi-agent deep reinforcement learning (MADRL); enhanced QMIX algorithm; two-stage learning structure;
DOI
10.1109/TPWRS.2024.3371093
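Chinese Library Classification number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract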
With the ever-escalating scale of urban distribution networks (UDNs), traditional model-based reconfiguration methods are becoming inadequate for smart system control. In contrast, data-driven deep reinforcement learning can facilitate swift decision-making, but a large action space adversely affects the learning performance of its agents. Consequently, this paper presents a novel multi-agent deep reinforcement learning method for the reconfiguration of UDNs by introducing the concept of "switch contribution". First, a quantification method is proposed based on the mathematical UDN reconfiguration model, so that the contributions of controllable switches are effectively quantified. By excluding the controllable switches with low contributions during network reconfiguration, the dimensionality of the action space can be significantly reduced. Then, an improved QMIX algorithm is introduced to improve the policy of multiple agents by assigning weights. In addition, a novel two-stage learning structure based on a reward-sharing mechanism is presented to further decompose tasks and enhance the learning efficiency of multiple agents. In the first stage, agents control the switches with higher contributions, while switches with lower contributions are controlled in the second stage. During the two-stage process, the proposed reward-sharing mechanism helps guarantee a reliable UDN reconfiguration and the convergence of the learning method. Finally, numerical studies on a practical 297-node system are performed to validate the method's effectiveness.
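The effect of contribution-based filtering on the action space can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the abstract does not give the quantification formula, so the contribution scores, the two thresholds, the switch names, and the helper functions `split_switches_by_contribution` and `joint_action_count` are hypothetical and are not the authors' implementation. The sketch only shows how dropping low-contribution switches and splitting the rest into two stages shrinks the number of joint open/closed combinations.

```python
from typing import Dict, List, Tuple


def split_switches_by_contribution(
    contributions: Dict[str, float],
    stage1_threshold: float,
    drop_threshold: float,
) -> Tuple[List[str], List[str], List[str]]:
    """Partition switches into stage-1, stage-2, and excluded groups.

    Hypothetical rule: scores at or above stage1_threshold go to stage 1,
    scores at or above drop_threshold go to stage 2, the rest are excluded.
    """
    stage1, stage2, excluded = [], [], []
    for name, score in sorted(contributions.items()):
        if score >= stage1_threshold:
            stage1.append(name)
        elif score >= drop_threshold:
            stage2.append(name)
        else:
            excluded.append(name)
    return stage1, stage2, excluded


def joint_action_count(num_switches: int) -> int:
    """Number of open/closed combinations for binary switches."""
    return 2 ** num_switches


# Made-up contribution scores for six switches (illustrative only).
contributions = {
    "S1": 0.42, "S2": 0.03, "S3": 0.27,
    "S4": 0.01, "S5": 0.15, "S6": 0.08,
}

stage1, stage2, excluded = split_switches_by_contribution(
    contributions, stage1_threshold=0.20, drop_threshold=0.05
)

full_space = joint_action_count(len(contributions))
reduced_space = joint_action_count(len(stage1) + len(stage2))
stage_spaces = (joint_action_count(len(stage1)), joint_action_count(len(stage2)))

print("stage 1:", stage1)
print("stage 2:", stage2)
print("excluded:", excluded)
print("joint actions:", full_space, "->", reduced_space, "per stage:", stage_spaces)
```

With these made-up numbers, excluding two of six switches cuts the joint action count from 64 to 16, and handling the remaining four in two stages leaves 4 combinations per stage. The radiality and power-flow constraints of a real UDN would restrict the feasible combinations further, which this sketch does not model.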
Pages: 7064-7076
Number of pages: 13