A Two-Stage Multi-Agent Deep Reinforcement Learning Method for Urban Distribution Network Reconfiguration Considering Switch Contribution

被引:10
|
作者
Gao, Hongjun [1 ]
Jiang, Siyuan [1 ]
Li, Zhengmao [2 ]
Wang, Renjun [1 ]
Liu, Youbo [1 ]
Liu, Junyong [1 ]
机构
[1] Sichuan Univ, Coll Elect Engn, Chengdu 610065, Peoples R China
[2] Aalto Univ, Sch Elect Engn, Espoo 02150, Finland
基金
中国国家自然科学基金;
关键词
Switches; Control systems; Substations; Aerospace electronics; Deep reinforcement learning; Distribution networks; Voltage; Urban distribution network (UDN); reconfiguration; switch contribution; multi-agent deep reinforcement learning (MADRL); enhanced QMIX algorithm; two-stage learning structure;
D O I
10.1109/TPWRS.2024.3371093
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
With the ever-escalating scale of urban distribution networks (UDNs), the traditional model-based reconfiguration methods are becoming inadequate for smart system control. On the contrary, the data-driven deep reinforcement learning method can facilitate the swift decision-making but the large action space would adversely affect the learning performance of its agents. Consequently, this paper presents a novel multi-agent deep reinforcement learning method for the reconfiguration of UDNs by introducing the concept of "switch contribution". First, a quantification method is proposed based on the mathematical UDN reconfiguration model. The contributions of controllable switches are effective quantified. By excluding the controllable switches with low contributions during network reconfiguration, the dimensionality of action space can be significantly reduced. Then, an improved QMIX algorithm is introduced to improve the policy of multiple agents by assigning the weights. Besides, a novel two-stage learning structure based on a reward-sharing mechanism is presented to further decompose tasks and enhance the learning efficiency of multiple agents. In the first stage, agents control the switches with higher contributions while switches with lower contributions will be controlled in the second stage. During the two-stage process, the proposed reward-sharing mechanism could guarantee a reliable UND reconfiguration and the convergence of our learning method. Finally, numerical results based on a practical 297-node system are performed to validate our method's effectiveness.
引用
收藏
页码:7064 / 7076
页数:13
相关论文
共 50 条
  • [31] Cooperative Multi-Agent Deep Reinforcement Learning in Soccer Domains
    Ocana, Jim Martin Catacora
    Riccio, Francesco
    Capobianco, Roberto
    Nardi, Daniele
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 1865 - 1867
  • [32] Important Scientific Problems of Multi-Agent Deep Reinforcement Learning
    Sun C.-Y.
    Mu C.-X.
    Zidonghua Xuebao/Acta Automatica Sinica, 2020, 46 (07): : 1301 - 1312
  • [33] Inter-Cell Network Slicing With Transfer Learning Empowered Multi-Agent Deep Reinforcement Learning
    Hu, Tianlun
    Liao, Qi
    Liu, Qiang
    Carle, Georg
    IEEE OPEN JOURNAL OF THE COMMUNICATIONS SOCIETY, 2023, 4 : 1141 - 1155
  • [34] Intelligent multicast routing method based on multi-agent deep reinforcement learning in SDWN
    Hu, Hongwen
    Ye, Miao
    Zhao, Chenwei
    Jiang, Qiuxiang
    Xue, Xingsi
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (09) : 17158 - 17196
  • [35] Consensus Multi-Agent Reinforcement Learning for Volt-VAR Control in Power Distribution Networks
    Gao, Yuanqi
    Wang, Wei
    Yu, Nanpeng
    IEEE TRANSACTIONS ON SMART GRID, 2021, 12 (04) : 3594 - 3604
  • [36] A Multi-Agent Reinforcement Learning Architecture for Network Slicing Orchestration
    Mason, Federico
    Nencioni, Gianfranco
    Zanella, Andrea
    2021 19TH MEDITERRANEAN COMMUNICATION AND COMPUTER NETWORKING CONFERENCE (MEDCOMNET), 2021,
  • [37] Two-stage deep reinforcement learning method for agile optical satellite scheduling problem
    Liu, Zheng
    Xiong, Wei
    Jia, Zhuoya
    Han, Chi
    COMPLEX & INTELLIGENT SYSTEMS, 2025, 11 (01)
  • [38] Multi-agent deep reinforcement learning for resilience optimization of building structures considering utility interactions for functionality
    Anwar, Ghazanfar Ali
    Akber, Muhammad Zeshan
    COMPUTERS & STRUCTURES, 2025, 310
  • [39] A Two-Stage Target Search and Tracking Method for UAV Based on Deep Reinforcement Learning
    Liu, Mei
    Wei, Jingbo
    Liu, Kun
    DRONES, 2024, 8 (10)
  • [40] Multi-Agent Deep Reinforcement Learning with Graph Attention Network for Traffic Signal Control in Multiple-Intersection Urban Areas
    Yang, Guoqing
    Wen, Xin
    Chen, Fuqiang
    TRANSPORTATION RESEARCH RECORD, 2025,