Hierarchical multi-agent reinforcement learning for repair crews dispatch control towards multi-energy microgrid resilience

被引:41
作者
Qiu, Dawei [1 ]
Wang, Yi [1 ]
Zhang, Tingqi [2 ,3 ]
Sun, Mingyang [3 ]
Strbac, Goran
机构
[1] Imperial Coll London, Dept Elect & Elect Engn, London SW7 2AZ, England
[2] State Grid Liaoning Elect Power Co Ltd, Elect Power Res Inst, Shenyang 110000, Peoples R China
[3] Zhejiang Univ, Dept Control Sci & Engn, Hangzhou 310027, Peoples R China
基金
中国国家自然科学基金; 英国工程与自然科学研究理事会;
关键词
Repair crews; Multi-energy microgrid; Resilience; Power-gas-transportation network; Hierarchical multi-agent reinforcement; learning; UNBALANCED DISTRIBUTION-SYSTEMS; SERVICE RESTORATION; POWER; STRATEGIES; RESOURCE;
D O I
10.1016/j.apenergy.2023.120826
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
Extreme events are greatly impacting the normal operations of microgrids, which can lead to severe outages and affect the continuous supply of energy to customers, incurring substantial restoration costs. Repair crews (RCs) are regarded as crucial resources to provide system resilience owing to their mobility and flexibility characteristics in handling both transportation and energy systems. Nevertheless, effectively coordinating the dispatch of RCs towards system resilience is a complex decision-making problem, especially in the context of a multi-energy microgrid (MEMG) with enormous dynamics and uncertainties. To this end, this paper formulates the dispatch problem of RCs in a coupled transportation and power-gas network as a decentralized partially observable Markov decision process (Dec-POMDP). To solve this Dec-POMDP, a hierarchical multi -agent reinforcement learning (MARL) algorithm is proposed by featuring a two-level framework, where the high-level action is used for switching decision-making between transportation and power-gas networks, and the lower-level action constructed via the multi-agent proximal policy optimization (MAPPO) algorithm is used to compute the routing and repairing decisions of RCs in the transportation and power-gas networks, respectively. The proposed algorithm also introduces an abstracted critic network by integrating the load restoration status, which captures the system dynamics and stabilizes the training performance with privacy protection. Extensive case studies are evaluated on a coupled 6-bus power and 6-bus gas network integrated with a 9-node 12-edge transportation network. The proposed algorithm outperforms the conventional MARL algorithms in terms of policy quality, learning stability, and computational performance. Furthermore, the dispatch strategies of RCs are analyzed and their corresponding benefits for load restoration are also evaluated. Finally, the scalability of the proposed method is also investigated for a larger 33-bus power and 15-bus gas network integrated with an 18-node 27-edge transportation network.
引用
收藏
页数:19
相关论文
共 49 条
[1]   Crude oil time series prediction model based on LSTM network with chaotic Henry gas solubility optimization [J].
Altan, Aytac ;
Karasu, Seckin .
ENERGY, 2022, 242
[2]   Repair and Resource Scheduling in Unbalanced Distribution Systems Using Neighborhood Search [J].
Arif, Anmar ;
Wang, Zhaoyu ;
Chen, Chen ;
Wang, Jianhui .
IEEE TRANSACTIONS ON SMART GRID, 2020, 11 (01) :673-685
[3]   Optimizing Service Restoration in Distribution Systems With Uncertain Repair Time and Demand [J].
Arif, Anmar ;
Ma, Shanshan ;
Wang, Zhaoyu ;
Wang, Jianhui ;
Ryan, Sarah M. ;
Chen, Chen .
IEEE TRANSACTIONS ON POWER SYSTEMS, 2018, 33 (06) :6828-6838
[4]   Modeling and evaluating nodal resilience of multi-energy systems under windstorms [J].
Bao, Minglei ;
Ding, Yi ;
Sang, Maosheng ;
Li, Daqing ;
Shao, Changzheng ;
Yan, Jinyue .
APPLIED ENERGY, 2020, 270
[5]  
Camacho E. F., 2013, Model predictive control, V2nd
[6]   Intelligent hurricane resilience enhancement of power distribution systems via deep reinforcement learning [J].
Dehghani, Nariman L. ;
Jeddi, Ashkan B. ;
Shafieezadeh, Abdollah .
APPLIED ENERGY, 2021, 285
[7]   Multiperiod Distribution System Restoration With Routing Repair Crews, Mobile Electric Vehicles, and Soft-Open-Point Networked Microgrids [J].
Ding, Tao ;
Wang, Zekai ;
Jia, Wenhao ;
Chen, Bo ;
Chen, Chen ;
Shahidehpour, Mohammad .
IEEE TRANSACTIONS ON SMART GRID, 2020, 11 (06) :4795-4808
[8]   Multi-Stage Stochastic Programming With Nonanticipativity Constraints for Expansion of Combined Power and Natural Gas Systems [J].
Ding, Tao ;
Hu, Yuan ;
Bie, Zhaohong .
IEEE TRANSACTIONS ON POWER SYSTEMS, 2018, 33 (01) :317-328
[9]   An Enhanced IEEE 33 Bus Benchmark Test System for Distribution System Studies [J].
Dolatabadi, Sarineh Hacopian ;
Ghorbanian, Maedeh ;
Siano, Pierluigi ;
Hatziargyriou, Nikos D. .
IEEE TRANSACTIONS ON POWER SYSTEMS, 2021, 36 (03) :2565-2572
[10]   Robust Network Hardening Strategy for Enhancing Resilience of Integrated Electricity and Natural Gas Distribution Systems Against Natural Disasters [J].
He, Chuan ;
Dai, Chenxi ;
Wu, Lei ;
Liu, Tianqi .
IEEE TRANSACTIONS ON POWER SYSTEMS, 2018, 33 (05) :5787-5798