Safe Deep Reinforcement Learning for Microgrid Energy Management in Distribution Networks With Leveraged SpatialTemporal Perception

被引:35
作者
Ye, Yujian [1 ]
Wang, Hongru [2 ]
Chen, Peiling [1 ]
Tang, Yi [1 ]
Strbac, Goran [3 ]
机构
[1] Southeast Univ, Sch Elect Engn, Nanjing 210096, Peoples R China
[2] Southeast Univ, Sch Cyber Sci & Engn, Nanjing 210096, Peoples R China
[3] Imperial Coll London, Dept Elect & Elect Engn, London SW7 2AZ, England
基金
英国工程与自然科学研究理事会;
关键词
Energy management; Optimization; Uncertainty; Reactive power; Distribution networks; HVAC; Process control; microgrids; network constraints; power system spatial-temporal perception; safe deep reinforcement learning;
D O I
10.1109/TSG.2023.3243170
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Microgrids (MG) have recently attracted great interest as an effective solution to the challenging problem of distributed energy resources ’ management in distribution networks. In this context, despite deep reinforcement learning (DRL) constitutes a well-suited model-free and data-driven methodological framework, its application to MG energy management is still challenging, driven by their limitations on environment status perception and constraint satisfaction. In this paper, the MG energy management problem is formalized as a Constrained Markov Decision Process, and is solved with the state-of-the-art interior-point policy optimization (IPO) method. In contrast to conventional DRL approaches, IPO facilitates efficient learning in multi-dimensional, continuous state and action spaces, while promising satisfaction of complex network constraints of the distribution network. The generalization capability of IPO is further enhanced through the extraction of spatial-temporal correlation features from original MG operating status, combining the strength of edge conditioned convolutional network and long short-term memory network. Case studies based on an IEEE 15-bus and 123-bus test feeders with real-world data demonstrate the superior performance of the proposed method in improving MG cost effectiveness, safeguarding the secure operation of the network and uncertainty adaptability, through performance benchmarking against model-based and DRL-based baseline methods. Finally, case studies also analyze the computational and scalability performance of proposed and baseline methods.
引用
收藏
页码:3759 / 3775
页数:17
相关论文
共 57 条
[1]   Optimal Distributed Energy Resource Coordination: A Decomposition Method Based on Distribution Locational Marginal Costs [J].
Andrianesis, Panagiotis ;
Caramanis, Michael ;
Li, Na .
IEEE TRANSACTIONS ON SMART GRID, 2022, 13 (02) :1200-1212
[2]  
[Anonymous], 2017, RAINF TEMP FOR OBS V
[3]  
Ausgrid, 2020, Time of Use Pricing
[4]   A Bi-Level Framework for Supply and Demand Side Energy Management in an Islanded Microgrid [J].
Azzam, Sarah M. ;
Elshabrawy, Tallal ;
Ashour, Mohamed .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 19 (01) :220-231
[5]   Theory and Applications of Robust Optimization [J].
Bertsimas, Dimitris ;
Brown, David B. ;
Caramanis, Constantine .
SIAM REVIEW, 2011, 53 (03) :464-501
[6]   A Robust Optimization Approach for Demand Side Scheduling Considering Uncertainty of Manually Operated Appliances [J].
Du, Yuefang F. ;
Jiang, Lin ;
Li, Yuanzheng ;
Wu, Qinghua .
IEEE TRANSACTIONS ON SMART GRID, 2018, 9 (02) :743-755
[7]  
EnergyAustralia, 2020, Solar Rebates and Feed-in Tariffs
[8]   Reinforcement Learning Approach for Optimal Distributed Energy Management in a Microgrid [J].
Foruzan, Elham ;
Soh, Leen-Kiat ;
Asgarpoor, Sohrab .
IEEE TRANSACTIONS ON POWER SYSTEMS, 2018, 33 (05) :5749-5758
[9]   AC OPF for Smart Distribution Networks: An Efficient and Robust Quadratic Approach [J].
Franco, John F. ;
Ochoa, Luis F. ;
Romero, Ruben .
IEEE TRANSACTIONS ON SMART GRID, 2018, 9 (05) :4613-4623
[10]  
Francois-Lavet V., 2016, P EUROPEAN WORKSHOP