共 64 条
Towards Risk-Aware Real-Time Security Constrained Economic Dispatch: A Tailored Deep Reinforcement Learning Approach
被引:16
作者:
Hu, Jianxiong
[1
]
Ye, Yujian
[2
,3
]
Tang, Yi
[1
]
Strbac, Goran
[4
]
机构:
[1] Southeast Univ, Sch Elect Engn, Nanjing 210096, Peoples R China
[2] Southeast Univ, Sch Elect Engn, Nanjing 210096, Peoples R China
[3] Southeast Univ, Jiangsu Prov Key Lab Smart Grid Technol & Equipme, Nanjing 210096, Peoples R China
[4] Imperial Coll London, Dept Elect & Elect Engn, London SW7 2AZ, England
基金:
英国工程与自然科学研究理事会;
中国国家自然科学基金;
关键词:
Data-driven methods;
deep reinforcement learning;
knowledge-integrated learning;
power system risk evaluation;
real-time security constrained economic dispatch;
RELIABILITY ASSESSMENT;
ENERGY MANAGEMENT;
POWER;
SYSTEM;
GENERATION;
STRATEGY;
OPTIMIZATION;
D O I:
10.1109/TPWRS.2023.3288039
中图分类号:
TM [电工技术];
TN [电子技术、通信技术];
学科分类号:
0808 ;
0809 ;
摘要:
In the presence of increasing uncertainties brought by intermittent renewable energy sources, security and risk management continue to be the most critical concern in modern power system operation. Risk-aware, real-time security constrained economic dispatch (RT-SCED) provides an efficient solution towards promptly, economically and robustly responding to the changes in the power system operating state. Despite different model-based methods have been developed to handle uncertainties, significant computation burden arise to incorporate N-1 contingency constraints with a higher temporal resolution in RT-SCED. Driven by similar computational challenges, risk evaluation is often overlooked in the current application of deep reinforcement learning (DRL) based data-driven methods in RT-SCED. This article proposes a DRL-based risk-aware RT-SCED methodological framework by incorporating a novel data-driven risk evaluation model to foster efficient agent-environment interactions. The real-time dispatch policies are constructed with an improved twin delayed deep deterministic policy gradient method. The policy network features a residual network architecture and incorporates an active power allocation mechanism to integrate empirical dispatch knowledge, preventing early termination and fostering more efficient learning behavior. Case studies validate the superior performance of the proposed method in risk-aware RT-SCED on cost efficiency, uncertainty adaptability and computational efficiency, through benchmarking against model-based and data-driven baseline methods.
引用
收藏
页码:3972 / 3986
页数:15
相关论文