Twin-Delayed Deep Deterministic Policy Gradient for Low-Frequency Oscillation Damping Control

Cited by: 8
Authors
Cui, Qiushi [1 ,2 ]
Kim, Gyoungjae [1 ]
Weng, Yang [1 ]
Affiliations
[1] Arizona State Univ, Sch Elect Comp & Energy Engn, 551 East Tyler Mall, Tempe, AZ 85281 USA
[2] Chongqing Univ, Sch Elect Engn, Chongqing 400044, Peoples R China
Keywords
latency; twin-delayed deep deterministic policy gradient; damping control; wide-area measurement systems; low-frequency oscillations; INTER-AREA OSCILLATIONS; POWER-SYSTEM; DESIGN; COORDINATION; OPTIMIZATION; STABILITY
DOI
10.3390/en14206695
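Chinese Library Classification (CLC)
TE [Petroleum and Natural Gas Industry]; TK [Energy and Power Engineering]
Discipline classification codes
0807; 0820
Abstract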
Due to the large scale of power systems, latency uncertainty in communications can cause severe problems in wide-area measurement systems. To resolve this issue, a significant amount of past work has applied emerging technologies, including machine learning methods such as Q-learning, to latency issues in modern controls. Although such methods can deal with the stochastic characteristics of communication latency, Q-learning methods can overestimate the Q-values, leading to high bias. To address this overestimation bias, we redesign the learning structure of the deep deterministic policy gradient (DDPG). We then develop a twin-delayed deep deterministic policy gradient method for damping control under unknown latency in the power network. We also create a novel reward algorithm that takes into account the machine speed deviation, the prevention of episode termination, and the feedback from the action space. In this way, the system optimally damps frequency oscillations while maintaining stability and reliable operation within defined limits. The simulation results verify the proposed algorithm from various perspectives, including a latency sensitivity analysis under high renewable energy penetration and a comparison with conventional and machine learning control algorithms. The proposed method shows a fast learning curve and good control performance under varying communication latency.
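The overestimation issue mentioned in the abstract is commonly handled in twin-delayed DDPG (TD3) by keeping two critics and bootstrapping from the smaller of their two estimates, with clipped noise added to the target action. The following is a minimal sketch of that generic target computation, not the paper's implementation. The function names, the scalar action, and all default values (`gamma`, `noise_std`, `noise_clip`, the action bounds) are assumptions chosen for illustration.

```python
import random


def smoothed_target_action(action, noise_std=0.2, noise_clip=0.5,
                           low=-1.0, high=1.0, rng=random):
    """Target policy smoothing: add clipped Gaussian noise to the target action.

    All default values here are illustrative assumptions.
    """
    noise = rng.gauss(0.0, noise_std)
    noise = max(-noise_clip, min(noise_clip, noise))
    # Keep the perturbed action inside the (assumed) actuator limits.
    return max(low, min(high, action + noise))


def td3_target(reward, done, q1_next, q2_next, gamma=0.99):
    """Clipped double-Q target used to train both critics.

    q1_next and q2_next are the two target critics' estimates for the
    next state and the smoothed target action. Taking the minimum
    counteracts the overestimation bias of a single critic.
    """
    min_q = min(q1_next, q2_next)
    # No bootstrapping past a terminal transition.
    return reward + gamma * (1.0 - float(done)) * min_q


if __name__ == "__main__":
    a_next = smoothed_target_action(0.9)
    # Hypothetical critic outputs for the next state-action pair.
    y = td3_target(reward=1.0, done=False, q1_next=5.0, q2_next=3.0, gamma=0.9)
    print(a_next, y)
```

With two critics that disagree (5.0 versus 3.0 in the example), the target is built from the lower value, which is the mechanism that limits overestimation. The "delayed" part of TD3, updating the actor less often than the critics, is not shown here.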
Pages: 13