Hybrid Reinforcement Learning for Power Transmission Network Self-Healing Considering Wind Power

被引:10
作者
Sun, Runjia [1 ]
Liu, Yutian [1 ]
机构
[1] Shandong Univ, Minist Educ, Key Lab Power Syst Intelligent Dispatch & Control, Jinan 250061, Peoples R China
关键词
Wind power generation; Generators; Wind farms; Power systems; Voltage; Training; Reliability; Monte Carlo tree search (MCTS); power system restoration; reinforcement learning (RL); self-healing; MODEL-PREDICTIVE CONTROL; SYSTEM RESTORATION; STRATEGY; DISPATCH;
D O I
10.1109/TNNLS.2021.3136554
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Transmission network self-healing considering uncertain wind power becomes crucial with increasing penetration of wind power. A hybrid reinforcement learning (HRL) method combining offline self-learning with online Monte Carlo tree search (MCTS) is designed to deal with the strong uncertainty induced by wind power restoration. The HRL method trains a policy network with offline self-learning based on historical wind and transmission system data. It then applies the policy network to guide MCTS to realize step-by-step transmission network self-healing based on real-time and forecast data in different wind power scenarios. Besides, a model predictive control method for active power dispatch is proposed to improve wind power generation credibility during self-healing. Simulation results of both test and real-life power systems demonstrate that the proposed method can realize online transmission system self-healing reliably. Comparisons among different reinforcement learning methods indicate that the number of scenarios dominated by HRL is more than twice that dominated by MCTS and a dozen times that dominated by deep Q-network. Meanwhile, the online method is more flexible in uncertain wind power scenarios than optimization methods.
引用
收藏
页码:6405 / 6415
页数:11
相关论文
共 34 条
[1]   Frequency response of prime movers during restoration [J].
Adibi, MM ;
Borkoski, JN ;
Kafka, RJ ;
Volkmann, TL .
IEEE TRANSACTIONS ON POWER SYSTEMS, 1999, 14 (02) :751-756
[2]   A FRAMEWORK FOR POWER-SYSTEM RESTORATION FOLLOWING A MAJOR POWER FAILURE [J].
ANCONA, JJ .
IEEE TRANSACTIONS ON POWER SYSTEMS, 1995, 10 (03) :1480-1485
[3]   A Survey of Monte Carlo Tree Search Methods [J].
Browne, Cameron B. ;
Powley, Edward ;
Whitehouse, Daniel ;
Lucas, Simon M. ;
Cowling, Peter I. ;
Rohlfshagen, Philipp ;
Tavener, Stephen ;
Perez, Diego ;
Samothrakis, Spyridon ;
Colton, Simon .
IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, 2012, 4 (01) :1-43
[4]   Development of a Black Start Decision Supporting System for Isolated Power Systems [J].
Chou, Yi-Ting ;
Liu, Chih-Wen ;
Wang, Yi-Jen ;
Wu, Chin-Chung ;
Lin, Chao-Chi .
IEEE TRANSACTIONS ON POWER SYSTEMS, 2013, 28 (03) :2202-2210
[5]   Current methods and advances in forecasting of wind power generation [J].
Foley, Aoife M. ;
Leahy, Paul G. ;
Marvuglia, Antonino ;
McKeogh, Eamon J. .
RENEWABLE ENERGY, 2012, 37 (01) :1-8
[6]   Model-Driven Architecture of Extreme Learning Machine to Extract Power Flow Features [J].
Gao, Qian ;
Yang, Zhifang ;
Yu, Juan ;
Dai, Wei ;
Lei, Xingyu ;
Tang, Bo ;
Xie, Kaigui ;
Li, Wenyuan .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (10) :4680-4690
[7]  
Hinton Geoff., 1994, ADV NEURAL INFORM PR, V6
[8]   Optimal and Autonomous Control Using Reinforcement Learning: A Survey [J].
Kiumarsi, Bahare ;
Vamvoudakis, Kyriakos G. ;
Modares, Hamidreza ;
Lewis, Frank L. .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (06) :2042-2062
[9]   Bandit based Monte-Carlo planning [J].
Kocsis, Levente ;
Szepesvari, Csaba .
MACHINE LEARNING: ECML 2006, PROCEEDINGS, 2006, 4212 :282-293
[10]   Parallel bisecting k-means with prediction clustering algorithm [J].
Li, Yanjun ;
Chung, Soon M. .
JOURNAL OF SUPERCOMPUTING, 2007, 39 (01) :19-37