Hybrid Reinforcement Learning for Power Transmission Network Self-Healing Considering Wind Power

被引:9
作者
Sun, Runjia [1 ]
Liu, Yutian [1 ]
机构
[1] Shandong Univ, Minist Educ, Key Lab Power Syst Intelligent Dispatch & Control, Jinan 250061, Peoples R China
关键词
Wind power generation; Generators; Wind farms; Power systems; Voltage; Training; Reliability; Monte Carlo tree search (MCTS); power system restoration; reinforcement learning (RL); self-healing; MODEL-PREDICTIVE CONTROL; SYSTEM RESTORATION; STRATEGY; DISPATCH;
D O I
10.1109/TNNLS.2021.3136554
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Transmission network self-healing considering uncertain wind power becomes crucial with increasing penetration of wind power. A hybrid reinforcement learning (HRL) method combining offline self-learning with online Monte Carlo tree search (MCTS) is designed to deal with the strong uncertainty induced by wind power restoration. The HRL method trains a policy network with offline self-learning based on historical wind and transmission system data. It then applies the policy network to guide MCTS to realize step-by-step transmission network self-healing based on real-time and forecast data in different wind power scenarios. Besides, a model predictive control method for active power dispatch is proposed to improve wind power generation credibility during self-healing. Simulation results of both test and real-life power systems demonstrate that the proposed method can realize online transmission system self-healing reliably. Comparisons among different reinforcement learning methods indicate that the number of scenarios dominated by HRL is more than twice that dominated by MCTS and a dozen times that dominated by deep Q-network. Meanwhile, the online method is more flexible in uncertain wind power scenarios than optimization methods.
引用
收藏
页码:6405 / 6415
页数:11
相关论文
共 34 条
  • [1] Frequency response of prime movers during restoration
    Adibi, MM
    Borkoski, JN
    Kafka, RJ
    Volkmann, TL
    [J]. IEEE TRANSACTIONS ON POWER SYSTEMS, 1999, 14 (02) : 751 - 756
  • [2] A FRAMEWORK FOR POWER-SYSTEM RESTORATION FOLLOWING A MAJOR POWER FAILURE
    ANCONA, JJ
    [J]. IEEE TRANSACTIONS ON POWER SYSTEMS, 1995, 10 (03) : 1480 - 1485
  • [3] A Survey of Monte Carlo Tree Search Methods
    Browne, Cameron B.
    Powley, Edward
    Whitehouse, Daniel
    Lucas, Simon M.
    Cowling, Peter I.
    Rohlfshagen, Philipp
    Tavener, Stephen
    Perez, Diego
    Samothrakis, Spyridon
    Colton, Simon
    [J]. IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, 2012, 4 (01) : 1 - 43
  • [4] Development of a Black Start Decision Supporting System for Isolated Power Systems
    Chou, Yi-Ting
    Liu, Chih-Wen
    Wang, Yi-Jen
    Wu, Chin-Chung
    Lin, Chao-Chi
    [J]. IEEE TRANSACTIONS ON POWER SYSTEMS, 2013, 28 (03) : 2202 - 2210
  • [5] Current methods and advances in forecasting of wind power generation
    Foley, Aoife M.
    Leahy, Paul G.
    Marvuglia, Antonino
    McKeogh, Eamon J.
    [J]. RENEWABLE ENERGY, 2012, 37 (01) : 1 - 8
  • [6] Model-Driven Architecture of Extreme Learning Machine to Extract Power Flow Features
    Gao, Qian
    Yang, Zhifang
    Yu, Juan
    Dai, Wei
    Lei, Xingyu
    Tang, Bo
    Xie, Kaigui
    Li, Wenyuan
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (10) : 4680 - 4690
  • [7] Hinton G. E., 1993, Advances in Neural Information Processing Systems, P3, DOI [DOI 10.5555/2987189.2987190, 10.5555/2987189.2987190]
  • [8] Optimal and Autonomous Control Using Reinforcement Learning: A Survey
    Kiumarsi, Bahare
    Vamvoudakis, Kyriakos G.
    Modares, Hamidreza
    Lewis, Frank L.
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (06) : 2042 - 2062
  • [9] Bandit based Monte-Carlo planning
    Kocsis, Levente
    Szepesvari, Csaba
    [J]. MACHINE LEARNING: ECML 2006, PROCEEDINGS, 2006, 4212 : 282 - 293
  • [10] Parallel bisecting k-means with prediction clustering algorithm
    Li, Yanjun
    Chung, Soon M.
    [J]. JOURNAL OF SUPERCOMPUTING, 2007, 39 (01) : 19 - 37