Brain-Inspired Deep Meta-Reinforcement Learning for Active Coordinated Fault-Tolerant Load Frequency Control of Multi-Area Grids

被引:23
作者
Li, Jiawen [1 ,2 ]
Zhou, Tao [1 ]
Cui, Haoyang [1 ]
机构
[1] Shanghai Univ Elect Power, Sch Elect & Informat Engn, Shanghai 201306, Peoples R China
[2] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Dept Elect Engn, Shanghai 200240, Peoples R China
基金
中国国家自然科学基金;
关键词
Frequency control; Fault tolerant systems; Fault tolerance; Fluctuations; Robust control; Power systems; Actuators; Load frequency control; fault tolerance control; performance-based frequency regulation; meta-deterministic policy gradient algorithm; brain-inspired;
D O I
10.1109/TASE.2023.3263005
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes an active coordinated fault tolerance load frequency control (AFCT-LFC) method, which effectively prevents sudden frequency changes caused by unit actuator failures or unplanned decommissioning in a multi-area interconnected grid subject to the performance-based frequency regulation market mechanism. It can also reduce regulation mileage payments and achieve multi-objective active fault-tolerant control. In addition, this paper proposes a brain-Inspired deep meta-deterministic policy gradient algorithm (BIMA-DMDPG), which adopts multi-agent centralized training, equates the controller of each area as an agent capable of independent decision making, and implements distributed training by dividing the environment into multiple environments. In addition, meta-reinforcement learning is employed to realize multi-task collaborative learning. The optimal policy is actively selected under different fault conditions to achieve active fault-tolerant control. The superior performance of the method is verified in a four-area LFC model of the China Southern Grid (CSG), in which it is tested alongside a selection of existing algorithms. Note to Practitioners-AFCT-LFC is based on advanced artificial intelligence algorithm, which can effectively identify any fault in multi-area grids and make rapid response to achieve active fault-tolerant control. Compared with the existing model-based fault-tolerant control methods, the BIMA-DMDPG algorithm proposed in this paper does not need to rely on accurate mathematical models, and can be applied to practice through simple training, which is very suitable for practical applications. Therefore, AFCT-LFC is an advanced adaptive active fault-tolerant control method that can be truly applied in practice because of its fast-decision-making ability and performance.
引用
收藏
页码:2518 / 2530
页数:13
相关论文
共 28 条
  • [21] Neural RRT*: Learning-Based Optimal Path Planning
    Wang, Jiankun
    Chi, Wenzheng
    Li, Chenming
    Wang, Chaoqun
    Meng, Max Q. -H.
    [J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2020, 17 (04) : 1748 - 1758
  • [22] Smart generation control based on multi-agent reinforcement learning with the idea of the time tunnel
    Xi, Lei
    Chen, Jianfeng
    Huang, Yuehua
    Xu, Yanchun
    Liu, Lang
    Zhou, Yimin
    Li, Yudan
    [J]. ENERGY, 2018, 153 : 977 - 987
  • [23] A wolf pack hunting strategy based virtual tribes control for automatic generation control of smart grid
    Xi, Lei
    Yu, Tao
    Yang, Bo
    Zhang, Xiaoshun
    Qiu, Xuanyu
    [J]. APPLIED ENERGY, 2016, 178 : 198 - 211
  • [24] Yang C, 2018, CHIN CONTR CONF, P8946, DOI 10.23919/ChiCC.2018.8483467
  • [25] A Penalty Scheme for Mitigating Uninstructed Deviation of Generation Outputs From Variable Renewables in a Distribution Market
    Yang, Jiajia
    Dong, Zhao Yang
    Wen, Fushuan
    Chen, Qixin
    Luo, Fengji
    Liu, Weijia
    Zhan, Junpeng
    [J]. IEEE TRANSACTIONS ON SMART GRID, 2020, 11 (05) : 4056 - 4069
  • [26] Stochastic Optimal Relaxed Automatic Generation Control in Non-Markov Environment Based on Multi-Step Q(λ) Learning
    Yu, Tao
    Zhou, Bin
    Chan, Ka Wing
    Chen, Liang
    Yang, Bo
    [J]. IEEE TRANSACTIONS ON POWER SYSTEMS, 2011, 26 (03) : 1272 - 1282
  • [27] Accelerating bio-inspired optimizer with transfer reinforcement learning for reactive power optimization
    Zhang, Xiaoshun
    Yu, Tao
    Yang, Bo
    Cheng, Lefeng
    [J]. KNOWLEDGE-BASED SYSTEMS, 2017, 116 : 26 - 38
  • [28] Adaptive decentralized load frequency control of multi-area power systems
    Zribi, A
    Al-Rashed, A
    Alrifai, A
    [J]. INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2005, 27 (08) : 575 - 583