Brain-Inspired Deep Meta-Reinforcement Learning for Active Coordinated Fault-Tolerant Load Frequency Control of Multi-Area Grids

被引：23

作者：

Li, Jiawen ^{[1
,2
]}

Zhou, Tao ^{[1
]}

Cui, Haoyang ^{[1
]}

机构：

[1] Shanghai Univ Elect Power, Sch Elect & Informat Engn, Shanghai 201306, Peoples R China

[2] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Dept Elect Engn, Shanghai 200240, Peoples R China

来源：

IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING | 2024年 / 21卷 / 03期

基金：

中国国家自然科学基金;

关键词：

Frequency control; Fault tolerant systems; Fault tolerance; Fluctuations; Robust control; Power systems; Actuators; Load frequency control; fault tolerance control; performance-based frequency regulation; meta-deterministic policy gradient algorithm; brain-inspired;

D O I：

10.1109/TASE.2023.3263005

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper proposes an active coordinated fault tolerance load frequency control (AFCT-LFC) method, which effectively prevents sudden frequency changes caused by unit actuator failures or unplanned decommissioning in a multi-area interconnected grid subject to the performance-based frequency regulation market mechanism. It can also reduce regulation mileage payments and achieve multi-objective active fault-tolerant control. In addition, this paper proposes a brain-Inspired deep meta-deterministic policy gradient algorithm (BIMA-DMDPG), which adopts multi-agent centralized training, equates the controller of each area as an agent capable of independent decision making, and implements distributed training by dividing the environment into multiple environments. In addition, meta-reinforcement learning is employed to realize multi-task collaborative learning. The optimal policy is actively selected under different fault conditions to achieve active fault-tolerant control. The superior performance of the method is verified in a four-area LFC model of the China Southern Grid (CSG), in which it is tested alongside a selection of existing algorithms. Note to Practitioners-AFCT-LFC is based on advanced artificial intelligence algorithm, which can effectively identify any fault in multi-area grids and make rapid response to achieve active fault-tolerant control. Compared with the existing model-based fault-tolerant control methods, the BIMA-DMDPG algorithm proposed in this paper does not need to rely on accurate mathematical models, and can be applied to practice through simple training, which is very suitable for practical applications. Therefore, AFCT-LFC is an advanced adaptive active fault-tolerant control method that can be truly applied in practice because of its fast-decision-making ability and performance.

引用

页码：2518 / 2530

页数：13

共 28 条

[21] Neural RRT*: Learning-Based Optimal Path Planning
Wang, Jiankun
Chi, Wenzheng
Li, Chenming
Wang, Chaoqun
Meng, Max Q. -H.
[J]. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2020, 17 (04) : 1748 - 1758
[22] Smart generation control based on multi-agent reinforcement learning with the idea of the time tunnel
Xi, Lei
Chen, Jianfeng
Huang, Yuehua
Xu, Yanchun
Liu, Lang
Zhou, Yimin
Li, Yudan
[J]. ENERGY, 2018, 153 : 977 - 987
[23] A wolf pack hunting strategy based virtual tribes control for automatic generation control of smart grid
Xi, Lei
Yu, Tao
Yang, Bo
Zhang, Xiaoshun
Qiu, Xuanyu
[J]. APPLIED ENERGY, 2016, 178 : 198 - 211
[24] Yang C, 2018, CHIN CONTR CONF, P8946, DOI 10.23919/ChiCC.2018.8483467
[25] A Penalty Scheme for Mitigating Uninstructed Deviation of Generation Outputs From Variable Renewables in a Distribution Market
Yang, Jiajia
Dong, Zhao Yang
Wen, Fushuan
Chen, Qixin
Luo, Fengji
Liu, Weijia
Zhan, Junpeng
[J]. IEEE TRANSACTIONS ON SMART GRID, 2020, 11 (05) : 4056 - 4069
[26] Stochastic Optimal Relaxed Automatic Generation Control in Non-Markov Environment Based on Multi-Step Q(λ) Learning
Yu, Tao
Zhou, Bin
Chan, Ka Wing
Chen, Liang
Yang, Bo
[J]. IEEE TRANSACTIONS ON POWER SYSTEMS, 2011, 26 (03) : 1272 - 1282
[27] Accelerating bio-inspired optimizer with transfer reinforcement learning for reactive power optimization
Zhang, Xiaoshun
Yu, Tao
Yang, Bo
Cheng, Lefeng
[J]. KNOWLEDGE-BASED SYSTEMS, 2017, 116 : 26 - 38
[28] Adaptive decentralized load frequency control of multi-area power systems
Zribi, A
Al-Rashed, A
Alrifai, A
[J]. INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2005, 27 (08) : 575 - 583

← 1 2 3 →