Reinforcement Learning-Aided Performance-Driven Fault-Tolerant Control of Feedback Control Systems

被引：22

作者：

Hua, Changsheng ^{[1
]}

Li, Linlin ^{[2
]}

Ding, Steven X. ^{[1
]}

机构：

[1] Univ Duisburg Essen, Inst Automat Control & Complex Syst AKS, D-33415 Verl, Germany

[2] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Minist Educ, Key Lab Knowledge Automat Ind Proc, Beijing 100083, Peoples R China

来源：

IEEE TRANSACTIONS ON AUTOMATIC CONTROL | 2022年 / 67卷 / 06期

基金：

中国国家自然科学基金; 北京市自然科学基金;

关键词：

Degradation; System performance; Fault tolerant systems; Fault tolerance; Trajectory; Stochastic processes; Estimation; Data-driven; fault-tolerant control (FTC); performance degradation recovery; reinforcement learning (RL); DESIGN;

D O I：

10.1109/TAC.2021.3088397

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This article is concerned with a fault-tolerant control (FTC) scheme for feedback control systems with multiplicative faults by optimizing system performance with the aid of a reinforcement learning (RL) approach. To be specific, initially, based on the Youla-Kucera (YK) and dual YK parameterizations, a new performance-driven FTC method is proposed and its capability in dealing with multiplicative faults is proven. Then, data-driven implementation of this method using RL is elaborated. This implementation shows that RL can be applied efficiently by utilizing both plant model and data to recover the fault-induced system performance degradation. Finally, a benchmark study on an inverted pendulum system demonstrates the application of the proposed performance-driven FTC method.

引用

页码：3013 / 3020

页数：8

共 24 条

[1] Natural gradient works efficiently in learning [J].

Amari, S .

NEURAL COMPUTATION, 1998, 10 (02) :251-276

[2] From Youla-Kucera to identification, adaptive and nonlinear control [J].

Anderson, BDO .

AUTOMATICA, 1998, 34 (12) :1485-1506

[3]

[Anonymous], 1999, SYSTEM IDENTIFICATIO, DOI DOI 10.1002/047134608X.W1046

[4] Plug-and-Play Control-Modifying Control Systems Online [J].

Bendtsen, Jan ;

Trangbaek, Klaus ;

Stoustrup, Jakob .

IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2013, 21 (01) :79-93

[5]

Blanke M., 2016, Diagnosis and fault-tolerant control, DOI DOI 10.1007/978-3-540-35653-0

[6]

Ding SX, 2013, ADV IND CONTROL, P3, DOI 10.1007/978-1-4471-4799-2_1

[7]

Ding SX, 2014, ADV IND CONTROL, P1, DOI 10.1007/978-1-4471-6410-4

[8] A New Method for Fault Tolerant Control through Q-Learning [J].

Hua, Changsheng ;

Ding, Steven X. ;

Shardt, Yuri A. W. .

IFAC PAPERSONLINE, 2018, 51 (24) :38-45

[9]

Kakade S, 2002, ADV NEUR IN, V14, P1531

[10] Reinforcement Learning and Adaptive Dynamic Programming for Feedback Control [J].

Lewis, Frank L. ;

Vrabie, Draguna .

IEEE CIRCUITS AND SYSTEMS MAGAZINE, 2009, 9 (03) :32-50

← 1 2 3 →