Reinforcement Learning-Aided Performance-Driven Fault-Tolerant Control of Feedback Control Systems

Cited by: 22
Authors
Hua, Changsheng [1]
Li, Linlin [2]
Ding, Steven X. [1]
Affiliations
[1] Univ Duisburg Essen, Inst Automat Control & Complex Syst AKS, D-33415 Verl, Germany
[2] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Minist Educ, Key Lab Knowledge Automat Ind Proc, Beijing 100083, Peoples R China
Funding
National Natural Science Foundation of China; Beijing Natural Science Foundation
Keywords
Degradation; System performance; Fault tolerant systems; Fault tolerance; Trajectory; Stochastic processes; Estimation; Data-driven; fault-tolerant control (FTC); performance degradation recovery; reinforcement learning (RL); DESIGN;
DOI
10.1109/TAC.2021.3088397
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This article is concerned with a fault-tolerant control (FTC) scheme for feedback control systems with multiplicative faults by optimizing system performance with the aid of a reinforcement learning (RL) approach. To be specific, initially, based on the Youla-Kucera (YK) and dual YK parameterizations, a new performance-driven FTC method is proposed and its capability in dealing with multiplicative faults is proven. Then, data-driven implementation of this method using RL is elaborated. This implementation shows that RL can be applied efficiently by utilizing both plant model and data to recover the fault-induced system performance degradation. Finally, a benchmark study on an inverted pendulum system demonstrates the application of the proposed performance-driven FTC method.
Pages: 3013-3020
Page count: 8