An Online Fault Tolerant Actor-critic Neuro-control for a Class of Nonlinear Systems using Neural Network HJB Approach

Cited by: 19

Authors:

Chang, Seung Jin [1]

Lee, Jae Young [1]

Park, Jin Bae [1]

Choi, Yoon Ho [2]

Affiliations:

[1] Yonsei Univ, Dept Elect & Elect Engn, Seoul 120749, South Korea

[2] Kyonggi Univ, Dept Elect Engn, Suwon 443760, Kyonggi Do, South Korea

Source:

INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS | 2015 / Vol. 13 / No. 02

Keywords:

Adaptive fault diagnosis observer (AFDO); critic neural network; fault tolerant actor-critic neuro-control scheme; fault tolerant control (FTC); Lyapunov analysis; SWITCHING PARAMETERS; TIME; STABILITY;

DOI:

10.1007/s12555-014-0034-3

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

In this paper, we propose an actor-critic neuro-control scheme for a class of continuous-time nonlinear systems subject to abrupt nonlinear faults, combined with an adaptive fault diagnosis observer (AFDO). The AFDO, which estimates the faults in real time, is designed together with its estimation laws based on Lyapunov analysis. Building on the AFDO, a fault tolerant actor-critic control scheme is proposed in which the critic neural network (NN) approximates the value function and the actor NN updates the fault tolerant policy based on the value function approximated by the critic NN. The weight update laws for the critic NN and the actor NN are designed using the gradient descent method. By Lyapunov analysis, we prove uniform ultimate boundedness (UUB) of all the states, their estimation errors, and the NN weights of the fault tolerant system under unpredictable faults. Finally, we verify the effectiveness of the proposed method through numerical simulations.

Pages: 311-318

Page count: 8

References (30 in total):

[1] Al-Tamimi, Asma; Lewis, Frank L.; Abu-Khalaf, Murad. Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 38 (04): 943-949

[2] Bellman RE., 1957, Dynamic Programming

[3] Bhasin S., 2010, P IEEE C DEC CONTR

[4] Campos J., 1999, P IEEE AM CONTR C, V4

[5] Dobre C, 2014, INT J INNOV COMPUT I, V10, P417

[6] Doya, K. Reinforcement learning in continuous time and space [J]. NEURAL COMPUTATION, 2000, 12 (01): 219-245

[7] Dreyfus SE, 1977, ART THEORY DYNAMIC P

[8] Gao, Zhi-Feng; Jiang, Bin; Shi, Peng; Cheng, Yue-Hua. Sensor Fault Estimation and Compensation for Microsatellite Attitude Control Systems [J]. INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2010, 8 (02): 228-237

[9] Jiang, B; Wang, JL; Soh, YC. An adaptive technique for robust diagnosis of faults with independent effects on system outputs [J]. INTERNATIONAL JOURNAL OF CONTROL, 2002, 75 (11): 792-802

[10] Lewis F.L., 1986, OPTIMAL CONTROL
