Fault-tolerant tracking control based on reinforcement learning with application to a steer-by-wire system

被引：8

作者：

Chen, Huan ^{[1
]}

Tu, Yidong ^{[1
]}

Wang, Hai ^{[2
]}

Shi, Kaibo ^{[3
]}

He, Shuping ^{[1
]}

机构：

[1] Anhui Univ, Sch Elect Engn & Automat, Anhui Engn Lab Human Robot Integrat Syst & Intell, Hefei, Peoples R China

[2] Murdoch Univ, Ctr Water Energy & Waste, Discipline Engn & Energy, Perth, WA 6150, Australia

[3] Chengdu Univ, Sch Elect Informat & Elect Engn, Chengdu 610106, Peoples R China

来源：

JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS | 2022年 / 359卷 / 03期

基金：

中国国家自然科学基金;

关键词：

ACTUATOR FAULTS; TIME-SYSTEMS; DESIGN; STATE;

D O I：

10.1016/j.jfranklin.2021.12.012

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, a novel complete model-free integral reinforcement learning (CMFIRL) algorithm based fault tolerant control scheme is proposed to solve the tracking problem of steer-by-wire (SBW) system. We begin with the recognition that the reference errors can eventually converge to zero based on the command generator model. Then an augmented tracking system is constructed with a corresponding performance index which is considered as a type of actuator failure. By using the reinforcement learning (RL) technique, three novel online update strategies are respectively developed to cope with the following three cases, i.e., model-based, partially model-free, and completely model-free. Especially, the RL algorithm for the complete model-free case eliminates the constraints of requiring the known system dynamics in fault-tolerant tracking controlling. The system stability and the convergence of the CMFIRL iteration algorithm are also rigorously proved. Finally, a simulation example is given to illustrate the effectiveness of the proposed approach. (C) 2021 The Franklin Institute. Published by Elsevier Ltd. All rights reserved.

引用

页码：1152 / 1171

页数：20

共 34 条

[1] An Adjustable Steer-by-Wire Haptic-Interface Tracking Controller for Ground Vehicles
Baviskar, Abhijit
Wagner, John R.
Dawson, Darren M.
Braganza, David
Setlur, Pradeep
[J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2009, 58 (02) : 546 - 554
[2] Passive Actuators' Fault-Tolerant Control for Affine Nonlinear Systems
Benosman, M.
Lum, K. -Y.
[J]. IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2010, 18 (01) : 152 - 163
[3] Optimal control of unknown affine nonlinear discrete-time systems using offline-trained neural networks with proof of convergence
Dierks, Travis
Thumati, Balaje T.
Jagannathan, S.
[J]. NEURAL NETWORKS, 2009, 22 (5-6) : 851 - 860
[4] Gadda C.D., 2004, P AVEC, V4, P779
[5] Fault estimation and fault-tolerant control for descriptor systems via proportional, multiple-integral and derivative observer design
Gao, Z.
Ding, S. X.
[J]. IET CONTROL THEORY AND APPLICATIONS, 2007, 1 (05) : 1208 - 1218
[6] Fault detection filter design for a class of nonlinear Markovian jumping systems with mode-dependent time-varying delays
He, Shuping
[J]. NONLINEAR DYNAMICS, 2018, 91 (03) : 1871 - 1884
[7] Fault Tolerant Sliding Mode Predictive Control for Uncertain Steer-by-Wire System
Huang, Chao
Naghdy, Fazel
Du, Haiping
[J]. IEEE TRANSACTIONS ON CYBERNETICS, 2019, 49 (01) : 261 - 272
[8] Event-Based Adaptive Fixed-Time Fuzzy Control for Active Vehicle Suspension Systems With Time-Varying Displacement Constraint
Jia, Tinghan
Pan, Yingnan
Liang, Hongjing
Lam, Hak-Keung
[J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2022, 30 (08) : 2813 - 2821
[9] Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics
Jiang, Yu
Jiang, Zhong-Ping
[J]. AUTOMATICA, 2012, 48 (10) : 2699 - 2704
[10] Reinforcement Q-learning for optimal tracking control of linear discrete-time systems with unknown dynamics
Kiumarsi, Bahare
Lewis, Frank L.
Modares, Hamidreza
Karimpour, Ali
Naghibi-Sistani, Mohammad-Bagher
[J]. AUTOMATICA, 2014, 50 (04) : 1167 - 1175

← 1 2 3 4 →