Fault-tolerant tracking control based on reinforcement learning with application to a steer-by-wire system

被引:8
作者
Chen, Huan [1 ]
Tu, Yidong [1 ]
Wang, Hai [2 ]
Shi, Kaibo [3 ]
He, Shuping [1 ]
机构
[1] Anhui Univ, Sch Elect Engn & Automat, Anhui Engn Lab Human Robot Integrat Syst & Intell, Hefei, Peoples R China
[2] Murdoch Univ, Ctr Water Energy & Waste, Discipline Engn & Energy, Perth, WA 6150, Australia
[3] Chengdu Univ, Sch Elect Informat & Elect Engn, Chengdu 610106, Peoples R China
来源
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS | 2022年 / 359卷 / 03期
基金
中国国家自然科学基金;
关键词
ACTUATOR FAULTS; TIME-SYSTEMS; DESIGN; STATE;
D O I
10.1016/j.jfranklin.2021.12.012
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, a novel complete model-free integral reinforcement learning (CMFIRL) algorithm based fault tolerant control scheme is proposed to solve the tracking problem of steer-by-wire (SBW) system. We begin with the recognition that the reference errors can eventually converge to zero based on the command generator model. Then an augmented tracking system is constructed with a corresponding performance index which is considered as a type of actuator failure. By using the reinforcement learning (RL) technique, three novel online update strategies are respectively developed to cope with the following three cases, i.e., model-based, partially model-free, and completely model-free. Especially, the RL algorithm for the complete model-free case eliminates the constraints of requiring the known system dynamics in fault-tolerant tracking controlling. The system stability and the convergence of the CMFIRL iteration algorithm are also rigorously proved. Finally, a simulation example is given to illustrate the effectiveness of the proposed approach. (C) 2021 The Franklin Institute. Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:1152 / 1171
页数:20
相关论文
共 34 条
  • [1] An Adjustable Steer-by-Wire Haptic-Interface Tracking Controller for Ground Vehicles
    Baviskar, Abhijit
    Wagner, John R.
    Dawson, Darren M.
    Braganza, David
    Setlur, Pradeep
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2009, 58 (02) : 546 - 554
  • [2] Passive Actuators' Fault-Tolerant Control for Affine Nonlinear Systems
    Benosman, M.
    Lum, K. -Y.
    [J]. IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2010, 18 (01) : 152 - 163
  • [3] Optimal control of unknown affine nonlinear discrete-time systems using offline-trained neural networks with proof of convergence
    Dierks, Travis
    Thumati, Balaje T.
    Jagannathan, S.
    [J]. NEURAL NETWORKS, 2009, 22 (5-6) : 851 - 860
  • [4] Gadda C.D., 2004, P AVEC, V4, P779
  • [5] Fault estimation and fault-tolerant control for descriptor systems via proportional, multiple-integral and derivative observer design
    Gao, Z.
    Ding, S. X.
    [J]. IET CONTROL THEORY AND APPLICATIONS, 2007, 1 (05) : 1208 - 1218
  • [6] Fault detection filter design for a class of nonlinear Markovian jumping systems with mode-dependent time-varying delays
    He, Shuping
    [J]. NONLINEAR DYNAMICS, 2018, 91 (03) : 1871 - 1884
  • [7] Fault Tolerant Sliding Mode Predictive Control for Uncertain Steer-by-Wire System
    Huang, Chao
    Naghdy, Fazel
    Du, Haiping
    [J]. IEEE TRANSACTIONS ON CYBERNETICS, 2019, 49 (01) : 261 - 272
  • [8] Event-Based Adaptive Fixed-Time Fuzzy Control for Active Vehicle Suspension Systems With Time-Varying Displacement Constraint
    Jia, Tinghan
    Pan, Yingnan
    Liang, Hongjing
    Lam, Hak-Keung
    [J]. IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2022, 30 (08) : 2813 - 2821
  • [9] Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics
    Jiang, Yu
    Jiang, Zhong-Ping
    [J]. AUTOMATICA, 2012, 48 (10) : 2699 - 2704
  • [10] Reinforcement Q-learning for optimal tracking control of linear discrete-time systems with unknown dynamics
    Kiumarsi, Bahare
    Lewis, Frank L.
    Modares, Hamidreza
    Karimpour, Ali
    Naghibi-Sistani, Mohammad-Bagher
    [J]. AUTOMATICA, 2014, 50 (04) : 1167 - 1175