Reinforcement Learning-Based Tracking Control for a Class of Discrete-Time Systems With Actuator Fault

被引:6
作者
Liu, Yingying [1 ,2 ]
Wang, Zhanshan [1 ,2 ]
机构
[1] Northeastern Univ, State Key Lab Synthet Automat Proc Ind, Shenyang 110819, Liaoning, Peoples R China
[2] Northeastern Univ, Coll Informat Sci & Engn, Shenyang 110819, Liaoning, Peoples R China
基金
中国国家自然科学基金;
关键词
Circuit faults; Mathematical models; Actuators; Fault detection; Reinforcement learning; Circuits and systems; Time measurement; Fault-tolerant tracking; reinforcement learning; actuator fault; detection mechanism; expanded time horizon; nested calculation;
D O I
10.1109/TCSII.2021.3131360
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Tracking control problem is a common problem and widely used in many fields. In this brief, the data-based tracking control with actuator fault is studied. To deal with the fault, many detection mechanisms and fault-tolerant tracking (FTT) controllers have been studied. However, the existing detection mechanisms cannot detect the systems with non-zero initial value. Furthermore, the existing FTT controllers require some designed parameters to obtain the fault. To solve above problems, a detection mechanism based on expanded time horizon and FTT controller based on reinforcement learning are proposed in this brief. The proposed detection mechanism is appropriate for arbitrary initial value, which consists of the initial value and past data. Besides, a nested calculation method is presented to obtain the fault information of FTT controller, which only uses the system data. Hence, the proposed FTT controller avoids the design of additional parameters for fault. Finally, the effectiveness of proposed methods is verified by simulation example.
引用
收藏
页码:2827 / 2831
页数:5
相关论文
共 26 条
[1]   A Novel Supertwisting Zeroing Neural Network With Application to Mobile Robot Manipulators [J].
Chen, Dechao ;
Li, Shuai ;
Wu, Qing .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (04) :1776-1787
[2]   Fixed-Time Prescribed Performance Adaptive Trajectory Tracking Control for a QUAV [J].
Cui, Guozeng ;
Yang, Wei ;
Yu, Jinpeng ;
Li, Ze ;
Tao, Chongben .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (02) :494-498
[3]   Fault-tolerant optimised tracking control for unknown discrete-time linear systems using a combined reinforcement learning and residual compensation methodology [J].
Han, Ke-Zhen ;
Feng, Jian ;
Cui, Xiaohong .
INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2017, 48 (13) :2811-2825
[4]   An integrated data-driven Markov parameters sequence identification and adaptive dynamic programming method to design fault-tolerant optimal tracking control for completely unknown model systems [J].
Han, Kezhen ;
Feng, Jian ;
Yao, Yu .
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2017, 354 (13) :5280-5301
[5]   Fault diagnosis and fault-tolerant tracking control for discrete-time systems with faults and delays in actuator and measurement [J].
Han, Shi-Yuan ;
Chen, Yue-Hui ;
Tang, Gong-You .
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2017, 354 (12) :4719-4738
[6]  
Inoue A, 2015, INT C ADV MECH SYST, P328, DOI 10.1109/ICAMechS.2015.7287083
[7]   Neural-network-based control scheme for a class of nonlinear systems with actuator faults via data-driven reinforcement learning method [J].
Jiang, He ;
Zhang, Huaguang ;
Liu, Yang ;
Han, Ji .
NEUROCOMPUTING, 2017, 239 :1-8
[8]   Obstacle avoidance and model-free tracking control for home automation using bio-inspired approach [J].
Khan A.T. ;
Li S. ;
Li Z. .
Advanced Control for Applications: Engineering and Industrial Systems, 2022, 4 (01)
[9]   Human guided cooperative robotic agents in smart home using beetle antennae search [J].
Khan, Ameer Tamoor ;
Li, Shuai ;
Cao, Xinwei .
SCIENCE CHINA-INFORMATION SCIENCES, 2022, 65 (02)
[10]   Trajectory Optimization of 5-Link Biped Robot Using Beetle Antennae Search [J].
Khan, Ameer Tamoor ;
Li, Shuai ;
Zhou, Xuefeng .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2021, 68 (10) :3276-3280