Optimized dead-zone inverse control using reinforcement learning and sliding-mode mechanism fora class of high-order nonlinear systems

被引:0
作者
Ma, Shuaihua [1 ]
Sun, Wenxia [1 ]
Wen, Guoxing [2 ]
机构
[1] Qilu Univ Technol, Shandong Acad Sci, Sch Math & Stat, Jinan 250353, Shandong, Peoples R China
[2] Shandong Univ Aeronaut, Coll Sci, Binzhou 256600, Shandong, Peoples R China
关键词
Optimal control; Dead-zone inverse; Reinforcement learning (RL); Sliding-mode mechanism; High-order nonlinear system; ROBUST ADAPTIVE-CONTROL; TRACKING CONTROL;
D O I
10.1016/j.ejcon.2024.101132
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
An optimized control method is developed fora class of high-order nonlinear dynamic systems having controller dead-zone phenomenon. Dead-zone refers to the controller with zero behavior within a certain range, so it will inevitably affect system performance. In order to make the optimized control eliminate the effect of dead zone, the adaptive dead-zone inverse and reinforcement learning (RL) techniques are combined. The main idea is to find the desired optimized control using RL as the input of dead-zone inverse function and then to design the adaptive algorithm to estimate the unknown parameters of dead-zone inverse function, so that the competent system control can be yielded from the dead-zone function. However, most existing RL algorithms are difficult to apply in the dead zone inverse methods because of the algorithm complexity. The proposed RL greatly simplifies the algorithm because it derives the training rules from the negative gradient of a simple positive function yielded by the partial derivative of Hamilton-Jacobi-Bellman (HJB) equation. Meanwhile, the proposed dead-zone inverse method also requires fewer adaptive parameters. Finally, the proposed control is attested through theoretical proofs and simulation examples.
引用
收藏
页数:8
相关论文
共 40 条
[11]   Adaptive tracking of nonlinear systems with non-symmetric dead-zone input [J].
Ibrir, Salim ;
Xie, Wen Fang ;
Su, Chun-Yi .
AUTOMATICA, 2007, 43 (03) :522-530
[12]   Adaptive fault-tolerant optimized formation control for perturbed nonlinear multiagent systems [J].
Ji, Weiyu ;
Pan, Yingnan ;
Zhao, Meng .
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2022, 32 (06) :3386-3407
[13]   Relaxed Conditions for Parameterized Linear Matrix Inequality in the Form of Double Fuzzy Summation [J].
Kim, Do Wan ;
Lee, Donghwan .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2024, 32 (03) :1608-1612
[14]   Observer-based adaptive fuzzy-neural control for a class of uncertain nonlinear systems with unknown dead-zone input [J].
Liu, Yan-Jun ;
Zhou, Ning .
ISA TRANSACTIONS, 2010, 49 (04) :462-469
[15]   Robust output feedback tracking control for a class of high-order time-delay nonlinear systems with input dead-zone and disturbances [J].
Liu, Zhenguo ;
Xue, Lingrong ;
Sun, Wei ;
Sun, Zongyao .
NONLINEAR DYNAMICS, 2019, 97 (02) :921-935
[16]   Nonlinear Filters-Based Adaptive Fuzzy Control of Strict-Feedback Nonlinear Systems With Unknown Asymmetric Dead-Zone Output [J].
Ma, Zhiyao ;
Tong, Shaocheng .
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, 21 (04) :5099-5109
[17]   Optimal tracking control of nonlinear partially-unknown constrained-input systems using integral reinforcement learning [J].
Modares, Hamidreza ;
Lewis, Frank L. .
AUTOMATICA, 2014, 50 (07) :1780-1792
[18]  
Pal B. B., 1995, Optimization, V35, P145, DOI 10.1080/02331939508844136
[19]   Adaptive output-feedback control for a class of nonlinear systems based on optimized backstepping technique [J].
Shen, Fei ;
Wang, Xinjun ;
Li, Haotian ;
Yin, Xinghui .
INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2022, 36 (05) :1077-1097
[20]   Observer-based adaptive dynamic surface control of time-delayed non-strict systems with input nonlinearities [J].
Shojaei, Fatemeh ;
Zarei, Jafar ;
Saif, Mehrdad .
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2023, 360 (09) :6490-6514