UKF-Based Optimal Tracking Control for Uncertain Dynamic Systems With Asymmetric Input Constraints

被引:0
作者
Liu, Ning [1 ,2 ]
Zhang, Kun [3 ]
Xie, Xiangpeng [4 ]
Yue, Dong [5 ]
机构
[1] Nanjing Univ Posts & Telecommun, Coll Automat, Nanjing 210023, Jiangsu, Peoples R China
[2] Nanjing Univ Posts & Telecommun, Coll Artificial Intelligence, Nanjing 210023, Jiangsu, Peoples R China
[3] Beihang Univ, Sch Astronaut, Beijing 100190, Peoples R China
[4] Nanjing Univ Posts & Telecommun, Sch Internet Things, Nanjing 210023, Peoples R China
[5] Nanjing Univ Posts & Telecommun, Inst Adv Technol, Nanjing 210023, Peoples R China
基金
中国国家自然科学基金;
关键词
Asymmetric constraint; finite-horizon tracking control; neural networks; reinforcement learning; unscented Kalman filter (UKF); TIME NONLINEAR-SYSTEMS; CONTROL SCHEME; HJB SOLUTION; NETWORK;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To enhance system robustness in the face of uncertainty and achieve adaptive optimization of control strategies, a novel algorithm based on the unscented Kalman filter (UKF) is developed. This algorithm addresses the finite-horizon optimal tracking control problem (FHOTCP) for nonlinear discrete-time (DT) systems with uncertainty and asymmetric input constraints. An augmented system is constructed with asymmetric control constraints being considered. The augmented problem is addressed with a DT Hamilton-Jacobi-Bellman equation (DTHJBE). By analyzing convergence with regard to the cost function and control law, the UKF-based iterative adaptive dynamic programming (ADP) algorithm is proposed. This algorithm approximates the solution of the DTHJBE, ensuring that the cost function converges to its optimal value within a bounded range. To execute the UKF-based iterative ADP algorithm, the actor-estimator-critic framework is built, in which the estimator refers to system state estimation through the application of UKF. Ultimately, simulation examples are presented to show the performance of the proposed method.
引用
收藏
页码:7224 / 7235
页数:12
相关论文
共 46 条
[1]   Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach [J].
Abu-Khalaf, M ;
Lewis, FL .
AUTOMATICA, 2005, 41 (05) :779-791
[2]   Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof [J].
Al-Tamimi, Asma ;
Lewis, Frank .
2007 IEEE INTERNATIONAL SYMPOSIUM ON APPROXIMATE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING, 2007, :38-+
[3]   Static Output-Feedback ${H}_{∞}$ Control Design Procedures for Continuous-Time Systems With Different Levels of Model Knowledge [J].
Arogeti, Shai A. ;
Lewis, Frank L. .
IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (03) :1432-1446
[4]   DYNAMIC PROGRAMMING [J].
BELLMAN, R .
SCIENCE, 1966, 153 (3731) :34-&
[5]   Heuristic Dynamic Programming Using Echo State Network For Multivariable Tracking Control Of Wastewater Treatment Process [J].
Bo Ying-Chun ;
Qiao Jun-Fei .
ASIAN JOURNAL OF CONTROL, 2015, 17 (05) :1654-1666
[6]   ADAPTIVE GUARANTEED COST CONTROL OF SYSTEMS WITH UNCERTAIN PARAMETERS [J].
CHANG, SSL ;
PENG, TKC .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1972, AC17 (04) :474-&
[7]   Event-Triggered Optimal Control for Temperature Field of Roller Kiln Based on Adaptive Dynamic Programming [J].
Chen, Ning ;
Li, Binyan ;
Luo, Biao ;
Gui, Weihua ;
Yang, Chunhua .
IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (05) :2805-2817
[8]   Data-Based Nonaffine Optimal Tracking Control Using Iterative DHP Approach [J].
Ha, Mingming ;
Wang, Ding ;
Liu, Derong .
IFAC PAPERSONLINE, 2020, 53 (02) :4246-4251
[9]   Online policy iteration ADP-based attitude-tracking control for hypersonic vehicles [J].
Han, Xiao ;
Zheng, Zongzhun ;
Liu, Lei ;
Wang, Bo ;
Cheng, Zhongtao ;
Fan, Huijin ;
Wang, Yongji .
AEROSPACE SCIENCE AND TECHNOLOGY, 2020, 106
[10]   Model-Free H∞ Optimal Tracking Control of Constrained Nonlinear Systems via an Iterative Adaptive Learning Algorithm [J].
Hou, Jiaxu ;
Wang, Ding ;
Liu, Derong ;
Zhang, Yun .
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (11) :4097-4108