Discounted Iterative Adaptive Critic Designs With Novel Stability Analysis for Tracking Control

被引：118

作者：

Ha, Mingming ^{[1
]}

Wang, Ding ^{[2
,3
]}

Liu, Derong ^{[4
]}

机构：

[1] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 100083, Peoples R China

[2] Beijing Univ Technol, Fac Informat Technol, Beijing Lab Smart Environm Protect, Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China

[3] Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China

[4] Univ Illinois, Dept Elect & Comp Engn, Chicago, IL 60607 USA

来源：

IEEE-CAA JOURNAL OF AUTOMATICA SINICA | 2022年 / 9卷 / 07期

基金：

中国国家自然科学基金; 北京市自然科学基金;

关键词：

Adaptive critic design; adaptive dynamic programming (ADP); approximate dynamic programming; discrete-time nonlinear systems; reinforcement learning; stability analysis; tracking control; value iteration (VI); TIME NONLINEAR-SYSTEMS; LINEAR-SYSTEMS;

D O I：

10.1109/JAS.2022.105692

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The core task of tracking control is to make the controlled plant track a desired trajectory. The traditional performance index used in previous studies cannot eliminate completely the tracking error as the number of time steps increases. In this paper, a new cost function is introduced to develop the value-iteration-based adaptive critic framework to solve the tracking control problem. Unlike the regulator problem, the iterative value function of tracking control problem cannot be regarded as a Lyapunov function. A novel stability analysis method is developed to guarantee that the tracking error converges to zero. The discounted iterative scheme under the new cost function for the special case of linear systems is elaborated. Finally, the tracking performance of the present scheme is demonstrated by numerical results and compared with those of the traditional approaches.

引用

页码：1262 / 1272

页数：11

共 47 条

[1] Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof [J].

Al-Tamimi, Asma ;

Lewis, Frank L. ;

Abu-Khalaf, Murad .

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 38 (04) :943-949

[2] Value and Policy Iterations in Optimal Control and Adaptive Dynamic Programming [J].

Bertsekas, Dimitri P. .

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (03) :500-509

[3] Practical tracking control of perturbed uncertain nonaffine systems with full state constraints [J].

Cao, Ye ;

Song, Yongduan ;

Wen, Changyun .

AUTOMATICA, 2019, 110

[4] Reinforcement Learning-Based Adaptive Optimal Exponential Tracking Control of Linear Systems With Unknown Dynamics [J].

Chen, Ci ;

Modares, Hamidreza ;

Xie, Kan ;

Lewis, Frank L. ;

Wan, Yan ;

Xie, Shengli .

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2019, 64 (11) :4423-4438

[5] Data-Based Nonaffine Optimal Tracking Control Using Iterative DHP Approach [J].

Ha, Mingming ;

Wang, Ding ;

Liu, Derong .

IFAC PAPERSONLINE, 2020, 53 (02) :4246-4251

[6] Generalized value iteration for discounted optimal control with stability analysis [J].

Ha, Mingming ;

Wang, Ding ;

Liu, Derong .

SYSTEMS & CONTROL LETTERS, 2021, 147 (147)

[7]

Ha MM, 2020, CHIN CONTR CONF, P1951, DOI 10.23919/CCC50068.2020.9188706

[8] Event-Triggered Adaptive Critic Control Design for Discrete-Time Constrained Nonlinear Systems [J].

Ha, Mingming ;

Wang, Ding ;

Liu, Derong .

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (09) :3158-3168

[9] Event-triggered constrained control with DHP implementation for nonaffine discrete-time systems [J].

Ha, Mingming ;

Wang, Ding ;

Liu, Derong .

INFORMATION SCIENCES, 2020, 519 :110-123

[10] Online policy iteration ADP-based attitude-tracking control for hypersonic vehicles [J].

Han, Xiao ;

Zheng, Zongzhun ;

Liu, Lei ;

Wang, Bo ;

Cheng, Zhongtao ;

Fan, Huijin ;

Wang, Yongji .

AEROSPACE SCIENCE AND TECHNOLOGY, 2020, 106

← 1 2 3 4 5 →