Discounted Iterative Adaptive Critic Designs With Novel Stability Analysis for Tracking Control

被引:106
作者
Ha, Mingming [1 ]
Wang, Ding [2 ,3 ]
Liu, Derong [4 ]
机构
[1] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 100083, Peoples R China
[2] Beijing Univ Technol, Fac Informat Technol, Beijing Lab Smart Environm Protect, Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China
[3] Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China
[4] Univ Illinois, Dept Elect & Comp Engn, Chicago, IL 60607 USA
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Adaptive critic design; adaptive dynamic programming (ADP); approximate dynamic programming; discrete-time nonlinear systems; reinforcement learning; stability analysis; tracking control; value iteration (VI); TIME NONLINEAR-SYSTEMS; LINEAR-SYSTEMS;
D O I
10.1109/JAS.2022.105692
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The core task of tracking control is to make the controlled plant track a desired trajectory. The traditional performance index used in previous studies cannot eliminate completely the tracking error as the number of time steps increases. In this paper, a new cost function is introduced to develop the value-iteration-based adaptive critic framework to solve the tracking control problem. Unlike the regulator problem, the iterative value function of tracking control problem cannot be regarded as a Lyapunov function. A novel stability analysis method is developed to guarantee that the tracking error converges to zero. The discounted iterative scheme under the new cost function for the special case of linear systems is elaborated. Finally, the tracking performance of the present scheme is demonstrated by numerical results and compared with those of the traditional approaches.
引用
收藏
页码:1262 / 1272
页数:11
相关论文
共 47 条
  • [1] Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof
    Al-Tamimi, Asma
    Lewis, Frank L.
    Abu-Khalaf, Murad
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 38 (04): : 943 - 949
  • [2] Value and Policy Iterations in Optimal Control and Adaptive Dynamic Programming
    Bertsekas, Dimitri P.
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (03) : 500 - 509
  • [3] Practical tracking control of perturbed uncertain nonaffine systems with full state constraints
    Cao, Ye
    Song, Yongduan
    Wen, Changyun
    [J]. AUTOMATICA, 2019, 110
  • [4] Reinforcement Learning-Based Adaptive Optimal Exponential Tracking Control of Linear Systems With Unknown Dynamics
    Chen, Ci
    Modares, Hamidreza
    Xie, Kan
    Lewis, Frank L.
    Wan, Yan
    Xie, Shengli
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2019, 64 (11) : 4423 - 4438
  • [5] Data-Based Nonaffine Optimal Tracking Control Using Iterative DHP Approach
    Ha, Mingming
    Wang, Ding
    Liu, Derong
    [J]. IFAC PAPERSONLINE, 2020, 53 (02): : 4246 - 4251
  • [6] Generalized value iteration for discounted optimal control with stability analysis
    Ha, Mingming
    Wang, Ding
    Liu, Derong
    [J]. SYSTEMS & CONTROL LETTERS, 2021, 147 (147)
  • [7] Ha MM, 2020, CHIN CONTR CONF, P1951, DOI 10.23919/CCC50068.2020.9188706
  • [8] Event-Triggered Adaptive Critic Control Design for Discrete-Time Constrained Nonlinear Systems
    Ha, Mingming
    Wang, Ding
    Liu, Derong
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (09): : 3158 - 3168
  • [9] Event-triggered constrained control with DHP implementation for nonaffine discrete-time systems
    Ha, Mingming
    Wang, Ding
    Liu, Derong
    [J]. INFORMATION SCIENCES, 2020, 519 : 110 - 123
  • [10] Online policy iteration ADP-based attitude-tracking control for hypersonic vehicles
    Han, Xiao
    Zheng, Zongzhun
    Liu, Lei
    Wang, Bo
    Cheng, Zhongtao
    Fan, Huijin
    Wang, Yongji
    [J]. AEROSPACE SCIENCE AND TECHNOLOGY, 2020, 106