Neuro-optimal tracking control for a class of discrete-time nonlinear systems via generalized value iteration adaptive dynamic programming approach

Citations: 0
Authors
Qinglai Wei
Derong Liu
Yancai Xu
Affiliation
[1] Chinese Academy of Sciences, The State Key Laboratory of Management and Control for Complex Systems, Institute of Automation
Source
Soft Computing | 2016 / Vol. 20
Keywords
Adaptive dynamic programming; Approximate dynamic programming; Adaptive critic designs; Optimal control; Neural networks; Nonlinear systems; Reinforcement learning;
DOI
Not available
Chinese Library Classification number
Subject classification number
Abstract
This paper develops a novel value iteration adaptive dynamic programming (ADP) algorithm, called the “generalized value iteration ADP” algorithm, to solve infinite-horizon optimal tracking control problems for a class of discrete-time nonlinear systems. The algorithm can be initialized with an arbitrary positive semi-definite function, which overcomes a limitation of traditional value iteration algorithms. A convergence analysis is developed to guarantee that the iterative performance index function converges to the optimum. To implement the algorithm, neural networks are used to approximate the iterative performance index function and to compute the iterative control policy. Finally, a simulation example illustrates the performance of the developed algorithm.
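The idea of starting value iteration from an arbitrary positive semi-definite function can be illustrated with a minimal sketch. This is not the paper's neural-network implementation: it assumes a hypothetical scalar linear tracking-error system e_{k+1} = a*e_k + b*v_k with stage cost q*e^2 + r*v^2, for which the iterative performance index function takes the form V_i(e) = p_i*e^2 and the value iteration update has a closed form. The function name, the parameters `a`, `b`, `q`, `r`, `p0`, and the numeric values are all assumptions chosen for illustration.

```python
def generalized_value_iteration(a, b, q, r, p0, iterations=200):
    """Value iteration for a hypothetical scalar error system.

    Assumed model (not from the paper): e_{k+1} = a*e_k + b*v_k,
    stage cost q*e^2 + r*v^2, and V_i(e) = p_i * e^2.
    The update V_{i+1}(e) = min_v [q*e^2 + r*v^2 + V_i(a*e + b*v)]
    then reduces to a scalar recursion on p_i.
    """
    if p0 < 0:
        raise ValueError("p0 must be non-negative (positive semi-definite V_0)")
    p = float(p0)
    history = [p]
    for _ in range(iterations):
        # The minimizing control is v = -(a*b*p / (r + b*b*p)) * e;
        # substituting it back gives the next coefficient p_{i+1}.
        p = q + a * a * p - (a * b * p) ** 2 / (r + b * b * p)
        history.append(p)
    return history


# Illustrative parameters (assumptions): an unstable open-loop error system.
A, B, Q, R = 1.2, 1.0, 1.0, 1.0

from_zero = generalized_value_iteration(A, B, Q, R, p0=0.0)    # traditional start
from_large = generalized_value_iteration(A, B, Q, R, p0=50.0)  # arbitrary start

print("limit from p0 = 0 :", from_zero[-1])
print("limit from p0 = 50:", from_large[-1])
```

In this toy setting both initializations settle on the same coefficient, one from below and one from above, which mirrors the convergence property the abstract describes. For a general nonlinear system no closed form exists, which is where the abstract's neural-network approximators come in.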
Pages: 697 - 706
Page count: 9
Related papers
50 in total
  • [31] An iterative adaptive dynamic programming algorithm for optimal control of unknown discrete-time nonlinear systems with constrained inputs
    Liu, Derong
    Wang, Ding
    Yang, Xiong
    INFORMATION SCIENCES, 2013, 220 : 331 - 342
  • [32] Discrete-time Optimal Zero-sum Games for Nonlinear Systems via Adaptive Dynamic Programming
    Wei, Qinglai
    Song, Ruizhuo
    Xu, Yancai
    Liu, Derong
    Lin, Qiao
    2017 6TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS (DDCLS), 2017, : 357 - 364
  • [33] Finite Horizon Optimal Tracking Control for a Class of Discrete-Time Nonlinear Systems
    Wei, Qinglai
    Wang, Ding
    Liu, Derong
    ADVANCES IN NEURAL NETWORKS - ISNN 2011, PT II, 2011, 6676 : 620 - 629
  • [34] Policy Iteration for Optimal Control of Discrete-Time Time-Varying Nonlinear Systems
    Zhu, Guangyu
    Li, Xiaolu
    Sun, Ranran
    Yang, Yiyuan
    Zhang, Peng
    IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2023, 10 (03) : 781 - 791
  • [35] Policy Iteration for Optimal Control of Discrete-Time Time-Varying Nonlinear Systems
    Guangyu Zhu
    Xiaolu Li
    Ranran Sun
    Yiyuan Yang
    Peng Zhang
    IEEE/CAA Journal of Automatica Sinica, 2023, 10 (03) : 781 - 791
  • [36] Approximate Optimal tracking Control for Nonlinear Discrete-time Switched Systems via Approximate Dynamic Programming
    Qin, Chunbin
    Huang, Yizhe
    Yang, Yabin
    Zhang, Jishi
    Liu, Xianxing
    PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019, : 1456 - 1461
  • [37] Invariant Adaptive Dynamic Programming for Discrete-Time Optimal Control
    Zhu, Yuanheng
    Zhao, Dongbin
    He, Haibo
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (11) : 3959 - 3971
  • [38] Output feedback tracking control of a class of continuous-time nonlinear systems via adaptive dynamic programming approach
    Yang, Yang
    Xu, Chuang
    Yue, Dong
    Xie, Xiangpeng
    INFORMATION SCIENCES, 2018, 469 : 1 - 13
  • [39] Multi-step heuristic dynamic programming for optimal control of nonlinear discrete-time systems
    Luo, Biao
    Liu, Derong
    Huang, Tingwen
    Yang, Xiong
    Ma, Hongwen
    INFORMATION SCIENCES, 2017, 411 : 66 - 83
  • [40] Parallel Cross Entropy Policy Gradient Adaptive Dynamic Programming for Optimal Tracking Control of Discrete-Time Nonlinear Systems
    Xu, Jiahui
    Wang, Jingcheng
    Rao, Jun
    Zhong, Yanjiu
    Wu, Shunyu
    Sun, Qifang
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (06) : 3809 - 3821