Model-Free Adaptive Dynamic Programming for Online Optimal Solution of the Unknown Nonlinear Zero-Sum Differential Game

Cited by: 0
Authors
Qin, Chunbin [1]
Zhang, Huaguang [2]
Luo, Yanhong [2]
Affiliations
[1] Henan Univ, Sch Comp & Informat Engn, Kaifeng 475004, Peoples R China
[2] Northeastern Univ, Sch Informat Sci & Engn, Shenyang 110004, Peoples R China
Source
PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2014
Keywords
POLICY UPDATE ALGORITHM; STATE-FEEDBACK CONTROL; EQUATION; SYSTEMS; DESIGNS; SOLVE
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
It is well known that the two-player zero-sum differential game problem for continuous-time nonlinear systems relies on the solution of the Hamilton-Jacobi-Isaacs (HJI) equation, a nonlinear partial differential equation that is difficult or impossible to solve. In this paper, a new model-free adaptive dynamic programming algorithm is developed for solving the HJI equation online for continuous-time nonlinear systems whose dynamics are completely unknown. First, a simultaneous policy iteration algorithm is given, which solves the HJI equation offline and requires full knowledge of the system dynamics. Second, based on the simultaneous policy iteration algorithm, a new model-free adaptive dynamic programming algorithm is developed that solves the HJI equation online without requiring any knowledge of the system dynamics. Finally, a numerical example is given to demonstrate the convergence and effectiveness of the proposed scheme.
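For orientation, the Hamilton-Jacobi-Isaacs equation mentioned in the abstract can be written out under a commonly used problem setup. The abstract does not state the system or cost, so everything below is an assumption taken from the standard formulation in the adaptive dynamic programming literature: an input-affine system with drift f(x), control gain g(x), disturbance gain k(x), state penalty Q(x), control weight R > 0, and attenuation level \gamma > 0. The paper's own notation may differ.

```latex
% Assumed setup (not taken from the abstract): input-affine dynamics
% with control u (minimizing player) and disturbance w (maximizing player).
\dot{x} = f(x) + g(x)\,u + k(x)\,w, \qquad
J(x_0, u, w) = \int_0^{\infty} \bigl( Q(x) + u^{\top} R\, u - \gamma^{2} w^{\top} w \bigr)\, dt

% Hamilton-Jacobi-Isaacs equation for the value function V(x), with V(0) = 0:
0 = Q(x) + \nabla V^{\top} f(x)
    - \tfrac{1}{4}\, \nabla V^{\top} g(x) R^{-1} g(x)^{\top} \nabla V
    + \tfrac{1}{4\gamma^{2}}\, \nabla V^{\top} k(x) k(x)^{\top} \nabla V

% Saddle-point policies obtained from the stationarity conditions of the Hamiltonian:
u^{*}(x) = -\tfrac{1}{2}\, R^{-1} g(x)^{\top} \nabla V(x), \qquad
w^{*}(x) = \tfrac{1}{2\gamma^{2}}\, k(x)^{\top} \nabla V(x)
```

In this form the equation depends explicitly on f, g, and k, which is why an offline policy iteration needs the full dynamics, while a model-free scheme must avoid evaluating those terms directly.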
Pages: 3815-3820
Number of pages: 6