A Single-NN Iterative Adaptive Dynamic Programming Algorithm for Continuous-Time Nonlinear Zero-Sum Games

被引:0
|
作者
Song, Ruizhuo [1 ]
Li, Junsong [1 ]
机构
[1] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 100083, Peoples R China
来源
2018 37TH CHINESE CONTROL CONFERENCE (CCC) | 2018年
基金
中国国家自然科学基金;
关键词
Adaptive dynamic programming (ADP); zero-sum game (ZSG); single NN; least-squares method; EQUATION;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper establishes an approximate optimal critic learning algorithm based on single-network adaptive dynamic programming (ADP) aiming at solving for continuous-time 2-player zero-sum games(ZSG). However, the situation where the accurate dynamics is influenced by disturbance will occur from time to time. Because neural network(NN) is used in this paper, we have to face the approximation error, which will disturb the control. In order to surmount this problem, we use online data to calculate the weights of NN, and design robust controller to stabilize the disturbed nonlinear system. In other way, we used policy iteration and integral reinforcement learning to settle the Hamilton-Jacobi-Isaacs equation. And through the least-squares method, the NN weights are solved. Based on the theoretical analysis, this algorithm is a derivation from Gauss-Newton method, which can solve an optimization problem without disturbance. Thus it will converge to the optimal value. Because large quantities of online data are used, the process will accurately converge optimal control. Simulation results can verify that it's realizable to deal with disturbed nonlinear ZSG.
引用
收藏
页码:2848 / 2853
页数:6
相关论文
共 50 条
  • [1] An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games
    Zhang, Huaguang
    Wei, Qinglai
    Liu, Derong
    AUTOMATICA, 2011, 47 (01) : 207 - 214
  • [2] Robust Adaptive Dynamic Programming of Two-Player Zero-Sum Games for Continuous-Time Linear Systems
    Fu, Yue
    Fu, Jun
    Chai, Tianyou
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (12) : 3314 - 3319
  • [3] Online Iterative Adaptive Dynamic Programming Approach for Solving the Zero-Sum Game for Nonlinear Continuous-Time Systems with Partially Unknown Dynamics
    Fu, Bin
    Sun, Bo
    Guo, Hang
    Yang, Tao
    Fu, Wenxing
    PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 2833 - 2842
  • [4] Event-triggered adaptive dynamic programming algorithm for the nonlinear zero-sum differential games
    Cui L.-L.
    Zhang Y.
    Zhang X.
    Cui, Li-Li (cuilili8396@163.com), 2018, South China University of Technology (35): : 610 - 618
  • [5] Adaptive Dynamic Programming for Discrete-Time Zero-Sum Games
    Wei, Qinglai
    Liu, Derong
    Lin, Qiao
    Song, Ruizhuo
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (04) : 957 - 969
  • [6] Nonlinear Multi-Person Zero-Sum Differential Games Using Iterative Adaptive Dynamic Programming
    Wei Qinglai
    Liu Derong
    2011 30TH CHINESE CONTROL CONFERENCE (CCC), 2011, : 2456 - 2461
  • [7] Continuous-time zero-sum games with probability criterion
    Bhabak, Arnab
    Saha, Subhamay
    STOCHASTIC ANALYSIS AND APPLICATIONS, 2021, 39 (06) : 1130 - 1143
  • [8] Event-Triggered Adaptive Dynamic Programming for Zero-Sum Game of Partially Unknown Continuous-Time Nonlinear Systems
    Xue, Shan
    Luo, Biao
    Liu, Derong
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (09): : 3189 - 3199
  • [9] Event-Triggered Adaptive Dynamic Programming for Continuous-Time Nonlinear Two-Player Zero-Sum Game
    Xue, Shan
    Luo, Biao
    Liu, Derong
    Li, Yueheng
    NEURAL INFORMATION PROCESSING (ICONIP 2018), PT VII, 2018, 11307 : 15 - 25
  • [10] Discrete-Time Two-Player Zero-Sum Games for Nonlinear Systems Using Iterative Adaptive Dynamic Programming
    Wei, Qinglai
    Liu, Derong
    ADVANCES IN NEURAL NETWORKS - ISNN 2016, 2016, 9719 : 269 - 276