A Single-NN Iterative Adaptive Dynamic Programming Algorithm for Continuous-Time Nonlinear Zero-Sum Games

被引：0

作者：

Song, Ruizhuo ^{[1
]}

Li, Junsong ^{[1
]}

机构：

[1] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 100083, Peoples R China

来源：

2018 37TH CHINESE CONTROL CONFERENCE (CCC) | 2018年

基金：

中国国家自然科学基金;

关键词：

Adaptive dynamic programming (ADP); zero-sum game (ZSG); single NN; least-squares method; EQUATION;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper establishes an approximate optimal critic learning algorithm based on single-network adaptive dynamic programming (ADP) aiming at solving for continuous-time 2-player zero-sum games(ZSG). However, the situation where the accurate dynamics is influenced by disturbance will occur from time to time. Because neural network(NN) is used in this paper, we have to face the approximation error, which will disturb the control. In order to surmount this problem, we use online data to calculate the weights of NN, and design robust controller to stabilize the disturbed nonlinear system. In other way, we used policy iteration and integral reinforcement learning to settle the Hamilton-Jacobi-Isaacs equation. And through the least-squares method, the NN weights are solved. Based on the theoretical analysis, this algorithm is a derivation from Gauss-Newton method, which can solve an optimization problem without disturbance. Thus it will converge to the optimal value. Because large quantities of online data are used, the process will accurately converge optimal control. Simulation results can verify that it's realizable to deal with disturbed nonlinear ZSG.

引用

页码：2848 / 2853

页数：6

共 50 条

[1] An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games
Zhang, Huaguang
Wei, Qinglai
Liu, Derong
AUTOMATICA, 2011, 47 (01) : 207 - 214
[2] Robust Adaptive Dynamic Programming of Two-Player Zero-Sum Games for Continuous-Time Linear Systems
Fu, Yue
Fu, Jun
Chai, Tianyou
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (12) : 3314 - 3319
[3] Online Iterative Adaptive Dynamic Programming Approach for Solving the Zero-Sum Game for Nonlinear Continuous-Time Systems with Partially Unknown Dynamics
Fu, Bin
Sun, Bo
Guo, Hang
Yang, Tao
Fu, Wenxing
PROCEEDINGS OF 2022 INTERNATIONAL CONFERENCE ON AUTONOMOUS UNMANNED SYSTEMS, ICAUS 2022, 2023, 1010 : 2833 - 2842
[4] Event-triggered adaptive dynamic programming algorithm for the nonlinear zero-sum differential games
Cui L.-L.
Zhang Y.
Zhang X.
Cui, Li-Li (cuilili8396@163.com), 2018, South China University of Technology (35): : 610 - 618
[5] Adaptive Dynamic Programming for Discrete-Time Zero-Sum Games
Wei, Qinglai
Liu, Derong
Lin, Qiao
Song, Ruizhuo
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (04) : 957 - 969
[6] Nonlinear Multi-Person Zero-Sum Differential Games Using Iterative Adaptive Dynamic Programming
Wei Qinglai
Liu Derong
2011 30TH CHINESE CONTROL CONFERENCE (CCC), 2011, : 2456 - 2461
[7] Continuous-time zero-sum games with probability criterion
Bhabak, Arnab
Saha, Subhamay
STOCHASTIC ANALYSIS AND APPLICATIONS, 2021, 39 (06) : 1130 - 1143
[8] Event-Triggered Adaptive Dynamic Programming for Zero-Sum Game of Partially Unknown Continuous-Time Nonlinear Systems
Xue, Shan
Luo, Biao
Liu, Derong
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, 50 (09): : 3189 - 3199
[9] Event-Triggered Adaptive Dynamic Programming for Continuous-Time Nonlinear Two-Player Zero-Sum Game
Xue, Shan
Luo, Biao
Liu, Derong
Li, Yueheng
NEURAL INFORMATION PROCESSING (ICONIP 2018), PT VII, 2018, 11307 : 15 - 25
[10] Discrete-Time Two-Player Zero-Sum Games for Nonlinear Systems Using Iterative Adaptive Dynamic Programming
Wei, Qinglai
Liu, Derong
ADVANCES IN NEURAL NETWORKS - ISNN 2016, 2016, 9719 : 269 - 276

← 1 2 3 4 5 →