Robust Optimal Control for Disturbed Nonlinear Zero-Sum Differential Games Based on Single NN and Least Squares

被引：70

作者：

Song, Ruizhuo ^{[1
]}

Li, Junsong ^{[1
]}

Lewis, Frank L. ^{[2
,3
]}

机构：

[1] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 100083, Peoples R China

[2] Univ Texas Arlington, UTA Res Inst, Arlington, TX 76118 USA

[3] Northeastern Univ, State Key Lab Synthet Automat Proc Ind, Shenyang 110036, Peoples R China

来源：

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2020年 / 50卷 / 11期

基金：

中国国家自然科学基金;

关键词：

Game theory; Artificial neural networks; Games; Optimal control; Heuristic algorithms; Nonlinear systems; Adaptive critic designs; adaptive dynamic programming; approximate dynamic programming (ADP); policy iteration (PI); robust control; zero-sum game (ZSG); STATE-FEEDBACK CONTROL; POLICY UPDATE ALGORITHM; LEARNING ALGORITHM; SYSTEMS; APPROXIMATION; EQUATION; DESIGNS;

D O I：

10.1109/TSMC.2019.2897379

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper establishes an approximate optimal critic learning algorithm based on single neural network (NN) policy iteration (PI) aiming at solving for continuous-time (CT) 2-player zero-sum games (ZSGs). In fact, we have to face the problem that the errors will disturb the dynamics and in turn identifying dynamics will generate errors. In order to prevent the effect of errors, in this paper, a single NN-based online PI algorithm is developed for the CT system, which is disturbed nonlinear ZSG. With plenty of online data, the Hamilton-Jacobi-Isaacs equation can be solved without complete dynamics. Then by the least-squares method, we can obtain the NN weights. Moreover, in the process of dealing with the undisturbed system, we find the way that obtains NN weights in this paper is equal to the way that obtains the optimal solution by the Gauss-Newton method. Based on the convergence of the Gauss-Newton method, we can efficiently obtain the optimal controller for the undisturbed system by utilizing online data. After getting the controller of the undisturbed system, it is time to take disturbance into consideration, so that we design a robust control pair to overcome the disturbance. In order to demonstrate the effectiveness of this algorithm, we design a set of simulations. The results verify that we can solve the disturbed nonlinear ZSG by this algorithm.

引用

页码：4009 / 4019

页数：11

共 50 条

[1] Asymmetric Constrained Optimal Tracking Control With Critic Learning of Nonlinear Multiplayer Zero-Sum Games
Qiao, Junfei
Li, Menghua
Wang, Ding
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (04) : 5671 - 5683
[2] Robust Control of Unknown Observable Nonlinear Systems Solved as a Zero-Sum Game
Radac, Mircea-Bogdan
Lala, Timotei
IEEE ACCESS, 2020, 8 (08): : 214153 - 214165
[3] An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games
Zhang, Huaguang
Wei, Qinglai
Liu, Derong
AUTOMATICA, 2011, 47 (01) : 207 - 214
[4] Event-triggered optimal control scheme for discrete-time nonlinear zero-sum games
Zhang X.
Bo Y.-C.
Cui L.-L.
Zhang, Xin (zhangxin@upc.edu.cn), 2018, South China University of Technology (35): : 619 - 626
[5] Model-Free Adaptive Control for Unknown Nonlinear Zero-Sum Differential Game
Zhong, Xiangnan
He, Haibo
Wang, Ding
Ni, Zhen
IEEE TRANSACTIONS ON CYBERNETICS, 2018, 48 (05) : 1633 - 1646
[6] Adaptive Learning Based Output-Feedback Optimal Control of CT Two-Player Zero-Sum Games
Zhao, Jun
Lv, Yongfeng
Zhao, Ziliang
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (03) : 1437 - 1441
[7] Model-free Adaptive Dynamic Programming for Online optimal Solution of the Unknown Nonlinear Zero-Sum Differential Game
Qin, Chunbin
Zhang, Huaguang
Luo, Yanhong
PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 3815 - 3820
[8] Primal-Dual Reinforcement Learning for Zero-Sum Games in the Optimal Tracking Control
Que, Xuejie
Wang, Zhenlei
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2024, 71 (06) : 3146 - 3150
[9] Relaxed Policy Iteration Algorithm for Nonlinear Zero-Sum Games With Application to H-Infinity Control
Li, Jie
Li, Shengbo Eben
Duan, Jingliang
Lyu, Yao
Zou, Wenjun
Guan, Yang
Yin, Yuming
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2024, 69 (01) : 426 - 433
[10] Multiperson zero-sum differential games for a class of uncertain nonlinear systems
Liu, Derong
Wei, Qinglai
INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2014, 28 (3-5) : 205 - 231

← 1 2 3 4 5 →