Robust Optimal Control for Disturbed Nonlinear Zero-Sum Differential Games Based on Single NN and Least Squares

Times cited: 70
Authors
Song, Ruizhuo [1]
Li, Junsong [1]
Lewis, Frank L. [2, 3]
Affiliations
[1] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 100083, Peoples R China
[2] Univ Texas Arlington, UTA Res Inst, Arlington, TX 76118 USA
[3] Northeastern Univ, State Key Lab Synthet Automat Proc Ind, Shenyang 110036, Peoples R China
Source
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2020 / Vol. 50 / No. 11
Funding
National Natural Science Foundation of China;
Keywords
Game theory; Artificial neural networks; Games; Optimal control; Heuristic algorithms; Nonlinear systems; Adaptive critic designs; adaptive dynamic programming; approximate dynamic programming (ADP); policy iteration (PI); robust control; zero-sum game (ZSG); STATE-FEEDBACK CONTROL; POLICY UPDATE ALGORITHM; LEARNING ALGORITHM; SYSTEMS; APPROXIMATION; EQUATION; DESIGNS;
DOI
10.1109/TSMC.2019.2897379
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
This paper establishes an approximately optimal critic learning algorithm, based on single neural network (NN) policy iteration (PI), for solving continuous-time (CT) two-player zero-sum games (ZSGs). In practice, errors disturb the dynamics, and identifying the dynamics in turn generates errors. To limit the effect of these errors, a single NN-based online PI algorithm is developed for the CT system, which is a disturbed nonlinear ZSG. With sufficient online data, the Hamilton-Jacobi-Isaacs equation can be solved without complete knowledge of the dynamics, and the NN weights are then obtained by the least-squares method. Moreover, for the undisturbed system, the way the NN weights are obtained in this paper is found to be equivalent to obtaining the optimal solution by the Gauss-Newton method. Based on the convergence of the Gauss-Newton method, the optimal controller for the undisturbed system can be obtained efficiently from online data. Once the controller of the undisturbed system is available, the disturbance is taken into consideration and a robust control pair is designed to overcome it. A set of simulations is designed to demonstrate the effectiveness of the algorithm; the results verify that it can solve the disturbed nonlinear ZSG.
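The least-squares step mentioned in the abstract can be illustrated with a minimal sketch. It is not the paper's algorithm: it assumes a scalar linear system with fixed linear policies for both players, a one-term quadratic critic basis, and an integral Bellman equation for policy evaluation. The function name, parameters, and numerical values are all hypothetical and chosen only so that the fitted weight can be checked against a closed-form value.

```python
import numpy as np

def fit_critic_weights(a=-1.0, b=1.0, k=0.5, K=1.0, L=0.2,
                       q=1.0, R=1.0, gamma=2.0, T=0.1, n_samples=20):
    """Fit a one-term critic V(x) ~ w * x**2 by least squares.

    Assumed (hypothetical) setup, not taken from the paper:
      dynamics   dx/dt = a*x + b*u + k*d
      policies   u = -K*x (control), d = L*x (disturbance player)
      stage cost q*x**2 + R*u**2 - gamma**2 * d**2
    """
    a_cl = a - b * K + k * L                # closed-loop pole
    c = q + R * K**2 - gamma**2 * L**2      # stage cost equals c * x**2
    tau = np.linspace(0.0, T, 201)          # time grid on one interval

    rows, targets = [], []
    for x0 in np.linspace(0.5, 2.0, n_samples):
        x = x0 * np.exp(a_cl * tau)         # trajectory data on [0, T]
        cost = c * x**2
        # Trapezoidal estimate of the cost accumulated over the interval.
        integral = np.sum(0.5 * (cost[1:] + cost[:-1]) * np.diff(tau))
        # Integral Bellman equation: w * (phi(x_0) - phi(x_T)) = integral.
        rows.append([x[0]**2 - x[-1]**2])
        targets.append(integral)

    Phi = np.array(rows)
    y = np.array(targets)
    w, *_ = np.linalg.lstsq(Phi, y, rcond=None)

    # Closed-form value coefficient for this linear-quadratic case.
    p_exact = -c / (2.0 * a_cl)
    return w[0], p_exact

if __name__ == "__main__":
    w_hat, p = fit_critic_weights()
    print(f"least-squares weight: {w_hat:.6f}, closed form: {p:.6f}")
```

Only trajectory samples and accumulated costs enter the regression, so the drift coefficient `a` is never used in the fit itself; in this sketch it appears only to generate the data and the reference value.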
Pages: 4009 - 4019
Page count: 11
Related papers
50 in total
  • [1] Asymmetric Constrained Optimal Tracking Control With Critic Learning of Nonlinear Multiplayer Zero-Sum Games
    Qiao, Junfei
    Li, Menghua
    Wang, Ding
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (04) : 5671 - 5683
  • [2] Robust Control of Unknown Observable Nonlinear Systems Solved as a Zero-Sum Game
    Radac, Mircea-Bogdan
    Lala, Timotei
    IEEE ACCESS, 2020, 8 (08) : 214153 - 214165
  • [3] An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games
    Zhang, Huaguang
    Wei, Qinglai
    Liu, Derong
    AUTOMATICA, 2011, 47 (01) : 207 - 214
  • [4] Event-triggered optimal control scheme for discrete-time nonlinear zero-sum games
    Zhang X.
    Bo Y.-C.
    Cui L.-L.
    Zhang, Xin (zhangxin@upc.edu.cn), 2018, South China University of Technology (35) : 619 - 626
  • [5] Model-Free Adaptive Control for Unknown Nonlinear Zero-Sum Differential Game
    Zhong, Xiangnan
    He, Haibo
    Wang, Ding
    Ni, Zhen
    IEEE TRANSACTIONS ON CYBERNETICS, 2018, 48 (05) : 1633 - 1646
  • [6] Adaptive Learning Based Output-Feedback Optimal Control of CT Two-Player Zero-Sum Games
    Zhao, Jun
    Lv, Yongfeng
    Zhao, Ziliang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2022, 69 (03) : 1437 - 1441
  • [7] Model-free Adaptive Dynamic Programming for Online optimal Solution of the Unknown Nonlinear Zero-Sum Differential Game
    Qin, Chunbin
    Zhang, Huaguang
    Luo, Yanhong
    PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014 : 3815 - 3820
  • [8] Primal-Dual Reinforcement Learning for Zero-Sum Games in the Optimal Tracking Control
    Que, Xuejie
    Wang, Zhenlei
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2024, 71 (06) : 3146 - 3150
  • [9] Relaxed Policy Iteration Algorithm for Nonlinear Zero-Sum Games With Application to H-Infinity Control
    Li, Jie
    Li, Shengbo Eben
    Duan, Jingliang
    Lyu, Yao
    Zou, Wenjun
    Guan, Yang
    Yin, Yuming
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2024, 69 (01) : 426 - 433
  • [10] Multiperson zero-sum differential games for a class of uncertain nonlinear systems
    Liu, Derong
    Wei, Qinglai
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2014, 28 (3-5) : 205 - 231