Adaptive optimal safety tracking control for multiplayer mixed zero-sum games of continuous-time systems

被引:0
|
作者
Chunbin Qin
Zhongwei Zhang
Ziyang Shang
Jishi Zhang
Dehua Zhang
机构
[1] Henan University,School of Artificial Intelligence
[2] Henan University,School of Software
来源
Applied Intelligence | 2023年 / 53卷
关键词
Mixed zero-sum(MZS) games; Control barrier function(CBF); Adaptive dynamic programming(ADP); Trajectory tracking;
D O I
暂无
中图分类号
学科分类号
摘要
When the equipment is working, it is very important to avoid the occurrence of malignant accidents by providing a highly reliable safety protection means. In this paper, for multiplayer mixed zero-sum games, an optimal safety tracking control scheme based on adaptive dynamic programming (ADP) is proposed, and a control barrier function (CBF) is introduced into the value function of the system to ensure that the system operates within its safe region. Firstly, through system transformation, the original tracking problem is transformed into a state tracking error problem. Secondly, an augmented Hamilton-Jacobi-Bellman (HJB) equation is derived from the improved augmented error system and the value function. Different from traditional methods, this paper uses a single critic neural network (NN) instead of the actor-critic NN to approximate the Nash equilibrium solution of the system, and introduces a concurrent learning technique that can relax the traditional continuous excitation condition into a simplified condition of recording data. Then, according to the Lyapunov theory, the stability of the system is analyzed in detail. Finally, two simulation examples are used to verify the effectiveness of the proposed scheme.
引用
收藏
页码:17460 / 17475
页数:15
相关论文
共 50 条
  • [21] Approximation of zero-sum continuous-time Markov games under the discounted payoff criterion
    Prieto-Rumeau, Tomas
    Maria Lorenzo, Jose
    TOP, 2015, 23 (03) : 799 - 836
  • [22] Zero-sum continuous-time Markov games with unbounded transition and discounted payoff rates
    Guo, XP
    Hernández-Lerma, O
    BERNOULLI, 2005, 11 (06) : 1009 - 1029
  • [23] Single-network ADP for near optimal control of continuous-time zero-sum games without using initial stabilising control laws
    Mu, Chaoxu
    Wang, Ke
    IET CONTROL THEORY AND APPLICATIONS, 2018, 12 (18): : 2449 - 2458
  • [24] Discrete-time Optimal Zero-sum Games for Nonlinear Systems via Adaptive Dynamic Programming
    Wei, Qinglai
    Song, Ruizhuo
    Xu, Yancai
    Liu, Derong
    Lin, Qiao
    2017 6TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS (DDCLS), 2017, : 357 - 364
  • [25] Optimal strategies for adaptive zero-sum average Markov games
    Adolfo Minjarez-Sosa, J.
    Vega-Amaya, Oscar
    JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 2013, 402 (01) : 44 - 56
  • [26] Online reinforcement learning multiplayer non-zero sum games of continuous-time Markov jump linear systems
    Xin, Xilin
    Tu, Yidong
    Stojanovic, Vladimir
    Wang, Hai
    Shi, Kaibo
    He, Shuping
    Pan, Tianhong
    APPLIED MATHEMATICS AND COMPUTATION, 2022, 412
  • [27] Primal-Dual Reinforcement Learning for Zero-Sum Games in the Optimal Tracking Control
    Que, Xuejie
    Wang, Zhenlei
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2024, 71 (06) : 3146 - 3150
  • [28] Zero-sum games for continuous-time jump Markov processes in polish spaces: Discounted payoffs
    Guo, Xianping
    Hernandez-Lerma, Onesimo
    ADVANCES IN APPLIED PROBABILITY, 2007, 39 (03) : 645 - 668
  • [29] Zero-sum games for continuous-time Markov chains with unbounded transition and average payoff rates
    Guo, XP
    Hernández-Lerma, O
    JOURNAL OF APPLIED PROBABILITY, 2003, 40 (02) : 327 - 345
  • [30] Integral Reinforcement Learning for Linear Continuous-Time Zero-Sum Games With Completely Unknown Dynamics
    Li, Hongliang
    Liu, Derong
    Wang, Ding
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2014, 11 (03) : 706 - 714