Adaptive optimal safety tracking control for multiplayer mixed zero-sum games of continuous-time systems

Cited by: 0

Authors:

Chunbin Qin

Zhongwei Zhang

Ziyang Shang

Jishi Zhang

Dehua Zhang

Affiliations:

[1] Henan University, School of Artificial Intelligence

[2] Henan University, School of Software

Source:

Applied Intelligence | 2023 / Vol. 53

Keywords:

Mixed zero-sum (MZS) games; Control barrier function (CBF); Adaptive dynamic programming (ADP); Trajectory tracking

DOI:

Not available

Chinese Library Classification (CLC) number:

Subject classification number:

Abstract:

Reliable safety protection is essential for preventing serious accidents while equipment is in operation. This paper proposes an optimal safety tracking control scheme based on adaptive dynamic programming (ADP) for multiplayer mixed zero-sum (MZS) games, in which a control barrier function (CBF) is incorporated into the value function to keep the system operating within its safe region. First, a system transformation converts the original tracking problem into a problem in the state tracking error. Second, an augmented Hamilton-Jacobi-Bellman (HJB) equation is derived from the improved augmented error system and the value function. Unlike traditional methods, the scheme uses a single critic neural network (NN) rather than an actor-critic structure to approximate the Nash equilibrium solution, and it introduces a concurrent learning technique that relaxes the traditional persistence of excitation condition to a simpler condition on recorded data. The stability of the system is then analyzed using Lyapunov theory. Finally, two simulation examples verify the effectiveness of the proposed scheme.
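
The idea of folding a barrier term into the value function can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration, not the paper's formulation: the scalar state, the symmetric limit `limit`, the logarithmic barrier form, the weights `q` and `r`, and the function names `log_barrier` and `running_cost` are all hypothetical. The sketch only shows how a barrier that is zero at the origin and unbounded at the safety boundary makes near-boundary states expensive in the running cost.

```python
import math


def log_barrier(x: float, limit: float) -> float:
    """Hypothetical log-type barrier on a scalar state.

    Zero at x = 0, positive inside (-limit, limit), and unbounded
    as |x| approaches the limit. Returns inf on or outside the boundary.
    """
    if abs(x) >= limit:
        return math.inf
    return math.log(limit**2 / (limit**2 - x**2))


def running_cost(error: float, control: float, state: float,
                 limit: float, q: float = 1.0, r: float = 1.0) -> float:
    """Quadratic tracking cost plus the barrier penalty (illustrative only)."""
    return q * error**2 + r * control**2 + log_barrier(state, limit)


if __name__ == "__main__":
    # The penalty grows as the state moves toward the assumed limit of 2.0.
    for s in (0.0, 1.0, 1.9):
        print(s, running_cost(error=0.5, control=0.0, state=s, limit=2.0))
```

Because the penalty diverges at the boundary, any policy with finite accumulated cost under such a term keeps the state strictly inside the assumed safe interval, which is the intuition behind embedding a barrier in the value function.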

Pages: 17460-17475

Page count: 15
