Barrier-Critic Adaptive Robust Control of Nonzero-Sum Differential Games for Uncertain Nonlinear Systems With State Constraints

Cited by: 39

Authors:

Qin, Chunbin [1]

Qiao, Xiaopeng [1]

Wang, Jinguang [1]

Zhang, Dehua [1]

Hou, Yandong [1]

Hu, Shaolin [2]

Affiliations:

[1] Henan Univ, Sch Artificial Intelligence, Zhengzhou 450000, Peoples R China

[2] Guangdong Univ Petrochem Technol, Sch Automat, Maoming 525000, Peoples R China

Source:

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2024 / Vol. 54 / Issue 01

Funding:

National Natural Science Foundation of China;

Keywords:

Games; Safety; Cost function; Control systems; Differential games; Adaptive systems; Robust control; Adaptive dynamic programming (ADP); control barrier function (CBF); nonzero-sum (NZS) differential games; robust control; state constraints; SWITCHED NEUTRAL SYSTEMS; OPTIMAL TRACKING CONTROL; GUARANTEED COST CONTROL; NEURAL-NETWORKS; CONTROL DESIGN; HJB SOLUTION; STABILIZATION; APPROXIMATION; SCHEME;

DOI:

10.1109/TSMC.2023.3302656

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Subject classification code:

0812 ;

Abstract:

In this article, for the nonzero-sum (NZS) differential games problem of uncertain nonlinear systems with state constraints, an adaptive robust stabilization scheme based on the control barrier function (CBF) is presented under the influence of random disturbances and control input matrix uncertainty. To deal with the impact of uncertainty on the system, the nominal system of the original system is adopted and the cost functions associated with each player are appropriately chosen to convert the robust regulation problem of multiplayer differential games into an optimal regulation problem. Furthermore, the purpose of combining the cost function relevant to each player with the CBF is to make the system states evolve in the safe area. Different from the classical actor-critic dual neural network (NN), each player only needs a critic NN to approach the corresponding cost function without the restriction of the initial stabilizing control. Combined with the Lyapunov stability theory, under the combined influence of random disturbances and state constraints, the state and critic NN weights of the closed-loop system are guaranteed to be uniformly ultimately bounded (UUB). Finally, two simulation examples are used to verify the validity of the presented scheme.


Pages: 50-63

Number of pages: 14

References (52 in total):

[1] Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof [J]. Al-Tamimi, Asma; Lewis, Frank. 2007 IEEE INTERNATIONAL SYMPOSIUM ON APPROXIMATE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING, 2007, :38-+

[2] Ames AD, 2019, 2019 18TH EUROPEAN CONTROL CONFERENCE (ECC), P3420, DOI 10.23919/ECC.2019.8796030

[3] Robust optimal feedback control design for uncertain systems based on artificial neural network approximation of the Bellman's value function [J]. Ballesteros, Mariana; Chairez, Isaac; Poznyak, Alexander. NEUROCOMPUTING, 2020, 413 :134-144

[4] Obstacle Avoidance for Low-Speed Autonomous Vehicles With Barrier Function [J]. Chen, Yuxiao; Peng, Huei; Grizzle, Jessy. IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2018, 26 (01) :194-206

[5] Robust Control Barrier-Value Functions for Safety-Critical Control [J]. Choi, Jason J.; Lee, Donggun; Sreenath, Koushil; Tomlin, Claire J.; Herbert, Sylvia L. 2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, :6814-6821

[6] Cohen MH, 2020, IEEE DECIS CONTR P, P2062, DOI 10.1109/CDC42340.2020.9303896

[7] Event-triggered single-network ADP method for constrained optimal tracking control of continuous-time non-linear systems [J]. Cui, Lili; Xie, Xiangpeng; Wang, Xiaowei; Luo, Yanhong; Liu, Jingbo. APPLIED MATHEMATICS AND COMPUTATION, 2019, 352 :220-234

[8] Federica F., 2020, ROBOT AUTON SYST, V124

[9] Adaptive Learning and Control for MIMO System Based on Adaptive Dynamic Programming [J]. Fu, Jian; He, Haibo; Zhou, Xinmin. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2011, 22 (07) :1133-1148

[10] Ghadiri H, 2014, INT J CONTROL AUTOM, V12, P1167, DOI [10.1007/s12555-013-0524-8, 10.1007/s12555-013-0487-9]
