Finite-time safe reinforcement learning control of multi-player nonzero-sum game for quadcopter systems

Cited by: 1

Authors:

Tan, Junkai [1,2]

Xue, Shuangsi [1,2]

Guan, Qingshu [1,2]

Qu, Kai [1,2]

Cao, Hui [1,2]

Affiliations:

[1] Xi An Jiao Tong Univ, Sch Elect Engn, Xian 710049, Peoples R China

[2] Xi An Jiao Tong Univ, State Key Lab, Xian 710049, Peoples R China

Source:

INFORMATION SCIENCES | 2025 / Vol. 712

Funding:

China Postdoctoral Science Foundation;

Keywords:

Finite-time optimal control; Nonzero-sum game; Reinforcement learning; Neural network; Dynamic event-trigger; Adaptive dynamic programming; Synchronization

DOI:

10.1016/j.ins.2025.122117

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

This paper investigates a finite-time safe reinforcement learning control algorithm for multi-player nonzero-sum games (FT-SRL-NZS). To address the finite-time safe optimal control problem, value functions incorporating designated barrier functions for the involved players are established within the transformed finite-time stable space. The finite-time safe optimal controller is derived from the solution to the transformed Nash equilibrium condition. An actor-critic structure is proposed for solving the Hamilton-Jacobi-Bellman (HJB) equation in the finite-time stable space; it approximates the finite-time optimal value and the corresponding controller using a novel finite-time concurrent learning update law. A dynamic event-trigger rule adjusts the trigger condition in real time, thereby reducing the computational and communication demands associated with calculating the Nash equilibrium. Lyapunov stability analysis is employed to examine the finite-time equilibrium of the closed-loop system. Numerical simulations and unmanned aerial vehicle (UAV) hardware tests are carried out to illustrate the efficacy of the proposed finite-time safe control algorithm.

Pages: 21
