Finite-time safe reinforcement learning control of multi-player nonzero-sum game for quadcopter systems

被引：1

作者：

Tan, Junkai ^{[1
,2
]}

Xue, Shuangsi ^{[1
,2
]}

Guan, Qingshu ^{[1
,2
]}

Qu, Kai ^{[1
,2
]}

Cao, Hui ^{[1
,2
]}

机构：

[1] Xi An Jiao Tong Univ, Sch Elect Engn, Xian 710049, Peoples R China

[2] Xi An Jiao Tong Univ, State Key Lab, Xian 710049, Peoples R China

来源：

INFORMATION SCIENCES | 2025年 / 712卷

基金：

中国博士后科学基金;

关键词：

Finite-time optimal control; Nonzero-sum game; Reinforcement learning; Neural network; Dynamic event-trigger; Adaptive dynamic programming; SYNCHRONIZATION;

D O I：

10.1016/j.ins.2025.122117

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper investigates a finite-time safe reinforcement learning control algorithm for multi-player nonzero-sum games (FT-SRL-NZS). In addressing the finite-time safe optimal control issue, value functions incorporating designated barrier functions for the involved players are established within the transformed finite-time stable space. The finite-time safe optimal controller is derived from the solution to the transformed Nash equilibrium condition. An actor-critic structure is proposed for solving the Hamilton-Jacobi-Bellman (HJB) equation in the finite-time stable space, aimed at approximating the finite-time optimal value and its corresponded controller using a novel finite-time concurrent learning update law. A dynamic event-trigger rule adjusts the trigger condition in real time, thereby minimizing the computational and communicative demands associated with calculating Nash equilibrium. Lyapunov stability analysis is employed to examine the finite-time equilibrium of the closed-loop system. Numerical simulations and unmanned aerial vehicle (UAV) hardware tests are carried out to illustrate the efficacy of the proposed finite-time safe control algorithm.

引用

页数：21

共 50 条

[31] Observer-based event-triggered control for zero-sum games of input constrained multi-player nonlinear systems
Zhang, Shunchao
Zhao, Bo
Liu, Derong
Zhang, Yongwei
NEURAL NETWORKS, 2021, 144 : 101 - 112
[32] Reinforcement Learning Consensus Control for Discrete-Time Multi-Agent Systems
Zhu, Xiaoxia
Yuan, Xin
Wang, Yuanda
Sun, Changyin
PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 6178 - 6182
[33] Zero-sum game-based H∞ optimal finite-time prescribed performance control for nonlinear multiagent systems with actuator faults
Zhang, Bowen
Zhang, Linchuang
Pan, Yingnan
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (13) : 9019 - 9039
[34] Finite-Time Distributed Tracking Control for Multi-Agent Systems With a Virtual Leader
Lu, Xiaoqing
Lu, Renquan
Chen, Shihua
Lu, Jinhu
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2013, 60 (02) : 352 - 362
[35] ε\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varepsilon $$\end{document}-Nash Equilibria of a Multi-player Nonzero-Sum Dynkin Game in Discrete Time
Said Hamadène
Mohammed Hassani
Marie-Amélie Morlais
Dynamic Games and Applications, 2024, 14 (3) : 642 - 664
[36] Finite-Time Tracking Consensus Control for A Class of Nonlinear Multi-Agent Systems
Li, Zhenxing
Chen, Xiangyong
Wen, Yumei
Qiu, Jianlong
IECON 2017 - 43RD ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2017, : 5803 - 5808
[37] Finite-time containment control for nonlinear multi-agent systems with external disturbances
Lu, Hui
He, Wangli
Han, Qing-Long
Ge, Xiaohua
Peng, Chen
INFORMATION SCIENCES, 2020, 512 : 338 - 351
[38] A distributed adaptive architecture with the nonlinear reference model for safe finite-time control of uncertain multiagent systems
Deniz, Meryem
Dogan, K. Merve
Yucelen, Tansel
INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2023, 54 (04) : 822 - 834
[39] Finite-Time Consensus Tracking Control for Speed Sensorless Multi-Motor Systems
Zhang, Bolun
Mo, Shuangye
Zhou, Hao
Qin, Tong
Zhong, Yong
APPLIED SCIENCES-BASEL, 2022, 12 (11):
[40] Finite-Time Average Consensus Control of Multi-Agent Systems Based on the Aperiodically Intermittent Control
Luo, Yiping
Zhu, Junling
IEEE ACCESS, 2022, 10 : 14959 - 14968

← 1 2 3 4 5 →