Finite-time safe reinforcement learning control of multi-player nonzero-sum game for quadcopter systems

Cited by: 1
Authors
Tan, Junkai [1, 2]
Xue, Shuangsi [1, 2]
Guan, Qingshu [1, 2]
Qu, Kai [1, 2]
Cao, Hui [1, 2]
Affiliations
[1] Xi An Jiao Tong Univ, Sch Elect Engn, Xian 710049, Peoples R China
[2] Xi An Jiao Tong Univ, State Key Lab, Xian 710049, Peoples R China
Funding
China Postdoctoral Science Foundation;
Keywords
Finite-time optimal control; Nonzero-sum game; Reinforcement learning; Neural network; Dynamic event-trigger; Adaptive dynamic programming; SYNCHRONIZATION;
DOI
10.1016/j.ins.2025.122117
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
This paper investigates a finite-time safe reinforcement learning control algorithm for multi-player nonzero-sum games (FT-SRL-NZS). To address the finite-time safe optimal control problem, value functions incorporating designated barrier functions for the involved players are established within the transformed finite-time stable space. The finite-time safe optimal controller is derived from the solution to the transformed Nash equilibrium condition. An actor-critic structure is proposed for solving the Hamilton-Jacobi-Bellman (HJB) equation in the finite-time stable space, aimed at approximating the finite-time optimal value and its corresponding controller using a novel finite-time concurrent learning update law. A dynamic event-trigger rule adjusts the trigger condition in real time, thereby reducing the computational and communication demands associated with calculating the Nash equilibrium. Lyapunov stability analysis is employed to examine the finite-time equilibrium of the closed-loop system. Numerical simulations and unmanned aerial vehicle (UAV) hardware tests are carried out to illustrate the efficacy of the proposed finite-time safe control algorithm.
Pages: 21
Related papers
50 in total
  • [31] Observer-based event-triggered control for zero-sum games of input constrained multi-player nonlinear systems
    Zhang, Shunchao
    Zhao, Bo
    Liu, Derong
    Zhang, Yongwei
    NEURAL NETWORKS, 2021, 144 : 101 - 112
  • [32] Reinforcement Learning Consensus Control for Discrete-Time Multi-Agent Systems
    Zhu, Xiaoxia
    Yuan, Xin
    Wang, Yuanda
    Sun, Changyin
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 6178 - 6182
  • [33] Zero-sum game-based H∞ optimal finite-time prescribed performance control for nonlinear multiagent systems with actuator faults
    Zhang, Bowen
    Zhang, Linchuang
    Pan, Yingnan
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (13) : 9019 - 9039
  • [34] Finite-Time Distributed Tracking Control for Multi-Agent Systems With a Virtual Leader
    Lu, Xiaoqing
    Lu, Renquan
    Chen, Shihua
    Lu, Jinhu
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2013, 60 (02) : 352 - 362
  • [35] ε-Nash Equilibria of a Multi-player Nonzero-Sum Dynkin Game in Discrete Time
    Said Hamadène
    Mohammed Hassani
    Marie-Amélie Morlais
    Dynamic Games and Applications, 2024, 14 (3) : 642 - 664
  • [36] Finite-Time Tracking Consensus Control for A Class of Nonlinear Multi-Agent Systems
    Li, Zhenxing
    Chen, Xiangyong
    Wen, Yumei
    Qiu, Jianlong
    IECON 2017 - 43RD ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2017, : 5803 - 5808
  • [37] Finite-time containment control for nonlinear multi-agent systems with external disturbances
    Lu, Hui
    He, Wangli
    Han, Qing-Long
    Ge, Xiaohua
    Peng, Chen
    INFORMATION SCIENCES, 2020, 512 : 338 - 351
  • [38] A distributed adaptive architecture with the nonlinear reference model for safe finite-time control of uncertain multiagent systems
    Deniz, Meryem
    Dogan, K. Merve
    Yucelen, Tansel
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2023, 54 (04) : 822 - 834
  • [39] Finite-Time Consensus Tracking Control for Speed Sensorless Multi-Motor Systems
    Zhang, Bolun
    Mo, Shuangye
    Zhou, Hao
    Qin, Tong
    Zhong, Yong
    APPLIED SCIENCES-BASEL, 2022, 12 (11):
  • [40] Finite-Time Average Consensus Control of Multi-Agent Systems Based on the Aperiodically Intermittent Control
    Luo, Yiping
    Zhu, Junling
    IEEE ACCESS, 2022, 10 : 14959 - 14968