Finite-time safe reinforcement learning control of multi-player nonzero-sum game for quadcopter systems

被引:1
作者
Tan, Junkai [1 ,2 ]
Xue, Shuangsi [1 ,2 ]
Guan, Qingshu [1 ,2 ]
Qu, Kai [1 ,2 ]
Cao, Hui [1 ,2 ]
机构
[1] Xi An Jiao Tong Univ, Sch Elect Engn, Xian 710049, Peoples R China
[2] Xi An Jiao Tong Univ, State Key Lab, Xian 710049, Peoples R China
基金
中国博士后科学基金;
关键词
Finite-time optimal control; Nonzero-sum game; Reinforcement learning; Neural network; Dynamic event-trigger; Adaptive dynamic programming; SYNCHRONIZATION;
D O I
10.1016/j.ins.2025.122117
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper investigates a finite-time safe reinforcement learning control algorithm for multi-player nonzero-sum games (FT-SRL-NZS). In addressing the finite-time safe optimal control issue, value functions incorporating designated barrier functions for the involved players are established within the transformed finite-time stable space. The finite-time safe optimal controller is derived from the solution to the transformed Nash equilibrium condition. An actor-critic structure is proposed for solving the Hamilton-Jacobi-Bellman (HJB) equation in the finite-time stable space, aimed at approximating the finite-time optimal value and its corresponded controller using a novel finite-time concurrent learning update law. A dynamic event-trigger rule adjusts the trigger condition in real time, thereby minimizing the computational and communicative demands associated with calculating Nash equilibrium. Lyapunov stability analysis is employed to examine the finite-time equilibrium of the closed-loop system. Numerical simulations and unmanned aerial vehicle (UAV) hardware tests are carried out to illustrate the efficacy of the proposed finite-time safe control algorithm.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] The multi-player nonzero-sum Dynkin game in discrete time
    Said Hamadène
    Mohammed Hassani
    Mathematical Methods of Operations Research, 2014, 79 : 179 - 194
  • [2] The multi-player nonzero-sum Dynkin game in discrete time
    Hamadene, Said
    Hassani, Mohammed
    MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2014, 79 (02) : 179 - 194
  • [3] e-Nash Equilibria of a Multi-player Nonzero-Sum Dynkin Game in Discrete Time
    Hamadene, Said
    Hassani, Mohammed
    Morlais, Marie-Amelie
    DYNAMIC GAMES AND APPLICATIONS, 2024, 14 (03) : 642 - 664
  • [4] Integral reinforcement learning off-policy method for solving nonlinear multi-player nonzero-sum games with saturated actuator
    Ren, He
    Zhang, Huaguang
    Wen, Yinlei
    Liu, Chong
    NEUROCOMPUTING, 2019, 335 : 96 - 104
  • [5] Integral Reinforcement Learning-Based Optimal Control for Nonzero-Sum Games of Multi-Player Input-Constrained Nonlinear Systems
    Wu, Qiuye
    Zhao, Bo
    Liu, Derong
    2021 7TH INTERNATIONAL CONFERENCE ON ROBOTICS AND ARTIFICIAL INTELLIGENCE, ICRAI 2021, 2021, : 59 - 63
  • [6] Online optimal solutions for multi-player nonzero-sum game with completely unknown dynamics
    Lv, Yongfeng
    Ren, Xuemei
    Na, Jing
    NEUROCOMPUTING, 2018, 283 : 87 - 97
  • [7] Neural networks-based optimal tracking control for nonzero-sum games of multi-player continuous-time nonlinear systems via reinforcement learning
    Zhao, Jingang
    NEUROCOMPUTING, 2020, 412 : 167 - 176
  • [8] Data-based approximate optimal control for nonzero-sum games of multi-player systems using adaptive dynamic programming
    Jiang, He
    Zhang, Huaguang
    Xiao, Geyang
    Cui, Xiaohong
    NEUROCOMPUTING, 2018, 275 : 192 - 199
  • [9] Nash Tracking Controls of Multi-input Nonzero-Sum Game System with Reinforcement Learning
    Lv, Yongfeng
    Ren, Xuemei
    Li, Linwei
    Na, Jing
    2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 2765 - 2769
  • [10] Event-triggered robust control for multi-player nonzero-sum games with input constraints and mismatched uncertainties
    Zhang, Shunchao
    Zhao, Bo
    Liu, Derong
    Alippi, Cesare
    Zhang, Yongwei
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2023, 33 (05) : 3086 - 3106