Finite-time safe reinforcement learning control of multi-player nonzero-sum game for quadcopter systems

被引:1
作者
Tan, Junkai [1 ,2 ]
Xue, Shuangsi [1 ,2 ]
Guan, Qingshu [1 ,2 ]
Qu, Kai [1 ,2 ]
Cao, Hui [1 ,2 ]
机构
[1] Xi An Jiao Tong Univ, Sch Elect Engn, Xian 710049, Peoples R China
[2] Xi An Jiao Tong Univ, State Key Lab, Xian 710049, Peoples R China
基金
中国博士后科学基金;
关键词
Finite-time optimal control; Nonzero-sum game; Reinforcement learning; Neural network; Dynamic event-trigger; Adaptive dynamic programming; SYNCHRONIZATION;
D O I
10.1016/j.ins.2025.122117
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper investigates a finite-time safe reinforcement learning control algorithm for multi-player nonzero-sum games (FT-SRL-NZS). In addressing the finite-time safe optimal control issue, value functions incorporating designated barrier functions for the involved players are established within the transformed finite-time stable space. The finite-time safe optimal controller is derived from the solution to the transformed Nash equilibrium condition. An actor-critic structure is proposed for solving the Hamilton-Jacobi-Bellman (HJB) equation in the finite-time stable space, aimed at approximating the finite-time optimal value and its corresponded controller using a novel finite-time concurrent learning update law. A dynamic event-trigger rule adjusts the trigger condition in real time, thereby minimizing the computational and communicative demands associated with calculating Nash equilibrium. Lyapunov stability analysis is employed to examine the finite-time equilibrium of the closed-loop system. Numerical simulations and unmanned aerial vehicle (UAV) hardware tests are carried out to illustrate the efficacy of the proposed finite-time safe control algorithm.
引用
收藏
页数:21
相关论文
共 50 条
  • [41] Distributed Finite-Time Containment Control for Second-Order Multi-Agent Systems
    Zhao Yu
    Duan Zhisheng
    Wen Guanghui
    Zhang Yanjiao
    PROCEEDINGS OF THE 31ST CHINESE CONTROL CONFERENCE, 2012, : 6202 - 6207
  • [42] Finite-time formation control for multi-agent systems underlying heterogeneous communication typologies
    Zhang, Haopeng
    Liyanage, Sanka
    2020 IEEE/ASME INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT MECHATRONICS (AIM), 2020, : 1441 - 1446
  • [43] Finite-time formation control for linear multi-agent systems: A motion planning approach
    Liu, Yongfang
    Geng, Zhiyong
    SYSTEMS & CONTROL LETTERS, 2015, 85 : 54 - 60
  • [44] Online Learning-based Optimal Control of Nonlinear Systems with Finite-Time Convergence Guarantees
    Kokolakis, Nick-Marios T.
    Vamvoudakis, Kyriakos G.
    2022 AMERICAN CONTROL CONFERENCE, ACC, 2022, : 812 - 817
  • [45] Finite-time H∞ cluster consensus control for nonlinear multi-agent systems with aperiodically intermittent control
    Yao, Yuejie
    Luo, Yiping
    Cao, Jinde
    COMMUNICATIONS IN NONLINEAR SCIENCE AND NUMERICAL SIMULATION, 2022, 114
  • [46] SUM-OF-SQUARES-BASED FINITE-TIME ADAPTIVE SLIDING MODE CONTROL OF UNCERTAIN POLYNOMIAL SYSTEMS WITH INPUT NONLINEARITIES
    Mardani, Mohammad Mehdi
    Vafamand, Navid
    Zeini, Mostafa Shokrian
    Shasadeghi, Mokhtar
    Khayatian, Alireza
    ASIAN JOURNAL OF CONTROL, 2018, 20 (04) : 1658 - 1662
  • [47] Cooperative Output Regulation By Q-learning For Discrete Multi-agent Systems In Finite-time
    Wei, Wenjun
    Tang, Jingyuan
    JOURNAL OF APPLIED SCIENCE AND ENGINEERING, 2022, 26 (06): : 853 - 864
  • [48] Finite-time synchronization for nonlinear multi-agent system with directed structure by iterative learning control
    Lin, Zongzong
    Chen, Tianping
    Lu, Wenlian
    PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 8542 - 8547
  • [49] Distributed finite-time formation tracking control of multi-agent systems via FTSMC approach
    Han, Tao
    Guan, Zhi-Hong
    Liao, Rui-Quan
    Chen, Jie
    Chi, Ming
    He, Ding-Xin
    IET CONTROL THEORY AND APPLICATIONS, 2017, 11 (15) : 2585 - 2590
  • [50] Finite-time consensus for second-order multi-agent systems with saturated control protocols
    Zhao, Yu
    Duan, Zhisheng
    Wen, Guanghui
    IET CONTROL THEORY AND APPLICATIONS, 2015, 9 (03) : 312 - 319