Barrier function based safe reinforcement learning for multi-agent systems

被引:0
作者
Yao, Ying [1 ]
Zhang, Dianfeng [1 ]
Wu, Zhaojing [1 ]
Shao, Guangru [1 ]
机构
[1] Yantai Univ, Sch Math & Informat Sci, Yantai 264005, Shandong, Peoples R China
来源
2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC | 2023年
关键词
Multi-agent systems; optimal control; collision avoidance; maintaining communication; reinforcement learning;
D O I
10.1109/CCDC58219.2023.10327078
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper intends to design a safety optimized controller that can guarantee collision avoidance and maintaining communication among multi-agent systems (MAS) while minimizing some performance. Different from the existing works on safety optimization problems using quadratic program (QP) method, a new class of Lyapunov-like barrier functions (BFs) is introduced and integrated into the performance indices to guarantee safety. This can transform the original constrained optimal control problem into an unconstrained one. Furthermore, the vanishing viscosity method is introduced to construct a general value function which eliminates the effect of nonsmooth caused by the input constraints on the solution of the modified Hamilton-Jacobi-Bellman (HJB) equation. To solve the HJB equation, an improved Actor-Critic (A-C) neural networks (NNs) algorithm is developed to find a smooth approximation safety-optimized controller. Finally, a simulation is performed to demonstrate the effectiveness of the proposed approach.
引用
收藏
页码:1714 / 1721
页数:8
相关论文
共 27 条
[1]   Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach [J].
Abu-Khalaf, M ;
Lewis, FL .
AUTOMATICA, 2005, 41 (05) :779-791
[2]  
Ames AD, 2019, 2019 18TH EUROPEAN CONTROL CONFERENCE (ECC), P3420, DOI [10.23919/ECC.2019.8796030, 10.23919/ecc.2019.8796030]
[3]   Control Barrier Function Based Quadratic Programs for Safety Critical Systems [J].
Ames, Aaron D. ;
Xu, Xiangru ;
Grizzle, Jessy W. ;
Tabuada, Paulo .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2017, 62 (08) :3861-3876
[4]   Set invariance in control [J].
Blanchini, F .
AUTOMATICA, 1999, 35 (11) :1747-1767
[5]  
Cai Z., 2021, ARXIV210312553
[6]  
Cohen MH, 2020, IEEE DECIS CONTR P, P2062, DOI [10.1109/CDC42340.2020.9303896, 10.1109/cdc42340.2020.9303896]
[7]   Distributed Event-Triggered Control for Multi-Agent Systems [J].
Dimarogonas, Dimos V. ;
Frazzoli, Emilio ;
Johansson, Karl H. .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2012, 57 (05) :1291-1297
[8]  
Jankovic M., 2022, ARXIV220704915
[9]   Robust control barrier functions for constrained stabilization of nonlinear systems [J].
Jankovic, Mrdjan .
AUTOMATICA, 2018, 96 :359-367
[10]  
Li Z., 2017, AUTOMATION CONTROL E