Barrier function based safe reinforcement learning for multi-agent systems

Cited by: 0

Authors:

Yao, Ying [1]

Zhang, Dianfeng [1]

Wu, Zhaojing [1]

Shao, Guangru [1]

Affiliations:

[1] Yantai Univ, Sch Math & Informat Sci, Yantai 264005, Shandong, Peoples R China

Source:

2023 35th Chinese Control and Decision Conference (CCDC) | 2023

Keywords:

Multi-agent systems; optimal control; collision avoidance; maintaining communication; reinforcement learning

DOI:

10.1109/CCDC58219.2023.10327078

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

This paper designs a safety-optimized controller that guarantees collision avoidance and communication maintenance in multi-agent systems (MAS) while minimizing a performance index. Unlike existing work that addresses safety optimization problems with the quadratic programming (QP) method, a new class of Lyapunov-like barrier functions (BFs) is introduced and integrated into the performance indices to guarantee safety. This transforms the original constrained optimal control problem into an unconstrained one. Furthermore, the vanishing viscosity method is used to construct a general value function that eliminates the effect of the nonsmoothness caused by the input constraints on the solution of the modified Hamilton-Jacobi-Bellman (HJB) equation. To solve the HJB equation, an improved actor-critic (A-C) neural network (NN) algorithm is developed to find a smooth approximation of the safety-optimized controller. Finally, a simulation demonstrates the effectiveness of the proposed approach.
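The idea of folding a barrier term into the performance index can be illustrated with a minimal sketch. The reciprocal barrier, the quadratic state and control costs, and the names `pair_barrier`, `running_cost`, `d_min`, `d_max`, and `weight` are all assumptions chosen for illustration; the record does not give the paper's actual Lyapunov-like barrier functions or cost.

```python
import math

def pair_barrier(d, d_min, d_max):
    """Illustrative barrier on an inter-agent distance d.

    Finite for d_min < d < d_max and unbounded as d approaches either limit
    (d_min: collision radius, d_max: communication range).
    This reciprocal form is an assumption, not the paper's barrier function.
    """
    if not (d_min < d < d_max):
        return math.inf
    return 1.0 / (d - d_min) + 1.0 / (d_max - d)

def running_cost(positions, controls, d_min, d_max, weight=1.0):
    """Quadratic state and control cost plus barrier terms over all agent pairs."""
    cost = sum(x * x for p in positions for x in p)
    cost += sum(u * u for c in controls for u in c)
    n = len(positions)
    for i in range(n):
        for j in range(i + 1, n):
            d = math.dist(positions[i], positions[j])
            cost += weight * pair_barrier(d, d_min, d_max)
    return cost
```

Because the barrier term grows without bound near either distance limit, a controller that keeps this cost finite also keeps every pair of agents inside the safe distance band, which is the sense in which the constrained problem becomes an unconstrained one.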


Pages: 1714-1721

Number of pages: 8
