Collision Avoidance Verification of Multiagent Systems With Learned Policies

被引：0

作者：

Dong, Zihao ^{[1
]}

Omidshafiei, Shayegan ^{[2
,3
]}

Everett, Michael ^{[4
]}

机构：

[1] Northeastern Univ, Khoury Coll Comp Sci, Boston, MA 02115 USA

[2] Google Res, People & AI Res, Cambridge, MA 02139 USA

[3] Field AI, Irvine, CA 92602 USA

[4] Northeastern Univ, Dept Elect & Comp Engn, Boston, MA 02115 USA

来源：

IEEE CONTROL SYSTEMS LETTERS | 2024年 / 8卷

关键词：

Neural networks; safety verification; multi-agent systems; reachability analysis;

D O I：

10.1109/LCSYS.2024.3400190

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

For many multiagent control problems, neural networks (NNs) have enabled promising new capabilities. However, many of these systems lack formal guarantees (e.g., collision avoidance, robustness), which prevents leveraging these advances in safety-critical settings. While there is recent work on formal verification of NN-controlled systems, most existing techniques cannot handle scenarios with more than one agent. To address this research gap, this letter presents a backward reachability-based approach for verifying the collision avoidance properties of Multi-Agent Neural Feedback Loops (MA-NFLs). Given the dynamics models and trained control policies of each agent, the proposed algorithm computes relative backprojection sets by (simultaneously) solving a series of Mixed Integer Linear Programs (MILPs) offline for each pair of agents. We account for state measurement uncertainties, making it well aligned with real-world scenarios. Using those results, the agents can quickly check for collision avoidance online by solving low-dimensional Linear Programs (LPs). We demonstrate the proposed algorithm can verify collision-free properties of a MA-NFL with agents trained to imitate a collision avoidance algorithm (Reciprocal Velocity Obstacles). We further demonstrate the computational scalability of the approach on systems with up to 10 agents.

引用

页码：652 / 657

页数：6

共 16 条

[1] Reachability Analysis for Neural Feedback Systems using Regressive Polynomial Rule Inference
Dutta, Souradeep
Chen, Xin
Sankaranarayanan, Sriram
[J]. PROCEEDINGS OF THE 2019 22ND ACM INTERNATIONAL CONFERENCE ON HYBRID SYSTEMS: COMPUTATION AND CONTROL (HSCC '19), 2019, : 157 - 168
[2] Reachability Analysis of Neural Feedback Loops
Everett, Michael
Habibi, Golnaz
Sun, Chuangchuang
How, Jonathan P.
[J]. IEEE ACCESS, 2021, 9 : 163938 - 163953
[3] Gama Fernando, 2020, C ROB LEARN, P671
[4] Scalable Forward Reachability Analysis of Multi-Agent Systems with Neural Network Controllers
Gates, Oliver
Newton, Matthew
Gatsis, Konstantinos
[J]. 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 67 - 72
[5] Gurobi Optimization LLC Beaverton OR USA, 2023, Gurobi Optimizer Reference Manual
[6] Hu HM, 2020, IEEE DECIS CONTR P, P5929, DOI [10.1109/cdc42340.2020.9304296, 10.1109/CDC42340.2020.9304296]
[7] ReachNN: Reachability Analysis of Neural-Network Controlled Systems
Huang, Chao
Fan, Jiameng
Li, Wenchao
Chen, Xin
Zhu, Qi
[J]. ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2019, 18 (05)
[8] Kouvaros P, 2019, AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, P179
[9] Multi-Agent Connected Autonomous Driving using Deep Reinforcement Learning
Palanisamy, Praveen
[J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
[10] Probabilistic verification of a decentralized policy for conflict resolution in multi-agent systems
Pallottino, Lucia
Scordio, Vincenzo Giovanni
Frazzoli, Emilio
Bicchi, Antonio
[J]. 2006 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), VOLS 1-10, 2006, : 2448 - +

← 1 2 →