Collision Avoidance Verification of Multiagent Systems With Learned Policies

被引:0
作者
Dong, Zihao [1 ]
Omidshafiei, Shayegan [2 ,3 ]
Everett, Michael [4 ]
机构
[1] Northeastern Univ, Khoury Coll Comp Sci, Boston, MA 02115 USA
[2] Google Res, People & AI Res, Cambridge, MA 02139 USA
[3] Field AI, Irvine, CA 92602 USA
[4] Northeastern Univ, Dept Elect & Comp Engn, Boston, MA 02115 USA
来源
IEEE CONTROL SYSTEMS LETTERS | 2024年 / 8卷
关键词
Neural networks; safety verification; multi-agent systems; reachability analysis;
D O I
10.1109/LCSYS.2024.3400190
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
For many multiagent control problems, neural networks (NNs) have enabled promising new capabilities. However, many of these systems lack formal guarantees (e.g., collision avoidance, robustness), which prevents leveraging these advances in safety-critical settings. While there is recent work on formal verification of NN-controlled systems, most existing techniques cannot handle scenarios with more than one agent. To address this research gap, this letter presents a backward reachability-based approach for verifying the collision avoidance properties of Multi-Agent Neural Feedback Loops (MA-NFLs). Given the dynamics models and trained control policies of each agent, the proposed algorithm computes relative backprojection sets by (simultaneously) solving a series of Mixed Integer Linear Programs (MILPs) offline for each pair of agents. We account for state measurement uncertainties, making it well aligned with real-world scenarios. Using those results, the agents can quickly check for collision avoidance online by solving low-dimensional Linear Programs (LPs). We demonstrate the proposed algorithm can verify collision-free properties of a MA-NFL with agents trained to imitate a collision avoidance algorithm (Reciprocal Velocity Obstacles). We further demonstrate the computational scalability of the approach on systems with up to 10 agents.
引用
收藏
页码:652 / 657
页数:6
相关论文
共 16 条
  • [1] Reachability Analysis for Neural Feedback Systems using Regressive Polynomial Rule Inference
    Dutta, Souradeep
    Chen, Xin
    Sankaranarayanan, Sriram
    [J]. PROCEEDINGS OF THE 2019 22ND ACM INTERNATIONAL CONFERENCE ON HYBRID SYSTEMS: COMPUTATION AND CONTROL (HSCC '19), 2019, : 157 - 168
  • [2] Reachability Analysis of Neural Feedback Loops
    Everett, Michael
    Habibi, Golnaz
    Sun, Chuangchuang
    How, Jonathan P.
    [J]. IEEE ACCESS, 2021, 9 : 163938 - 163953
  • [3] Gama Fernando, 2020, C ROB LEARN, P671
  • [4] Scalable Forward Reachability Analysis of Multi-Agent Systems with Neural Network Controllers
    Gates, Oliver
    Newton, Matthew
    Gatsis, Konstantinos
    [J]. 2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 67 - 72
  • [5] Gurobi Optimization LLC Beaverton OR USA, 2023, Gurobi Optimizer Reference Manual
  • [6] Hu HM, 2020, IEEE DECIS CONTR P, P5929, DOI [10.1109/cdc42340.2020.9304296, 10.1109/CDC42340.2020.9304296]
  • [7] ReachNN: Reachability Analysis of Neural-Network Controlled Systems
    Huang, Chao
    Fan, Jiameng
    Li, Wenchao
    Chen, Xin
    Zhu, Qi
    [J]. ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2019, 18 (05)
  • [8] Kouvaros P, 2019, AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, P179
  • [9] Multi-Agent Connected Autonomous Driving using Deep Reinforcement Learning
    Palanisamy, Praveen
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [10] Probabilistic verification of a decentralized policy for conflict resolution in multi-agent systems
    Pallottino, Lucia
    Scordio, Vincenzo Giovanni
    Frazzoli, Emilio
    Bicchi, Antonio
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), VOLS 1-10, 2006, : 2448 - +