Near-Optimal Multi-Agent Learning for Safe Coverage Control

被引:0
作者
Prajapat, Manish [1 ]
Turchetta, Matteo [1 ]
Zeilinger, Melanie N. [1 ]
Krause, Andreas [1 ]
机构
[1] Swiss Fed Inst Technol, Zurich, Switzerland
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022) | 2022年
基金
瑞士国家科学基金会;
关键词
GAUSSIAN-PROCESSES; REGRET BOUNDS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In multi-agent coverage control problems, agents navigate their environment to reach locations that maximize the coverage of some density. In practice, the density is rarely known a priori, further complicating the original NP-hard problem. Moreover, in many applications, agents cannot visit arbitrary locations due to a priori unknown safety constraints. In this paper, we aim to efficiently learn the density to approximately solve the coverage problem while preserving the agents' safety. We first propose a conditionally linear submodular coverage function that facilitates theoretical analysis. Utilizing this structure, we develop MACOPT, a novel algorithm that efficiently trades off the exploration-exploitation dilemma due to partial observability, and show that it achieves sublinear regret. Next, we extend results on single-agent safe exploration to our multi-agent setting and propose SAFEMAC for safe coverage and exploration. We analyze SAFEMAC and give first of its kind results: near optimal coverage in finite time while provably guaranteeing safety. We extensively evaluate our algorithms on synthetic and real problems, including a biodiversity monitoring task under safety constraints, where SAFEMAC outperforms competing methods.
引用
收藏
页数:15
相关论文
共 69 条
  • [1] Achiam J, 2017, PR MACH LEARN RES, V70
  • [2] [Anonymous], 2002, Proceedings of ACMW SNA
  • [3] Averyanova Yu., 2021, P 6 INT C ACT PROBL, P41
  • [4] Balandat M., 2020, ADV NEURAL INF PROCE, V33, P21524, DOI DOI 10.48550/ARXIV.1910.06403
  • [5] Battocletti Gianpietro, 2021, AIAA AVIATION 2021 FORUM, DOI 10.2514/6.2021-3016
  • [6] Multi-Robot Coordination for Estimation and Coverage of Unknown Spatial Fields
    Benevento, Alessia
    Santos, Maria
    Notarstefano, Giuseppe
    Paynabar, Kamran
    Bloch, Matthieu
    Egerstedt, Magnus
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 7740 - 7746
  • [7] Berkenkamp F, 2017, ADV NEUR IN, V30
  • [8] Bilmes Jeff, 2022, ARXIV220200132
  • [9] Brunke Lukas, 2021, ANN REV CONTROL ROBO, V5
  • [10] GOSSIP COVERAGE CONTROL FOR ROBOTIC NETWORKS: DYNAMICAL SYSTEMS ON THE SPACE OF PARTITIONS
    Bullo, Francesco
    Carli, Ruggero
    Frasca, Paolo
    [J]. SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2012, 50 (01) : 419 - 447