AdverSAR: Adversarial Search and Rescue via Multi-Agent Reinforcement Learning

被引:4
|
作者
Rahman, Aowabin [1 ]
Bhattacharya, Arnab [1 ]
Ramachandran, Thiagarajan [1 ]
Mukherjee, Sayak [1 ]
Sharma, Himanshu [1 ]
Fujimoto, Ted [2 ]
Chatterjee, Samrat [3 ]
机构
[1] Pacific Northwest Natl Lab, Optimizat & Control Grp, Richland, WA USA
[2] Pacific Northwest Natl Lab, Data Analyt Grp, Richland, WA USA
[3] Pacific Northwest Natl Lab, Data Sci & Machine Intelligence Grp, Richland, WA USA
关键词
Search and Rescue; Multi-agent Reinforcement Learning; Adversarial Reinforcement Learning; Critical Infrastructure Security;
D O I
10.1109/HST56032.2022.10025434
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Search and Rescue (SAR) missions in remote environments often employ autonomous multi-robot systems that learn, plan, and execute a combination of local single-robot control actions, group primitives, and global mission-oriented coordination and collaboration. Often, SAR coordination strategies are manually designed by human experts who can remotely control the multi-robot system and enable semi-autonomous operations. However, in remote environments where connectivity is limited and human intervention is often not possible, decentralized collaboration strategies are needed for fully-autonomous operations. Nevertheless, decentralized coordination may be ineffective in adversarial environments due to sensor noise, actuation faults, or manipulation of inter-agent communication data. In this paper, we propose an algorithmic approach based on adversarial multi-agent reinforcement learning (MARL) that allows robots to efficiently coordinate their strategies in the presence of adversarial inter-agent communications. In our setup, the objective of the multi-robot team is to discover targets strategically in an obstacle-strewn geographical area by minimizing the average time needed to find the targets. It is assumed that the robots have no prior knowledge of the target locations, and they can interact with only a subset of neighboring robots at any time. Based on the centralized training with decentralized execution (CTDE) paradigm in MARL, we utilize a hierarchical meta-learning framework to learn dynamic team-coordination modalities and discover emergent team behavior under complex cooperative-competitive scenarios. The effectiveness of our approach is demonstrated on a collection of prototype grid-world environments with different specifications of benign and adversarial agents, target locations, and agent rewards.
引用
收藏
页数:7
相关论文
共 50 条
  • [31] Output synchronization of multi-agent systems via reinforcement learning
    Liu, Yingying
    Wang, Zhanshan
    NEUROCOMPUTING, 2022, 508 : 110 - 119
  • [32] Network Maintenance Planning Via Multi-Agent Reinforcement Learning
    Thomas, Jonathan
    Hernandez, Marco Perez
    Parlikad, Ajith Kumar
    Piechocki, Robert
    2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 2289 - 2295
  • [33] IntelligentCrowd: Mobile Crowdsensing via Multi-Agent Reinforcement Learning
    Chen, Yize
    Wang, Hao
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2021, 5 (05): : 840 - 845
  • [34] Safe Multi-Agent Reinforcement Learning via Dynamic Shielding
    Qiu, Yunbo
    Jin, Yue
    Yu, Lebin
    Wang, Jian
    Zhang, Xudong
    2024 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI 2024, 2024, : 1254 - 1257
  • [35] PRIMAL: Pathfinding via Reinforcement and Imitation Multi-Agent Learning
    Sartoretti, Guillaume
    Kerr, Justin
    Shi, YunFei
    Wagner, Glenn
    Kumar, T. K. Satish
    Koenig, Sven
    Choset, Howie
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (03): : 2378 - 2385
  • [36] Learning to Collaborate: Multi-Scenario Ranking via Multi-Agent Reinforcement Learning
    Feng, Jun
    Li, Heng
    Huang, Minlie
    Liu, Shichen
    Ou, Wenwu
    Wang, Zhirong
    Zhu, Xiaoyan
    WEB CONFERENCE 2018: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW2018), 2018, : 1939 - 1948
  • [37] Multi-Agent Cognition Difference Reinforcement Learning for Multi-Agent Cooperation
    Wang, Huimu
    Qiu, Tenghai
    Liu, Zhen
    Pu, Zhiqiang
    Yi, Jianqiang
    Yuan, Wanmai
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [38] Multi-Agent Reinforcement Learning With Distributed Targeted Multi-Agent Communication
    Xu, Chi
    Zhang, Hui
    Zhang, Ya
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 2915 - 2920
  • [39] Multi-Agent Uncertainty Sharing for Cooperative Multi-Agent Reinforcement Learning
    Chen, Hao
    Yang, Guangkai
    Zhang, Junge
    Yin, Qiyue
    Huang, Kaiqi
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [40] Hierarchical multi-agent reinforcement learning
    Mohammad Ghavamzadeh
    Sridhar Mahadevan
    Rajbala Makar
    Autonomous Agents and Multi-Agent Systems, 2006, 13 : 197 - 229