AdverSAR: Adversarial Search and Rescue via Multi-Agent Reinforcement Learning

Cited by: 4
Authors
Rahman, Aowabin [1]
Bhattacharya, Arnab [1]
Ramachandran, Thiagarajan [1]
Mukherjee, Sayak [1]
Sharma, Himanshu [1]
Fujimoto, Ted [2]
Chatterjee, Samrat [3]
Affiliations
[1] Pacific Northwest National Laboratory, Optimization & Control Group, Richland, WA, USA
[2] Pacific Northwest National Laboratory, Data Analytics Group, Richland, WA, USA
[3] Pacific Northwest National Laboratory, Data Science & Machine Intelligence Group, Richland, WA, USA
Keywords
Search and Rescue; Multi-agent Reinforcement Learning; Adversarial Reinforcement Learning; Critical Infrastructure Security
DOI
10.1109/HST56032.2022.10025434
Chinese Library Classification
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
Search and Rescue (SAR) missions in remote environments often employ autonomous multi-robot systems that learn, plan, and execute a combination of local single-robot control actions, group primitives, and global mission-oriented coordination and collaboration. Often, SAR coordination strategies are manually designed by human experts who can remotely control the multi-robot system and enable semi-autonomous operations. However, in remote environments where connectivity is limited and human intervention is often not possible, decentralized collaboration strategies are needed for fully-autonomous operations. Nevertheless, decentralized coordination may be ineffective in adversarial environments due to sensor noise, actuation faults, or manipulation of inter-agent communication data. In this paper, we propose an algorithmic approach based on adversarial multi-agent reinforcement learning (MARL) that allows robots to efficiently coordinate their strategies in the presence of adversarial inter-agent communications. In our setup, the objective of the multi-robot team is to discover targets strategically in an obstacle-strewn geographical area by minimizing the average time needed to find the targets. It is assumed that the robots have no prior knowledge of the target locations, and they can interact with only a subset of neighboring robots at any time. Based on the centralized training with decentralized execution (CTDE) paradigm in MARL, we utilize a hierarchical meta-learning framework to learn dynamic team-coordination modalities and discover emergent team behavior under complex cooperative-competitive scenarios. The effectiveness of our approach is demonstrated on a collection of prototype grid-world environments with different specifications of benign and adversarial agents, target locations, and agent rewards.
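The setting described in the abstract (a grid world with obstacles, targets at unknown locations, communication limited to nearby robots, and manipulated inter-agent messages) can be illustrated with a small simulation. The sketch below is not the authors' method: it replaces the learned MARL policies with a hand-coded exploration rule, and every name in it (`run_episode`, `believed`, `radius`, the form of the adversary's false report) is a hypothetical choice made for illustration. It only shows how the stated objective, the average time to find the targets, could be measured with and without an adversarial communicator.

```python
import random

# Four grid moves: right, left, down, up (illustrative convention).
MOVES = [(0, 1), (0, -1), (1, 0), (-1, 0)]


def step(pos, move, size, obstacles):
    """Apply a move; stay in place if it leaves the grid or hits an obstacle."""
    nxt = (pos[0] + move[0], pos[1] + move[1])
    ok = 0 <= nxt[0] < size and 0 <= nxt[1] < size and nxt not in obstacles
    return nxt if ok else pos


def in_range(a, b, radius):
    """True if two cells are within Manhattan distance `radius`."""
    return abs(a[0] - b[0]) + abs(a[1] - b[1]) <= radius


def run_episode(size, obstacles, targets, starts, adversaries=frozenset(),
                radius=2, max_steps=500, seed=0):
    """Simulate one search episode; return (mean discovery time, found_at).

    Assumption: benign agents share the cells they believe are searched,
    while adversarial agents broadcast random cells as "already searched".
    Targets never found are counted at `max_steps`.
    """
    rng = random.Random(seed)
    pos = list(starts)
    believed = [{p} for p in starts]  # cells each agent thinks are searched
    found_at = {}                     # target cell -> step of discovery

    for t in range(1, max_steps + 1):
        # 1) Each agent prepares a message; adversaries send false reports.
        outbox = []
        for i in range(len(pos)):
            if i in adversaries:
                fake = {(rng.randrange(size), rng.randrange(size))
                        for _ in range(size)}
                outbox.append(fake)
            else:
                outbox.append(set(believed[i]))

        # 2) Benign agents merge messages from neighbors within `radius`.
        for i in range(len(pos)):
            if i in adversaries:
                continue
            for j in range(len(pos)):
                if j != i and in_range(pos[i], pos[j], radius):
                    believed[i] |= outbox[j]

        # 3) Each agent moves, preferring cells it believes are unsearched.
        for i in range(len(pos)):
            options = [step(pos[i], m, size, obstacles) for m in MOVES]
            fresh = [c for c in options if c not in believed[i]]
            pos[i] = rng.choice(fresh or options)
            believed[i].add(pos[i])
            benign = i not in adversaries
            if benign and pos[i] in targets and pos[i] not in found_at:
                found_at[pos[i]] = t

        if len(found_at) == len(targets):
            break

    times = [found_at.get(g, max_steps) for g in targets]
    return sum(times) / len(times), found_at
```

Calling `run_episode` once with `adversaries=frozenset()` and once with, say, `adversaries={1}` on the same grid and seed gives a rough sense of how false "already searched" reports can delay discovery under this toy rule. Whether a learned policy behaves similarly is a question the paper addresses, not this sketch.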
Pages: 7
Related papers
50 records in total
  • [1] Multi-Agent Adversarial Inverse Reinforcement Learning
    Yu, Lantao
    Song, Jiaming
    Ermon, Stefano
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [2] Harnessing Online Knowledge Transfer for Enhanced Search and Rescue Decisions via Multi-Agent Reinforcement Learning
    Song, Luona
    Wen, Zhigang
    Teng, Junjie
    Zhang, Jian
    Nicolas, Merveille
    SUSTAINABILITY, 2023, 15 (24)
  • [3] Learning adversarial policy in multiple scenes environment via multi-agent reinforcement learning
    Li, Yang
    Wang, Xinzhi
    Wang, Wei
    Zhang, Zhenyu
    Wang, Jianshu
    Luo, Xiangfeng
    Xie, Shaorong
    CONNECTION SCIENCE, 2021, 33 (03) : 407 - 426
  • [4] Adversarial Machine Learning Attacks and Defences in Multi-Agent Reinforcement Learning
    Standen, Maxwell
    Kim, Junae
    Szabo, Claudia
    ACM COMPUTING SURVEYS, 2025, 57 (05)
  • [5] Hierarchical reinforcement learning via dynamic subspace search for multi-agent planning
    Ma, Aaron
    Ouimet, Michael
    Cortes, Jorge
    AUTONOMOUS ROBOTS, 2020, 44 (3-4) : 485 - 503
  • [6] Efficient Adversarial Attacks on Online Multi-agent Reinforcement Learning
    Liu, Guanlin
    Lai, Lifeng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023
  • [7] Distributed hierarchical reinforcement learning in multi-agent adversarial environments
    Naderializadeh, Navid
    Soleyman, Sean
    Hung, Fan
    Khosla, Deepak
    Chen, Yang
    Fadaie, Joshua G.
    ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS IV, 2022, 12113
  • [8] Resilient Multi-Agent Reinforcement Learning with Adversarial Value Decomposition
    Phan, Thomy
    Belzner, Lenz
    Gabor, Thomas
    Sedlmeier, Andreas
    Ritz, Fabian
    Linnhoff-Popien, Claudia
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 11308 - 11316
  • [9] An Investigation of Underground Rescue Scheduling with Multi-agent Reinforcement Learning
    Li, Xuge
    Zhong, Yueyun
    Hu, Chengpeng
    ADVANCES IN SWARM INTELLIGENCE, PT I, ICSI 2024, 2024, 14788 : 379 - 390