Pursuit-evasion with Decentralized Robotic Swarm in Continuous State Space and Action Space via Deep Reinforcement Learning

被引:8
作者
Singh, Gurpreet [1 ]
Lofaro, Daniel M. [2 ]
Sofge, Donald [2 ]
机构
[1] Naval Air Warfare Ctr, Robot & Intelligent Syst Engn RISE Lab, Aircraft Div, Lakehurst, NJ 08733 USA
[2] US Naval Res Lab, Distributed Autonomous Syst Grp, 4555 Overlook Ave SW, Washington, DC 20375 USA
来源
ICAART: PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 1 | 2020年
关键词
Swarm Robotics; Deep Reinforcement Learning; Continuous Space; Actor Critic;
D O I
10.5220/0008971502260233
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we address the pursuit-evasion problem using deep reinforcement learning techniques. The goal of this project is to train each agent in a swarm of pursuers to learn a control strategy to capture the evaders in optimal time while displaying collaborative behavior. Additional challenges addressed in this paper include the use of continuous agent state and action spaces, and the requirement that agents in the swarm must take actions in a decentralized fashion. Our technique builds on the actor-critic model-free Multi-Agent Deep Deterministic Policy Gradient (MADDPG) algorithm that operates over continuous spaces. The evader strategy is not learned and is based on Voronoi regions, which the pursuers try to minimize and the evader tries to maximize. We assume global visibility of all agents at all times. We implement the algorithm and train the models using Python Pytorch machine learning library. Our results show that the pursuers can learn a control strategy to capture evaders.
引用
收藏
页码:226 / 233
页数:8
相关论文
共 10 条
[1]  
Huttenrauch Maximilian., 2018, CoRR
[2]  
Lillicrap Timothy P, 2016, P 4 INT C LEARNING R
[3]  
Lowe R., 2017, P 31 INT C NEUR INF, P6379
[4]  
Mnih V., 2013, PLAYING ATARI DEEP R
[5]   Human-level control through deep reinforcement learning [J].
Mnih, Volodymyr ;
Kavukcuoglu, Koray ;
Silver, David ;
Rusu, Andrei A. ;
Veness, Joel ;
Bellemare, Marc G. ;
Graves, Alex ;
Riedmiller, Martin ;
Fidjeland, Andreas K. ;
Ostrovski, Georg ;
Petersen, Stig ;
Beattie, Charles ;
Sadik, Amir ;
Antonoglou, Ioannis ;
King, Helen ;
Kumaran, Dharshan ;
Wierstra, Daan ;
Legg, Shane ;
Hassabis, Demis .
NATURE, 2015, 518 (7540) :529-533
[6]  
Nguyen T.T., 2018, Deep reinforcement learning for multi-agent systems: A review of challenges, solutions and applications
[7]  
Schuler T., 2019, 2019 3 INT S SWARM B
[8]  
Silver D., 2014, DETERMINISTIC POLICY
[9]  
Sutton RS, 2018, ADAPT COMPUT MACH LE, P1
[10]  
WATKINS CJCH, 1992, MACH LEARN, V8, P279, DOI 10.1007/BF00992698