Mapless Collaborative Navigation for a Multi-Robot System Based on the Deep Reinforcement Learning

被引:23
作者
Chen, Wenzhou [1 ]
Zhou, Shizheng [1 ]
Pan, Zaisheng [1 ,2 ]
Zheng, Huixian [2 ]
Liu, Yong [1 ]
机构
[1] Zhejiang Univ, Inst Cyber Syst & Control, Hangzhou 310027, Zhejiang, Peoples R China
[2] Zhejiang Chipkong Technol Co Ltd, Hangzhou 310000, Zhejiang, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2019年 / 9卷 / 20期
基金
中国国家自然科学基金;
关键词
multi-robot; collaborative navigation; deep reinforcement learning; SIMULTANEOUS LOCALIZATION; TREE; GAME; GO;
D O I
10.3390/app9204198
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Compared with the single robot system, a multi-robot system has higher efficiency and fault tolerance. The multi-robot system has great potential in some application scenarios, such as the robot search, rescue and escort tasks, and so on. Deep reinforcement learning provides a potential framework for multi-robot formation and collaborative navigation. This paper mainly studies the collaborative formation and navigation of multi-robots by using the deep reinforcement learning algorithm. The proposed method improves the classical Deep Deterministic Policy Gradient (DDPG) to address the single robot mapless navigation task. We also extend the single-robot Deep Deterministic Policy Gradient algorithm to the multi-robot system, and obtain the Parallel Deep Deterministic Policy Gradient (PDDPG). By utilizing the 2D lidar sensor, the group of robots can accomplish the formation construction task and the collaborative formation navigation task. The experiment results in a Gazebo simulation platform illustrates that our method is capable of guiding mobile robots to construct the formation and keep the formation during group navigation, directly through raw lidar data inputs.
引用
收藏
页数:15
相关论文
共 44 条
[1]  
[Anonymous], ARXIV161103673
[2]  
Bengio Y., 2009, P 26 ANN INT C MACH, P41, DOI [DOI 10.1145/1553374.1553380.EVENT-PLACE, 10.1145/1553374.1553380, DOI 10.1145/1553374.15533802,5]
[3]   WiFi Localization and Navigation for Autonomous Indoor Mobile Robots [J].
Biswas, Joydeep ;
Veloso, Manuela .
2010 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2010, :4379-4384
[4]   Simultaneous Localization and Mapping: A Survey of Current Trends in Autonomous Driving [J].
Bresson, Guillaume ;
Alsayed, Zayed ;
Yu, Li ;
Glaser, Sebastien .
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2017, 2 (03) :194-220
[5]   A Survey of Monte Carlo Tree Search Methods [J].
Browne, Cameron B. ;
Powley, Edward ;
Whitehouse, Daniel ;
Lucas, Simon M. ;
Cowling, Peter I. ;
Rohlfshagen, Philipp ;
Tavener, Stephen ;
Perez, Diego ;
Samothrakis, Spyridon ;
Colton, Simon .
IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, 2012, 4 (01) :1-43
[6]   Past, Present, and Future of Simultaneous Localization and Mapping: Toward the Robust-Perception Age [J].
Cadena, Cesar ;
Carlone, Luca ;
Carrillo, Henry ;
Latif, Yasir ;
Scaramuzza, Davide ;
Neira, Jose ;
Reid, Ian ;
Leonard, John J. .
IEEE TRANSACTIONS ON ROBOTICS, 2016, 32 (06) :1309-1332
[7]  
Chen YF, 2017, IEEE INT C INT ROBOT, P1343, DOI 10.1109/IROS.2017.8202312
[8]  
Dabney W, 2018, AAAI CONF ARTIF INTE, P2892
[9]   Integrated vehicle dynamics control via coordination of active front steering and rear braking [J].
Doumiati, Moustapha ;
Sename, Olivier ;
Dugard, Luc ;
Martinez-Molina, John-Jairo ;
Gaspar, Peter ;
Szabo, Zoltan .
EUROPEAN JOURNAL OF CONTROL, 2013, 19 (02) :121-143
[10]  
Fortunato M., 2017, ARXIV17061029