Voronoi-Based Multi-Robot Autonomous Exploration in Unknown Environments via Deep Reinforcement Learning

被引:262
作者
Hu, Junyan [1 ]
Niu, Hanlin [1 ]
Carrasco, Joaquin [1 ]
Lennox, Barry [1 ]
Arvin, Farshad [1 ]
机构
[1] Univ Manchester, Dept Elect & Elect Engn, Manchester M13 9PL, Lancs, England
基金
英国工程与自然科学研究理事会; 欧盟地平线“2020”;
关键词
Autonomous exploration; path planning; deep reinforcement learning; multi-vehicle systems; collision avoidance; PATH PLANNING ALGORITHM;
D O I
10.1109/TVT.2020.3034800
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Autonomous exploration is an important application of multi-vehicle systems, where a team of networked robots are coordinated to explore an unknown environment collaboratively. This technique has earned significant research interest due to its usefulness in search and rescue, fault detection and monitoring, localization and mapping, etc. In this paper, a novel cooperative exploration strategy is proposed for multiple mobile robots, which reduces the overall task completion time and energy costs compared to conventional methods. To efficiently navigate the networked robots during the collaborative tasks, a hierarchical control architecture is designed which contains a high-level decision making layer and a low-level target tracking layer. The proposed cooperative exploration approach is developed using dynamic Voronoi partitions, which minimizes duplicated exploration areas by assigning different target locations to individual robots. To deal with sudden obstacles in the unknown environment, an integrated deep reinforcement learning based collision avoidance algorithm is then proposed, which enables the control policy to learn from human demonstration data and thus improve the learning speed and performance. Finally, simulation and experimental results are provided to demonstrate the effectiveness of the proposed scheme.
引用
收藏
页码:14413 / 14423
页数:11
相关论文
共 37 条
[1]   Coordinated multi-robot exploration [J].
Burgard, W ;
Moors, M ;
Stachniss, C ;
Schneider, FE .
IEEE TRANSACTIONS ON ROBOTICS, 2005, 21 (03) :376-386
[2]  
Choi J, 2019, IEEE INT CONF ROBOT, P5993, DOI [10.1109/ICRA.2019.8793979, 10.1109/icra.2019.8793979]
[3]   Communication-Efficient Planning and Mapping for Multi-Robot Exploration in Large Environments [J].
Corah, Micah ;
O'Meadhra, Cormac ;
Goel, Kshitij ;
Michael, Nathan .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (02) :1715-1721
[4]   Composite Platoon Trajectory Planning Strategy for Intersection Throughput Maximization [J].
Feng, Yijia ;
He, Dazhi ;
Guan, Yunfeng .
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (07) :6305-6319
[5]   A probabilistic approach to collaborative multi-robot localization [J].
Fox, D ;
Burgard, W ;
Kruppa, H ;
Thrun, S .
AUTONOMOUS ROBOTS, 2000, 8 (03) :325-344
[6]   The Sensor-based Random Graph Method for Cooperative Robot Exploration [J].
Franchi, Antonio ;
Freda, Luigi ;
Oriolo, Giuseppe ;
Vendittelli, Marilena .
IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2009, 14 (02) :163-175
[7]   Occlusion-Based Coordination Protocol Design for Autonomous Robotic Shepherding Tasks [J].
Hu, Junyan ;
Turgut, Ali Emre ;
Krajnik, Tomas ;
Lennox, Barry ;
Arvin, Farshad .
IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2022, 14 (01) :126-135
[8]   Two-layer distributed formation-containment control strategy for linear swarm systems: Algorithm and experiments [J].
Hu, Junyan ;
Bhowmick, Parijat ;
Lanzon, Alexander .
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2020, 30 (16) :6433-6453
[9]   Cooperative Control of Heterogeneous Connected Vehicle Platoons: An Adaptive Leader-Following Approach [J].
Hu, Junyan ;
Bhowmick, Parijat ;
Arvin, Farshad ;
Lanzon, Alexander ;
Lennox, Barry .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2020, 5 (02) :977-984
[10]   Distributed Adaptive Time-Varying Group Formation Tracking for Multiagent Systems With Multiple Leaders on Directed Graphs [J].
Hu, Junyan ;
Bhowmick, Parijat ;
Lanzon, Alexander .
IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2020, 7 (01) :140-150