Learning Navigation Behaviors End-to-End With AutoRL

被引:155
作者
Chiang, Hao-Tien Lewis [1 ]
Faust, Aleksandra [1 ]
Fiser, Marek [1 ]
Francis, Anthony [1 ]
机构
[1] Google AI, Google, Robot, Mountain View, CA 94043 USA
关键词
Autonomous agents; collision avoidance; deep learning in robotics and automation; motion and path planning;
D O I
10.1109/LRA.2019.2899918
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
We learn end-to-end point-to-point and pathfollowing navigation behaviors that avoid moving obstacles. These policies receive noisy lidar observations and output robot linear and angular velocities. The policies are trained in small, static environments with AutoRL, an evolutionary automation layer around reinforcement learning (RL) that searches for a deep RL reward and neural network architecture with large-scale hyper-parameter optimization. AutoRL first finds a reward that maximizes task completion and then finds a neural network architecture that maximizes the cumulative of the found reward. Empirical evaluations, both in simulation and on-robot, show that AutoRL policies do not suffer from the catastrophic forgetfulness that plagues many other deep reinforcement learning algorithms, generalize to new environments and moving obstacles, are robust to sensor, actuator, and localization noise, and can serve as robust building blocks for larger navigation tasks. Our path-following and point-to-point policies are, respectively, 23% and 26% more successful than comparison methods across new environments.
引用
收藏
页码:2007 / 2014
页数:8
相关论文
共 38 条
[1]   Reinforcement learning-based mobile robot navigation [J].
Altuntas, Nihal ;
Imal, Erkan ;
Emanet, Nahit ;
Ozturk, Ceyda Nur .
TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2016, 24 (03) :1747-1767
[2]  
Andrychowicz M, 2016, ADV NEUR IN, V29
[3]  
[Anonymous], 2016, WORKSH AUT MOB SERV
[4]  
[Anonymous], ABS180506066 CORR
[5]  
[Anonymous], 2017, P IEEE INT S ROB RES
[6]  
[Anonymous], 1995, Proceedings of the 6th International Conference on Genetic Algorithms
[7]  
[Anonymous], P SPEC INT GROUP DIS
[8]  
[Anonymous], P EUR S ART NEUR NET
[9]  
[Anonymous], ABS180606161 CORR
[10]   On Learning Navigation Behaviors for Small Mobile Robots With Reservoir Computing Architectures [J].
Antonelo, Eric Aislan ;
Schrauwen, Benjamin .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (04) :763-780