Self-supervised Deep Reinforcement Learning with Generalized Computation Graphs for Robot Navigation

被引:0
作者
Kahn, Gregory [1 ]
Villaflor, Adam [1 ]
Ding, Bosen [1 ]
Abbeel, Pieter [1 ]
Levine, Sergey [1 ]
机构
[1] Univ Calif Berkeley, BAIR, Berkeley, CA 94720 USA
来源
2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA) | 2018年
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Enabling robots to autonomously navigate complex environments is essential for real-world deployment. Prior methods approach this problem by having the robot maintain an internal map of the world, and then use a localization and planning method to navigate through the internal map. However, these approaches often include a variety of assumptions, are computationally intensive, and do not learn from failures. In contrast, learning-based methods improve as the robot acts in the environment, but are difficult to deploy in the real-world due to their high sample complexity. To address the need to learn complex policies with few samples, we propose a generalized computation graph that subsumes value-based model-free methods and model-based methods, with specific instantiations interpolating between model-free and model-based. We then instantiate this graph to form a navigation model that learns from raw images and is sample efficient. Our simulated car experiments explore the design decisions of our navigation model, and show our approach outperforms single-step and N-step double Q-learning. We also evaluate our approach on a real-world RC car and show it can learn to navigate through a complex indoor environment with a few hours of fully autonomous, self-supervised training. Videos of the experiments and code can be found at github.com/gkahn13/gcg
引用
收藏
页码:5129 / 5136
页数:8
相关论文
共 50 条
[31]   On Exploiting Haptic Cues for Self-Supervised Learning of Depth-Based Robot Navigation Affordances [J].
José Baleia ;
Pedro Santana ;
José Barata .
Journal of Intelligent & Robotic Systems, 2015, 80 :455-474
[32]   Deep active sampling with self-supervised learning [J].
SHI Haochen ;
ZHOU Hui .
Frontiers of Computer Science, 2023, 17 (04)
[33]   Deep Metric Learning with Self-Supervised Ranking [J].
Fu, Zheren ;
Li, Yan ;
Mao, Zhendong ;
Wang, Quan ;
Zhang, Yongdong .
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 :1370-1378
[34]   On Exploiting Haptic Cues for Self-Supervised Learning of Depth-Based Robot Navigation Affordances [J].
Baleia, Jose ;
Santana, Pedro ;
Barata, Jose .
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2015, 80 (3-4) :455-474
[35]   Reverse Optical Flow for Self-Supervised Adaptive Autonomous Robot Navigation [J].
A. Lookingbill ;
J. Rogers ;
D. Lieb ;
J. Curry ;
S. Thrun .
International Journal of Computer Vision, 2007, 74 :287-302
[36]   Reverse optical flow for self-supervised adaptive autonomous robot navigation [J].
Lookingbill, A. ;
Rogers, J. ;
Lieb, D. ;
Curry, J. ;
Thrun, S. .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2007, 74 (03) :287-302
[37]   Growing Robot Navigation Based on Deep Reinforcement Learning [J].
Ataka, Ahmad ;
Sandiwan, Andreas P. .
2023 9TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ROBOTICS, ICCAR, 2023, :115-120
[38]   Deep Reinforcement Learning for Mapless Robot Navigation Systems [J].
Oliveira, Iure Rosa L. ;
Brandao, Alexandre S. .
2023 LATIN AMERICAN ROBOTICS SYMPOSIUM, LARS, 2023 BRAZILIAN SYMPOSIUM ON ROBOTICS, SBR, AND 2023 WORKSHOP ON ROBOTICS IN EDUCATION, WRE, 2023, :331-336
[39]   Quantum Deep Reinforcement Learning for Robot Navigation Tasks [J].
Hohenfeld, Hans ;
Heimann, Dirk ;
Wiebe, Felix ;
Kirchner, Frank .
IEEE ACCESS, 2024, 12 :87217-87236
[40]   Mobile Robot Navigation Using Deep Reinforcement Learning [J].
Lee, Min-Fan Ricky ;
Yusuf, Sharfiden Hassen .
PROCESSES, 2022, 10 (12)