Self-supervised Deep Reinforcement Learning with Generalized Computation Graphs for Robot Navigation

被引：0

作者：

Kahn, Gregory ^{[1
]}

Villaflor, Adam ^{[1
]}

Ding, Bosen ^{[1
]}

Abbeel, Pieter ^{[1
]}

Levine, Sergey ^{[1
]}

机构：

[1] Univ Calif Berkeley, BAIR, Berkeley, CA 94720 USA

来源：

2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA) | 2018年

基金：

美国国家科学基金会;

关键词：

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Enabling robots to autonomously navigate complex environments is essential for real-world deployment. Prior methods approach this problem by having the robot maintain an internal map of the world, and then use a localization and planning method to navigate through the internal map. However, these approaches often include a variety of assumptions, are computationally intensive, and do not learn from failures. In contrast, learning-based methods improve as the robot acts in the environment, but are difficult to deploy in the real-world due to their high sample complexity. To address the need to learn complex policies with few samples, we propose a generalized computation graph that subsumes value-based model-free methods and model-based methods, with specific instantiations interpolating between model-free and model-based. We then instantiate this graph to form a navigation model that learns from raw images and is sample efficient. Our simulated car experiments explore the design decisions of our navigation model, and show our approach outperforms single-step and N-step double Q-learning. We also evaluate our approach on a real-world RC car and show it can learn to navigate through a complex indoor environment with a few hours of fully autonomous, self-supervised training. Videos of the experiments and code can be found at github.com/gkahn13/gcg

引用

页码：5129 / 5136

页数：8

共 50 条

[1] Improving robot navigation through self-supervised Online learning
Sofman, Boris
Lin, Ellie
Bagnell, J. Andrew
Cole, John
Vandapel, Nicolas
Stentz, Anthony
JOURNAL OF FIELD ROBOTICS, 2006, 23 (11-12) : 1059 - 1075
[2] Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning
Bai, Chenjia
Liu, Peng
Liu, Kaiyu
Wang, Lingxiao
Zhao, Yingnan
Han, Lei
Wang, Zhaoran
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (08) : 4776 - 4790
[3] Relational Self-Supervised Learning on Graphs
Lee, Namkyeong
Hyun, Dongmin
Lee, Junseok
Park, Chanyoung
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 1054 - 1063
[4] Decoupled Self-supervised Learning for Graphs
Xiao, Teng
Chen, Zhengyu
Guo, Zhimeng
Zhuang, Zeyang
Wang, Suhang
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[5] Learning Synergies between Pushing and Grasping with Self-supervised Deep Reinforcement Learning
Zeng, Andy
Song, Shuran
Welker, Stefan
Lee, Johnny
Rodriguez, Alberto
Funkhouser, Thomas
2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 4238 - 4245
[6] Supervised fuzzy reinforcement learning for robot navigation
Fathinezhad, Fatemeh
Derhami, Vali
Rezaeian, Mehdi
APPLIED SOFT COMPUTING, 2016, 40 : 33 - 41
[7] Self-Supervised Learning of Robot Manipulation
Tommy, Robin
Krishnan, Athira R.
2020 4TH INTERNATIONAL CONFERENCE ON AUTOMATION, CONTROL AND ROBOTS (ICACR 2020), 2020, : 22 - 25
[8] Multiple Self-Supervised Auxiliary Tasks for Target-Driven Visual Navigation Using Deep Reinforcement Learning
Zhang, Wenzhi
He, Li
Wang, Hongwei
Yuan, Liang
Xiao, Wendong
ENTROPY, 2023, 25 (07)
[9] Asynchronous Deep Reinforcement Learning for the Mobile Robot Navigation with Supervised Auxiliary Tasks
Tongloy, T.
Chuwongin, S.
Jaksukam, K.
Chousangsuntorn, C.
Boonsang, S.
2017 2ND INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION ENGINEERING (ICRAE), 2017, : 68 - 72
[10] Intrinsically Motivated Self-supervised Learning in Reinforcement Learning
Zhao, Yue
Du, Chenzhuang
Zhao, Hang
Li, Tiejun
2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022, : 3605 - 3615

← 1 2 3 4 5 →