Self-supervised Deep Reinforcement Learning with Generalized Computation Graphs for Robot Navigation

被引:0
|
作者
Kahn, Gregory [1 ]
Villaflor, Adam [1 ]
Ding, Bosen [1 ]
Abbeel, Pieter [1 ]
Levine, Sergey [1 ]
机构
[1] Univ Calif Berkeley, BAIR, Berkeley, CA 94720 USA
来源
2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA) | 2018年
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Enabling robots to autonomously navigate complex environments is essential for real-world deployment. Prior methods approach this problem by having the robot maintain an internal map of the world, and then use a localization and planning method to navigate through the internal map. However, these approaches often include a variety of assumptions, are computationally intensive, and do not learn from failures. In contrast, learning-based methods improve as the robot acts in the environment, but are difficult to deploy in the real-world due to their high sample complexity. To address the need to learn complex policies with few samples, we propose a generalized computation graph that subsumes value-based model-free methods and model-based methods, with specific instantiations interpolating between model-free and model-based. We then instantiate this graph to form a navigation model that learns from raw images and is sample efficient. Our simulated car experiments explore the design decisions of our navigation model, and show our approach outperforms single-step and N-step double Q-learning. We also evaluate our approach on a real-world RC car and show it can learn to navigate through a complex indoor environment with a few hours of fully autonomous, self-supervised training. Videos of the experiments and code can be found at github.com/gkahn13/gcg
引用
收藏
页码:5129 / 5136
页数:8
相关论文
共 50 条
  • [1] Improving robot navigation through self-supervised Online learning
    Sofman, Boris
    Lin, Ellie
    Bagnell, J. Andrew
    Cole, John
    Vandapel, Nicolas
    Stentz, Anthony
    JOURNAL OF FIELD ROBOTICS, 2006, 23 (11-12) : 1059 - 1075
  • [2] Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning
    Bai, Chenjia
    Liu, Peng
    Liu, Kaiyu
    Wang, Lingxiao
    Zhao, Yingnan
    Han, Lei
    Wang, Zhaoran
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (08) : 4776 - 4790
  • [3] Relational Self-Supervised Learning on Graphs
    Lee, Namkyeong
    Hyun, Dongmin
    Lee, Junseok
    Park, Chanyoung
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 1054 - 1063
  • [4] Decoupled Self-supervised Learning for Graphs
    Xiao, Teng
    Chen, Zhengyu
    Guo, Zhimeng
    Zhuang, Zeyang
    Wang, Suhang
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [5] Learning Synergies between Pushing and Grasping with Self-supervised Deep Reinforcement Learning
    Zeng, Andy
    Song, Shuran
    Welker, Stefan
    Lee, Johnny
    Rodriguez, Alberto
    Funkhouser, Thomas
    2018 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2018, : 4238 - 4245
  • [6] Supervised fuzzy reinforcement learning for robot navigation
    Fathinezhad, Fatemeh
    Derhami, Vali
    Rezaeian, Mehdi
    APPLIED SOFT COMPUTING, 2016, 40 : 33 - 41
  • [7] Self-Supervised Learning of Robot Manipulation
    Tommy, Robin
    Krishnan, Athira R.
    2020 4TH INTERNATIONAL CONFERENCE ON AUTOMATION, CONTROL AND ROBOTS (ICACR 2020), 2020, : 22 - 25
  • [8] Multiple Self-Supervised Auxiliary Tasks for Target-Driven Visual Navigation Using Deep Reinforcement Learning
    Zhang, Wenzhi
    He, Li
    Wang, Hongwei
    Yuan, Liang
    Xiao, Wendong
    ENTROPY, 2023, 25 (07)
  • [9] Asynchronous Deep Reinforcement Learning for the Mobile Robot Navigation with Supervised Auxiliary Tasks
    Tongloy, T.
    Chuwongin, S.
    Jaksukam, K.
    Chousangsuntorn, C.
    Boonsang, S.
    2017 2ND INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION ENGINEERING (ICRAE), 2017, : 68 - 72
  • [10] Intrinsically Motivated Self-supervised Learning in Reinforcement Learning
    Zhao, Yue
    Du, Chenzhuang
    Zhao, Hang
    Li, Tiejun
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022, : 3605 - 3615