Self-supervised Deep Reinforcement Learning with Generalized Computation Graphs for Robot Navigation

Cited by: 0
Authors
Kahn, Gregory [1]
Villaflor, Adam [1]
Ding, Bosen [1]
Abbeel, Pieter [1]
Levine, Sergey [1]
Affiliations
[1] Univ Calif Berkeley, BAIR, Berkeley, CA 94720 USA
Source
2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA) | 2018
Funding
National Science Foundation (USA);
Keywords
DOI
Not available
Chinese Library Classification number
TP [Automation and computer technology];
Discipline classification code
0812;
Abstract
Enabling robots to autonomously navigate complex environments is essential for real-world deployment. Prior methods approach this problem by having the robot maintain an internal map of the world, and then use a localization and planning method to navigate through the internal map. However, these approaches often include a variety of assumptions, are computationally intensive, and do not learn from failures. In contrast, learning-based methods improve as the robot acts in the environment, but are difficult to deploy in the real world due to their high sample complexity. To address the need to learn complex policies with few samples, we propose a generalized computation graph that subsumes value-based model-free methods and model-based methods, with specific instantiations interpolating between model-free and model-based. We then instantiate this graph to form a navigation model that learns from raw images and is sample efficient. Our simulated car experiments explore the design decisions of our navigation model, and show our approach outperforms single-step and N-step double Q-learning. We also evaluate our approach on a real-world RC car and show it can learn to navigate through a complex indoor environment with a few hours of fully autonomous, self-supervised training. Videos of the experiments and code can be found at github.com/gkahn13/gcg
Pages: 5129-5136
Number of pages: 8
Related papers
50 in total
  • [41] Adaptive-Masking Policy with Deep Reinforcement Learning for Self-Supervised Medical Image Segmentation
    Xu, Gang
    Wang, Shengxin
    Lukasiewicz, Thomas
    Xu, Zhenghua
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 2285 - 2290
  • [42] Self-Supervised Imitation for Offline Reinforcement Learning With Hindsight Relabeling
    Yu, Xudong
    Bai, Chenjia
    Wang, Changhong
    Yu, Dengxiu
    Chen, C. L. Philip
    Wang, Zhen
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (12): : 7732 - 7743
  • [43] Multi-task Self-Supervised Adaptation for Reinforcement Learning
    Wu, Keyu
    Chen, Zhenghua
    Wu, Min
    Xiang, Shili
    Jin, Ruibing
    Zhang, Le
    Li, Xiaoli
    2022 IEEE 17TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2022, : 15 - 20
  • [44] ROLL: Visual Self-Supervised Reinforcement Learning with Object Reasoning
    Wang, Yufei
    Narasimhan, Gautham Narayan
    Lin, Xingyu
    Okorn, Brian
    Held, David
    CONFERENCE ON ROBOT LEARNING, VOL 155, 2020, 155 : 1030 - 1048
  • [45] Self-Supervised Exploration via Temporal Inconsistency in Reinforcement Learning
    Gao Z.
    Xu K.
    Zhai Y.
    Ding B.
    Feng D.
    Mao X.
    Wang H.
    IEEE Transactions on Artificial Intelligence, 2024, 5 (11): : 1 - 10
  • [46] Self-Supervised Representations for Multi-View Reinforcement Learning
    Yang, Huanhuan
    Shi, Dianxi
    Xie, Guojun
    Peng, Yingxuan
    Zhang, Yi
    Yang, Yantai
    Yang, Shaowu
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, VOL 180, 2022, 180 : 2203 - 2213
  • [47] Self-Supervised Reinforcement Learning that Transfers using Random Features
    Chen, Boyuan
    Zhu, Chuning
    Agrawal, Pulkit
    Zhang, Kaiqing
    Gupta, Abhishek
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [48] Self-Supervised Reinforcement Learning for Proactive Prediction of Passive Intermodulation
    Banerjee, Serene
    Uppuluri, Pratyush Kiran
    Sharma, Rahul N.
    Bandyopadhyay, Subhadip
    2023 15TH INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS & NETWORKS, COMSNETS, 2023,
  • [49] Missing nodes detection on graphs with self-supervised contrastive learning
    Liu, Chen
    Cao, Tingting
    Zhou, Lixin
    Shao, Ying
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 132
  • [50] Deep self-supervised transformation learning for leukocyte classification
    Chen, Xinwei
    Zheng, Guolin
    Zhou, Liwei
    Li, Zuoyong
    Fan, Haoyi
    JOURNAL OF BIOPHOTONICS, 2023, 16 (03)