Self-supervised Deep Reinforcement Learning with Generalized Computation Graphs for Robot Navigation

Cited by: 0

Authors:

Kahn, Gregory [1]

Villaflor, Adam [1]

Ding, Bosen [1]

Abbeel, Pieter [1]

Levine, Sergey [1]

Affiliations:

[1] Univ Calif Berkeley, BAIR, Berkeley, CA 94720 USA

Source:

2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA) | 2018

Funding:

U.S. National Science Foundation;

Keywords:

DOI:

Not available

Chinese Library Classification:

TP [Automation technology, computer technology];

Subject classification code:

0812;

Abstract:

Enabling robots to autonomously navigate complex environments is essential for real-world deployment. Prior methods approach this problem by having the robot maintain an internal map of the world, and then use a localization and planning method to navigate through the internal map. However, these approaches often include a variety of assumptions, are computationally intensive, and do not learn from failures. In contrast, learning-based methods improve as the robot acts in the environment, but are difficult to deploy in the real world due to their high sample complexity. To address the need to learn complex policies with few samples, we propose a generalized computation graph that subsumes value-based model-free methods and model-based methods, with specific instantiations interpolating between model-free and model-based. We then instantiate this graph to form a navigation model that learns from raw images and is sample efficient. Our simulated car experiments explore the design decisions of our navigation model, and show our approach outperforms single-step and N-step double Q-learning. We also evaluate our approach on a real-world RC car and show it can learn to navigate through a complex indoor environment with a few hours of fully autonomous, self-supervised training. Videos of the experiments and code can be found at github.com/gkahn13/gcg.
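
For orientation, the N-step double Q-learning baseline named in the abstract can be sketched as below. This is a generic, textbook-style target computation and is not taken from the gcg repository; the function name `n_step_double_q_target` and all of its parameters are hypothetical, chosen only for illustration. The online network picks the bootstrap action and a separate target network evaluates it, with the bootstrap term discounted by gamma to the power N.

```python
from typing import Sequence


def n_step_double_q_target(
    rewards: Sequence[float],
    online_q_next: Sequence[float],
    target_q_next: Sequence[float],
    gamma: float,
    done: bool = False,
) -> float:
    """Illustrative N-step double Q-learning target (hypothetical helper).

    rewards:        r_t, ..., r_{t+N-1} observed along the trajectory.
    online_q_next:  online-network Q-values for each action at s_{t+N}.
    target_q_next:  target-network Q-values for each action at s_{t+N}.
    gamma:          discount factor.
    done:           True if the episode ended within the N steps.
    """
    # Discounted sum of the N observed rewards.
    n_step_return = sum((gamma ** k) * r for k, r in enumerate(rewards))
    if done:
        # No bootstrapping past a terminal state.
        return n_step_return
    # Double Q-learning: select the action with the online network...
    best_action = max(range(len(online_q_next)), key=lambda a: online_q_next[a])
    # ...and evaluate that action with the target network.
    return n_step_return + (gamma ** len(rewards)) * target_q_next[best_action]
```

Passing a single reward recovers the single-step double Q-learning target, so the same helper covers both baselines mentioned in the abstract.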


Pages: 5129-5136

Page count: 8

