Self-supervised Deep Reinforcement Learning with Generalized Computation Graphs for Robot Navigation

被引:0
作者
Kahn, Gregory [1 ]
Villaflor, Adam [1 ]
Ding, Bosen [1 ]
Abbeel, Pieter [1 ]
Levine, Sergey [1 ]
机构
[1] Univ Calif Berkeley, BAIR, Berkeley, CA 94720 USA
来源
2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA) | 2018年
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Enabling robots to autonomously navigate complex environments is essential for real-world deployment. Prior methods approach this problem by having the robot maintain an internal map of the world, and then use a localization and planning method to navigate through the internal map. However, these approaches often include a variety of assumptions, are computationally intensive, and do not learn from failures. In contrast, learning-based methods improve as the robot acts in the environment, but are difficult to deploy in the real-world due to their high sample complexity. To address the need to learn complex policies with few samples, we propose a generalized computation graph that subsumes value-based model-free methods and model-based methods, with specific instantiations interpolating between model-free and model-based. We then instantiate this graph to form a navigation model that learns from raw images and is sample efficient. Our simulated car experiments explore the design decisions of our navigation model, and show our approach outperforms single-step and N-step double Q-learning. We also evaluate our approach on a real-world RC car and show it can learn to navigate through a complex indoor environment with a few hours of fully autonomous, self-supervised training. Videos of the experiments and code can be found at github.com/gkahn13/gcg
引用
收藏
页码:5129 / 5136
页数:8
相关论文
共 50 条
[21]   Self-supervised Reinforcement Learning with Independently Controllable Subgoals [J].
Zadaianchuk, Andrii ;
Martius, Georg ;
Yang, Fanny .
CONFERENCE ON ROBOT LEARNING, VOL 164, 2021, 164 :384-394
[22]   Computation of transcranial magnetic stimulation electric fields using self-supervised deep learning [J].
Li, Hongming ;
Deng, Zhi-De ;
Oathes, Desmond ;
Fan, Yong .
NEUROIMAGE, 2022, 264
[23]   Self-Supervised Discovering of Interpretable Features for Reinforcement Learning [J].
Shi, Wenjie ;
Huang, Gao ;
Song, Shiji ;
Wang, Zhuoyuan ;
Lin, Tingyu ;
Wu, Cheng .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (05) :2712-2724
[24]   Knowledge Graph Reasoning With Self-Supervised Reinforcement Learning [J].
Ma, Ying ;
Burns, Owen ;
Wang, Mingqiu ;
Li, Gang ;
Du, Nan ;
El Shafey, Laurent ;
Wang, Liqiang ;
Shafran, Izhak ;
Soltau, Hagen .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2025, 33 :1508-1519
[25]   Self-Supervised Reinforcement Learning for Active Object Detection [J].
Fang, Fen ;
Liang, Wenyu ;
Wu, Yan ;
Xu, Qianli ;
Lim, Joo-Hwee .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04) :10224-10231
[26]   Deep active sampling with self-supervised learning [J].
Shi, Haochen ;
Zhou, Hui .
FRONTIERS OF COMPUTER SCIENCE, 2023, 17 (04)
[27]   Self-Supervised Attention-Aware Reinforcement Learning [J].
Wu, Haiping ;
Khetarpa, Khimya ;
Precup, Doina .
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 :10311-10319
[28]   OAGknow: Self-Supervised Learning for Linking Knowledge Graphs [J].
Liu, Xiao ;
Mian, Li ;
Dong, Yuxiao ;
Zhang, Fanjin ;
Zhang, Jing ;
Tang, Jie ;
Zhang, Peng ;
Gong, Jibing ;
Wang, Kuansan .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (02) :1895-1908
[29]   Self-Supervised Learning on Graphs: Contrastive, Generative, or Predictive [J].
Wu, Lirong ;
Lin, Haitao ;
Tan, Cheng ;
Gao, Zhangyang ;
Li, Stan Z. .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (04) :4216-4235
[30]   Augmentation-Free Self-Supervised Learning on Graphs [J].
Lee, Namkyeong ;
Lee, Junseok ;
Park, Chanyoung .
THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, :7372-7380