Self-supervised Deep Reinforcement Learning with Generalized Computation Graphs for Robot Navigation

被引：0

作者：

Kahn, Gregory ^{[1
]}

Villaflor, Adam ^{[1
]}

Ding, Bosen ^{[1
]}

Abbeel, Pieter ^{[1
]}

Levine, Sergey ^{[1
]}

机构：

[1] Univ Calif Berkeley, BAIR, Berkeley, CA 94720 USA

来源：

2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA) | 2018年

基金：

美国国家科学基金会;

关键词：

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Enabling robots to autonomously navigate complex environments is essential for real-world deployment. Prior methods approach this problem by having the robot maintain an internal map of the world, and then use a localization and planning method to navigate through the internal map. However, these approaches often include a variety of assumptions, are computationally intensive, and do not learn from failures. In contrast, learning-based methods improve as the robot acts in the environment, but are difficult to deploy in the real-world due to their high sample complexity. To address the need to learn complex policies with few samples, we propose a generalized computation graph that subsumes value-based model-free methods and model-based methods, with specific instantiations interpolating between model-free and model-based. We then instantiate this graph to form a navigation model that learns from raw images and is sample efficient. Our simulated car experiments explore the design decisions of our navigation model, and show our approach outperforms single-step and N-step double Q-learning. We also evaluate our approach on a real-world RC car and show it can learn to navigate through a complex indoor environment with a few hours of fully autonomous, self-supervised training. Videos of the experiments and code can be found at github.com/gkahn13/gcg

引用

页码：5129 / 5136

页数：8

共 50 条

[21] Self-supervised Reinforcement Learning with Independently Controllable Subgoals [J].

Zadaianchuk, Andrii ;

Martius, Georg ;

Yang, Fanny .

CONFERENCE ON ROBOT LEARNING, VOL 164, 2021, 164 :384-394

[22] Computation of transcranial magnetic stimulation electric fields using self-supervised deep learning [J].

Li, Hongming ;

Deng, Zhi-De ;

Oathes, Desmond ;

Fan, Yong .

NEUROIMAGE, 2022, 264

[23] Self-Supervised Discovering of Interpretable Features for Reinforcement Learning [J].

Shi, Wenjie ;

Huang, Gao ;

Song, Shiji ;

Wang, Zhuoyuan ;

Lin, Tingyu ;

Wu, Cheng .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (05) :2712-2724

[24] Knowledge Graph Reasoning With Self-Supervised Reinforcement Learning [J].

Ma, Ying ;

Burns, Owen ;

Wang, Mingqiu ;

Li, Gang ;

Du, Nan ;

El Shafey, Laurent ;

Wang, Liqiang ;

Shafran, Izhak ;

Soltau, Hagen .

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2025, 33 :1508-1519

[25] Self-Supervised Reinforcement Learning for Active Object Detection [J].

Fang, Fen ;

Liang, Wenyu ;

Wu, Yan ;

Xu, Qianli ;

Lim, Joo-Hwee .

IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (04) :10224-10231

[26] Deep active sampling with self-supervised learning [J].

Shi, Haochen ;

Zhou, Hui .

FRONTIERS OF COMPUTER SCIENCE, 2023, 17 (04)

[27] Self-Supervised Attention-Aware Reinforcement Learning [J].

Wu, Haiping ;

Khetarpa, Khimya ;

Precup, Doina .

THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 :10311-10319

[28] OAGknow: Self-Supervised Learning for Linking Knowledge Graphs [J].

Liu, Xiao ;

Mian, Li ;

Dong, Yuxiao ;

Zhang, Fanjin ;

Zhang, Jing ;

Tang, Jie ;

Zhang, Peng ;

Gong, Jibing ;

Wang, Kuansan .

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (02) :1895-1908

[29] Self-Supervised Learning on Graphs: Contrastive, Generative, or Predictive [J].

Wu, Lirong ;

Lin, Haitao ;

Tan, Cheng ;

Gao, Zhangyang ;

Li, Stan Z. .

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (04) :4216-4235

[30] Augmentation-Free Self-Supervised Learning on Graphs [J].

Lee, Namkyeong ;

Lee, Junseok ;

Park, Chanyoung .

THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, :7372-7380

← 1 2 3 4 5 →