Self-supervised Deep Reinforcement Learning with Generalized Computation Graphs for Robot Navigation

被引：0

作者：

Kahn, Gregory ^{[1
]}

Villaflor, Adam ^{[1
]}

Ding, Bosen ^{[1
]}

Abbeel, Pieter ^{[1
]}

Levine, Sergey ^{[1
]}

机构：

[1] Univ Calif Berkeley, BAIR, Berkeley, CA 94720 USA

来源：

2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA) | 2018年

基金：

美国国家科学基金会;

关键词：

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Enabling robots to autonomously navigate complex environments is essential for real-world deployment. Prior methods approach this problem by having the robot maintain an internal map of the world, and then use a localization and planning method to navigate through the internal map. However, these approaches often include a variety of assumptions, are computationally intensive, and do not learn from failures. In contrast, learning-based methods improve as the robot acts in the environment, but are difficult to deploy in the real-world due to their high sample complexity. To address the need to learn complex policies with few samples, we propose a generalized computation graph that subsumes value-based model-free methods and model-based methods, with specific instantiations interpolating between model-free and model-based. We then instantiate this graph to form a navigation model that learns from raw images and is sample efficient. Our simulated car experiments explore the design decisions of our navigation model, and show our approach outperforms single-step and N-step double Q-learning. We also evaluate our approach on a real-world RC car and show it can learn to navigate through a complex indoor environment with a few hours of fully autonomous, self-supervised training. Videos of the experiments and code can be found at github.com/gkahn13/gcg

引用

页码：5129 / 5136

页数：8

共 50 条

[31] On Exploiting Haptic Cues for Self-Supervised Learning of Depth-Based Robot Navigation Affordances [J].

José Baleia ;

Pedro Santana ;

José Barata .

Journal of Intelligent & Robotic Systems, 2015, 80 :455-474

[32] Deep active sampling with self-supervised learning [J].

SHI Haochen ;

ZHOU Hui .

Frontiers of Computer Science, 2023, 17 (04)

[33] Deep Metric Learning with Self-Supervised Ranking [J].

Fu, Zheren ;

Li, Yan ;

Mao, Zhendong ;

Wang, Quan ;

Zhang, Yongdong .

THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 :1370-1378

[34] On Exploiting Haptic Cues for Self-Supervised Learning of Depth-Based Robot Navigation Affordances [J].

Baleia, Jose ;

Santana, Pedro ;

Barata, Jose .

JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2015, 80 (3-4) :455-474

[35] Reverse Optical Flow for Self-Supervised Adaptive Autonomous Robot Navigation [J].

A. Lookingbill ;

J. Rogers ;

D. Lieb ;

J. Curry ;

S. Thrun .

International Journal of Computer Vision, 2007, 74 :287-302

[36] Reverse optical flow for self-supervised adaptive autonomous robot navigation [J].

Lookingbill, A. ;

Rogers, J. ;

Lieb, D. ;

Curry, J. ;

Thrun, S. .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2007, 74 (03) :287-302

[37] Growing Robot Navigation Based on Deep Reinforcement Learning [J].

Ataka, Ahmad ;

Sandiwan, Andreas P. .

2023 9TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND ROBOTICS, ICCAR, 2023, :115-120

[38] Deep Reinforcement Learning for Mapless Robot Navigation Systems [J].

Oliveira, Iure Rosa L. ;

Brandao, Alexandre S. .

2023 LATIN AMERICAN ROBOTICS SYMPOSIUM, LARS, 2023 BRAZILIAN SYMPOSIUM ON ROBOTICS, SBR, AND 2023 WORKSHOP ON ROBOTICS IN EDUCATION, WRE, 2023, :331-336

[39] Quantum Deep Reinforcement Learning for Robot Navigation Tasks [J].

Hohenfeld, Hans ;

Heimann, Dirk ;

Wiebe, Felix ;

Kirchner, Frank .

IEEE ACCESS, 2024, 12 :87217-87236

[40] Mobile Robot Navigation Using Deep Reinforcement Learning [J].

Lee, Min-Fan Ricky ;

Yusuf, Sharfiden Hassen .

PROCESSES, 2022, 10 (12)

← 1 2 3 4 5 →