Visual Reinforcement Learning With Self-Supervised 3D Representations

被引：10

作者：

Ze, Yanjie ^{[1
,2
]}

Hansen, Nicklas ^{[2
]}

Chen, Yinbo ^{[2
]}

Jain, Mohit ^{[2
]}

Wang, Xiaolong ^{[2
]}

机构：

[1] Shanghai Jiao Tong Univ, Shanghai 200240, Peoples R China

[2] Univ Calif San Diego, San Diego, CA 92093 USA

来源：

IEEE ROBOTICS AND AUTOMATION LETTERS | 2023年 / 8卷 / 05期

关键词：

Three-dimensional displays; Task analysis; Visualization; Cameras; Representation learning; Training; Robot vision systems; Reinforcement learning; representation learning; deep learning for visual perception;

D O I：

10.1109/LRA.2023.3259681

中图分类号：

TP24 [机器人技术];

学科分类号：

080202 ; 1405 ;

摘要：

A prominent approach to visual Reinforcement Learning (RL) is to learn an internal state representation using self-supervised methods, which has the potential benefit of improved sample-efficiency and generalization through additional learning signal and inductive biases. However, while the real world is inherently 3D, prior efforts have largely been focused on leveraging 2D computer vision techniques as auxiliary self-supervision. In this work, we present a unified framework for self-supervised learning of 3D representations for motor control. Our proposed framework consists of two phases: a pretraining phase where a deep voxel-based 3D autoencoder is pretrained on a large object-centric dataset, and a finetuning phase where the representation is jointly finetuned together with RL on in-domain data. We empirically show that our method enjoys improved sample efficiency compared to 2D representation learning methods. Additionally, our learned policies transfer zero-shot to a real robot setup with only approximate geometric correspondence, and successfully solve motor control tasks that involve grasping and lifting from a single, uncalibrated RGB camera.

引用

页码：2890 / 2897

页数：8

共 44 条

[1] Learning Precise 3D Manipulation from Multiple Uncalibrated Cameras
Akinola, Iretiayo
Varley, Jacob
Kalashnikov, Dmitry
[J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 4616 - 4622
[2] Learning dexterous in-hand manipulation
Andrychowicz, Marcin
Baker, Bowen
Chociej, Maciek
Jozefowicz, Rafal
McGrew, Bob
Pachocki, Jakub
Petron, Arthur
Plappert, Matthias
Powell, Glenn
Ray, Alex
Schneider, Jonas
Sidor, Szymon
Tobin, Josh
Welinder, Peter
Weng, Lilian
Zaremba, Wojciech
[J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2020, 39 (01) : 3 - 20
[3] Chen Boyuan, 2021, P MACHINE LEARNING R, V139
[4] Chen Wang, 2020, 2020 IEEE International Conference on Robotics and Automation (ICRA), P10059, DOI 10.1109/ICRA40945.2020.9196679
[5] Chen XL, 2020, Arxiv, DOI arXiv:2003.04297
[6] Cheng Ricson, 2018, C ROBOT LEARNING, V87, P422
[7] Distance modulation of neural activity in the visual cortex
Dobbins, AC
Jeo, RM
Fiser, J
Allman, JM
[J]. SCIENCE, 1998, 281 (5376) : 552 - 555
[8] Driess D, 2022, Arxiv, DOI arXiv:2206.01634
[9] Auto-Tuned Sim-to-Real Transfer
Du, Yuqing
Watkins, Olivia
Darrell, Trevor
Abbeel, Pieter
Pathak, Deepak
[J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 1290 - 1296
[10] Tung HYF, 2020, Arxiv, DOI arXiv:2011.06464

← 1 2 3 4 5 →