Visual Reinforcement Learning With Self-Supervised 3D Representations

被引:10
作者
Ze, Yanjie [1 ,2 ]
Hansen, Nicklas [2 ]
Chen, Yinbo [2 ]
Jain, Mohit [2 ]
Wang, Xiaolong [2 ]
机构
[1] Shanghai Jiao Tong Univ, Shanghai 200240, Peoples R China
[2] Univ Calif San Diego, San Diego, CA 92093 USA
关键词
Three-dimensional displays; Task analysis; Visualization; Cameras; Representation learning; Training; Robot vision systems; Reinforcement learning; representation learning; deep learning for visual perception;
D O I
10.1109/LRA.2023.3259681
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
A prominent approach to visual Reinforcement Learning (RL) is to learn an internal state representation using self-supervised methods, which has the potential benefit of improved sample-efficiency and generalization through additional learning signal and inductive biases. However, while the real world is inherently 3D, prior efforts have largely been focused on leveraging 2D computer vision techniques as auxiliary self-supervision. In this work, we present a unified framework for self-supervised learning of 3D representations for motor control. Our proposed framework consists of two phases: a pretraining phase where a deep voxel-based 3D autoencoder is pretrained on a large object-centric dataset, and a finetuning phase where the representation is jointly finetuned together with RL on in-domain data. We empirically show that our method enjoys improved sample efficiency compared to 2D representation learning methods. Additionally, our learned policies transfer zero-shot to a real robot setup with only approximate geometric correspondence, and successfully solve motor control tasks that involve grasping and lifting from a single, uncalibrated RGB camera.
引用
收藏
页码:2890 / 2897
页数:8
相关论文
共 44 条
  • [1] Learning Precise 3D Manipulation from Multiple Uncalibrated Cameras
    Akinola, Iretiayo
    Varley, Jacob
    Kalashnikov, Dmitry
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 4616 - 4622
  • [2] Learning dexterous in-hand manipulation
    Andrychowicz, Marcin
    Baker, Bowen
    Chociej, Maciek
    Jozefowicz, Rafal
    McGrew, Bob
    Pachocki, Jakub
    Petron, Arthur
    Plappert, Matthias
    Powell, Glenn
    Ray, Alex
    Schneider, Jonas
    Sidor, Szymon
    Tobin, Josh
    Welinder, Peter
    Weng, Lilian
    Zaremba, Wojciech
    [J]. INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2020, 39 (01) : 3 - 20
  • [3] Chen Boyuan, 2021, P MACHINE LEARNING R, V139
  • [4] Chen Wang, 2020, 2020 IEEE International Conference on Robotics and Automation (ICRA), P10059, DOI 10.1109/ICRA40945.2020.9196679
  • [5] Chen XL, 2020, Arxiv, DOI arXiv:2003.04297
  • [6] Cheng Ricson, 2018, C ROBOT LEARNING, V87, P422
  • [7] Distance modulation of neural activity in the visual cortex
    Dobbins, AC
    Jeo, RM
    Fiser, J
    Allman, JM
    [J]. SCIENCE, 1998, 281 (5376) : 552 - 555
  • [8] Driess D, 2022, Arxiv, DOI arXiv:2206.01634
  • [9] Auto-Tuned Sim-to-Real Transfer
    Du, Yuqing
    Watkins, Olivia
    Darrell, Trevor
    Abbeel, Pieter
    Pathak, Deepak
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 1290 - 1296
  • [10] Tung HYF, 2020, Arxiv, DOI arXiv:2011.06464