Self-Supervised Correspondence in Visuomotor Policy Learning

被引:89
作者
Florence, Peter [1 ]
Manuelli, Lucas [1 ]
Tedrake, Russ [1 ]
机构
[1] MIT, Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA
基金
美国国家科学基金会;
关键词
Deep learning in robotics and automation; perception for grasping and manipulation; visual learning; MANIPULATION;
D O I
10.1109/LRA.2019.2956365
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
In this letter, we explore using self-supervised correspondence for improving the generalization performance and sample efficiency of visuomotor policy learning. Prior work has primarily used approaches such as autoencoding, pose-based losses, and end-to-end policy optimization in order to train the visual portion of visuomotor policies. We instead propose an approach using self-supervised dense visual correspondence training and show that this enables visuomotor policy learning with surprisingly high generalization performance with modest amounts of data. Using imitation learning, we demonstrate extensive hardware validation on challenging manipulation tasks with as few as 50 demonstrations. Our learned policies can generalize across classes of objects, react to deformable object configurations, and manipulate textureless symmetrical objects in a variety of backgrounds, all with closed-loop, real-time vision-based policies. Simulated imitation learning experiments suggest that correspondence training offers sample complexity and generalization benefits compared to autoencoding and end-to-end training.
引用
收藏
页码:492 / 499
页数:8
相关论文
共 41 条
[31]  
Ross S., 2011, P 14 INT C ARTIFICIA, P627
[32]   Self-Supervised Visual Descriptor Learning for Dense Correspondence [J].
Schmidt, Tanner ;
Newcombe, Richard ;
Fox, Dieter .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2017, 2 (02) :420-427
[33]  
Sermanet P, 2018, IEEE INT CONF ROBOT, P1134
[34]   GPLAC: Generalizing Vision-Based Robotic Skills using Weakly Labeled Images [J].
Singh, Avi ;
Yang, Larry ;
Levine, Sergey .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :5852-5861
[35]  
Tedrake Russ, 2019, Drake: Model-based design and verification for robotics
[36]  
van Hoof H, 2016, 2016 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2016), P3928, DOI 10.1109/IROS.2016.7759578
[37]  
Yahya A, 2017, IEEE INT C INT ROBOT, P79, DOI 10.1109/IROS.2017.8202141
[38]   Repeatable Folding Task by Humanoid Robot Worker Using Deep Learning [J].
Yang, Pin-Chu ;
Sasaki, Kazuma ;
Suzuki, Kanata ;
Kase, Kei ;
Sugano, Shigeki ;
Ogata, Tetsuya .
IEEE ROBOTICS AND AUTOMATION LETTERS, 2017, 2 (02) :397-403
[39]  
Yu T, 2019, DESTECH TRANS SOC
[40]  
Yu Yu T. T., ROBOTICS SCI SYSTEMS, DOI [DOI 10.15607/RSS.2018.XIV.002, 10.15607/RSS.2018.XIV.002]