Vision-Based Multi-Task Manipulation for Inexpensive Robots Using End-To-End Learning from Demonstration

被引:0
|
作者
Rahmatizadeh, Rouhollah [1 ]
Abolghasemi, Pooya [1 ]
Boloni, Ladislau [1 ]
Levine, Sergey [2 ]
机构
[1] Univ Cent Florida, Orlando, FL 32816 USA
[2] Univ Calif Berkeley, Berkeley, CA 94720 USA
基金
美国国家科学基金会;
关键词
TASK;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a technique for multi-task learning from demonstration that trains the controller of a low-cost robotic arm to accomplish several complex picking and placing tasks, as well as non-prehensile manipulation. The controller is a recurrent neural network using raw images as input and generating robot arm trajectories, with the parameters shared across the tasks. The controller also combines VAE-GAN-based reconstruction with autoregressive multimodal action prediction. Our results demonstrate that it is possible to learn complex manipulation tasks, such as picking up a towel, wiping an object, and depositing the towel to its previous position, entirely from raw images with direct behavior cloning. We show that weight sharing and reconstruction-based regularization substantially improve generalization and robustness, and training on multiple tasks simultaneously increases the success rate on all tasks.
引用
收藏
页码:3758 / 3765
页数:8
相关论文
共 50 条
  • [41] Visually Guided Picking Control of an Omnidirectional Mobile Manipulator Based on End-to-End Multi-Task Imitation Learning
    Tsai, Chi-Yi
    Chou, Yung-Shan
    Wong, Ching-Chang
    Lai, Yu-Cheng
    Huang, Chien-Che
    IEEE ACCESS, 2020, 8 : 1882 - 1891
  • [42] Multi-task and multi-view training for end-to-end relation extraction
    Zhang, Junchi
    Zhang, Yue
    Ji, Donghong
    Liu, Mengchi
    NEUROCOMPUTING, 2019, 364 : 245 - 253
  • [43] ASR Posterior-based Loss for Multi-task End-to-end Speech Translation
    Ko, Yuka
    Sudoh, Katsuhito
    Sakti, Sakriani
    Nakamura, Satoshi
    INTERSPEECH 2021, 2021, : 2272 - 2276
  • [44] An End-to-End Multi-Task and Fusion CNN for Inertial-Based Gait Recognition
    Delgado-Escano, Ruben
    Castro, Francisco M.
    Cozar, Julian Ramos
    Marin-Jimenez, Manuel J.
    Guil, Nicolas
    IEEE ACCESS, 2019, 7 : 1897 - 1908
  • [45] End-to-end Multi-task Learning Framework for Spatio-Temporal Grounding in Video Corpus
    Gao, Yingqi
    Luo, Zhiling
    Chen, Shiqian
    Zhou, Wei
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2022, 2022, : 3958 - 3962
  • [46] An effective multi-task learning model for end-to-end emotion-cause pair extraction
    Chenbing Li
    Jie Hu
    Tianrui Li
    Shengdong Du
    Fei Teng
    Applied Intelligence, 2023, 53 : 3519 - 3529
  • [47] An end-to-end multi-task learning to link framework for emotion-cause pair extraction
    Song, Haolin
    Song, Dawei
    2021 INTERNATIONAL CONFERENCE ON IMAGE, VIDEO PROCESSING, AND ARTIFICIAL INTELLIGENCE, 2021, 12076
  • [48] An effective multi-task learning model for end-to-end emotion-cause pair extraction
    Li, Chenbing
    Hu, Jie
    Li, Tianrui
    Du, Shengdong
    Teng, Fei
    APPLIED INTELLIGENCE, 2023, 53 (03) : 3519 - 3529
  • [49] End-to-end Delay Analysis for Coded Computing with Multi-Task Arrival
    Ji, Zhongming
    Chen, Li
    Chen, Xiaohui
    Wei, Guo
    2022 14TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING, WCSP, 2022, : 398 - 403
  • [50] SDAPNet: End-to-End Multi-task Simultaneous Detection and Prediction Network
    Ye, Shanding
    Yao, Han
    Wang, Wenfu
    Fu, Yongjian
    Pan, Zhijie
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,