Vision-Based Multi-Task Manipulation for Inexpensive Robots Using End-To-End Learning from Demonstration

被引:0
|
作者
Rahmatizadeh, Rouhollah [1 ]
Abolghasemi, Pooya [1 ]
Boloni, Ladislau [1 ]
Levine, Sergey [2 ]
机构
[1] Univ Cent Florida, Orlando, FL 32816 USA
[2] Univ Calif Berkeley, Berkeley, CA 94720 USA
基金
美国国家科学基金会;
关键词
TASK;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a technique for multi-task learning from demonstration that trains the controller of a low-cost robotic arm to accomplish several complex picking and placing tasks, as well as non-prehensile manipulation. The controller is a recurrent neural network using raw images as input and generating robot arm trajectories, with the parameters shared across the tasks. The controller also combines VAE-GAN-based reconstruction with autoregressive multimodal action prediction. Our results demonstrate that it is possible to learn complex manipulation tasks, such as picking up a towel, wiping an object, and depositing the towel to its previous position, entirely from raw images with direct behavior cloning. We show that weight sharing and reconstruction-based regularization substantially improve generalization and robustness, and training on multiple tasks simultaneously increases the success rate on all tasks.
引用
收藏
页码:3758 / 3765
页数:8
相关论文
共 50 条
  • [31] A Time-domain End-to-End Method for Sound Source Localization Using Multi-Task Learning
    Huang, Yankun
    Wu, Xihong
    Qu, Tianshu
    2019 2ND IEEE INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND SIGNAL PROCESSING (ICICSP), 2019, : 52 - 56
  • [32] A Vision-Based End-to-End Reinforcement Learning Framework for Drone Target Tracking
    Zhao, Xun
    Huang, Xinjian
    Cheng, Jianheng
    Xia, Zhendong
    Tu, Zhiheng
    DRONES, 2024, 8 (11)
  • [33] VGAI: END-TO-END LEARNING OF VISION-BASED DECENTRALIZED CONTROLLERS FOR ROBOT SWARMS
    Hu, Ting-Kuei
    Gama, Fernando
    Chen, Tianlong
    Wang, Zhangyang
    Ribeiro, Alejandro
    Sadler, Brian M.
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 4900 - 4904
  • [34] End-to-End Multi-task Learning Regression Network for Fovea Localization in Fundus Images
    Huang, Limin
    Lei, Haijun
    Liu, Weixin
    Li, Zhen
    Xie, Hai
    Lei, Baiying
    2022 IEEE 35TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS (CBMS), 2022, : 389 - 393
  • [35] Multi-Task Learning for End-to-End ASR Word and Utterance Confidence with Deletion Prediction
    Qiu, David
    He, Yanzhang
    Li, Qiujia
    Zhang, Yu
    Gao, Liangliang
    McGraw, Ian
    INTERSPEECH 2021, 2021, : 4074 - 4078
  • [36] SPEECH ENHANCEMENT AIDED END-TO-END MULTI-TASK LEARNING FOR VOICE ACTIVITY DETECTION
    Tan, Xu
    Zhang, Xiao-Lei
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6823 - 6827
  • [37] End-to-End Speech Translation With Transcoding by Multi-Task Learning for Distant Language Pairs
    Kano, Takatomo
    Sakti, Sakriani
    Nakamura, Satoshi
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 1342 - 1355
  • [38] End-to-end Japanese Multi-dialect Speech Recognition and Dialect Identification with Multi-task Learning
    Imaizumi, Ryo
    Masumura, Ryo
    Shiota, Sayaka
    Kiya, Hitoshi
    APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2022, 11 (01)
  • [39] Age-Invariant Training for End-to-End Child Speech Recognition using Adversarial Multi-Task Learning
    Rumberg, Lars
    Ehlert, Hanna
    Luedtke, Ulrike
    Ostermann, Joern
    INTERSPEECH 2021, 2021, : 3850 - 3854
  • [40] An End-to-end Multi-task Object Detection using Embedded GPU in Autonomous Driving
    Zhou, Shanglin
    Xie, Mimi
    Jin, Yufang
    Miao, Fei
    Ding, Caiwen
    PROCEEDINGS OF THE 2021 TWENTY SECOND INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN (ISQED 2021), 2021, : 122 - 128