Vision-Based Multi-Task Manipulation for Inexpensive Robots Using End-To-End Learning from Demonstration

被引:0
|
作者
Rahmatizadeh, Rouhollah [1 ]
Abolghasemi, Pooya [1 ]
Boloni, Ladislau [1 ]
Levine, Sergey [2 ]
机构
[1] Univ Cent Florida, Orlando, FL 32816 USA
[2] Univ Calif Berkeley, Berkeley, CA 94720 USA
基金
美国国家科学基金会;
关键词
TASK;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a technique for multi-task learning from demonstration that trains the controller of a low-cost robotic arm to accomplish several complex picking and placing tasks, as well as non-prehensile manipulation. The controller is a recurrent neural network using raw images as input and generating robot arm trajectories, with the parameters shared across the tasks. The controller also combines VAE-GAN-based reconstruction with autoregressive multimodal action prediction. Our results demonstrate that it is possible to learn complex manipulation tasks, such as picking up a towel, wiping an object, and depositing the towel to its previous position, entirely from raw images with direct behavior cloning. We show that weight sharing and reconstruction-based regularization substantially improve generalization and robustness, and training on multiple tasks simultaneously increases the success rate on all tasks.
引用
收藏
页码:3758 / 3765
页数:8
相关论文
共 50 条
  • [21] An Interactive Multi-Task Learning Network for End-to-End Aspect-Based Sentiment Analysis
    He, Ruidan
    Lee, Wee Sun
    Ng, Hwee Tou
    Dahlmeier, Daniel
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 504 - 515
  • [22] A multi-task learning framework for end-to-end aspect sentiment triplet extraction
    Chen, Fang
    Yang, Zhongliang
    Huang, Yongfeng
    NEUROCOMPUTING, 2022, 479 : 12 - 21
  • [23] An end-to-end multi-task deep learning framework for bronchoscopy image classification
    Setayeshi, Rojin
    Vahidi, Javad
    Kozegar, Ehsan
    Tan, Tao
    MULTIMEDIA SYSTEMS, 2024, 30 (06)
  • [24] Multi-Task Neural Learning Architecture for End-to-End Identification of Helpful Reviews
    Fan, Miao
    Feng, Yue
    Sun, Mingming
    Li, Ping
    Wang, Haifeng
    Wang, Jianmin
    2018 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM), 2018, : 343 - 350
  • [25] Neural multi-task learning for end-to-end Arabic aspect-based sentiment analysis
    Bensoltane, Rajae
    Zaki, Taher
    COMPUTER SPEECH AND LANGUAGE, 2025, 89
  • [26] An End-to-End Multi-Task Deep Learning Framework for Skin Lesion Analysis
    Song, Lei
    Lin, Jianzhe
    Wang, Z. Jane
    Wang, Haoqian
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2020, 24 (10) : 2912 - 2921
  • [27] End-to-End Multi-task Learning for Allusion Detection in Ancient Chinese Poems
    Liu, Lei
    Chen, Xiaoyang
    He, Ben
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT (KSEM 2020), PT II, 2020, 12275 : 300 - 311
  • [28] End-to-end Argument Mining with Cross-corpora Multi-task Learning
    Morio, Gaku
    Ozaki, Hiroaki
    Morishita, Terufumi
    Yanai, Kohsuke
    TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2022, 10 : 639 - 658
  • [29] ATTENTION-AUGMENTED END-TO-END MULTI-TASK LEARNING FOR EMOTION PREDICTION FROM SPEECH
    Zhang, Zixing
    Wu, Bingwen
    Schuller, Bjoern
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6705 - 6709
  • [30] End-to-end multi-task optimization model for task-based dialogue systems
    Zhao F.
    Qiu M.
    Li X.
    Sun Y.
    Yang Z.
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2023, 29 (11): : 3592 - 3599