Vision-Based Multi-Task Manipulation for Inexpensive Robots Using End-To-End Learning from Demonstration

被引：0

作者：

Rahmatizadeh, Rouhollah ^{[1
]}

Abolghasemi, Pooya ^{[1
]}

Boloni, Ladislau ^{[1
]}

Levine, Sergey ^{[2
]}

机构：

[1] Univ Cent Florida, Orlando, FL 32816 USA

[2] Univ Calif Berkeley, Berkeley, CA 94720 USA

来源：

2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA) | 2018年

基金：

美国国家科学基金会;

关键词：

TASK;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We propose a technique for multi-task learning from demonstration that trains the controller of a low-cost robotic arm to accomplish several complex picking and placing tasks, as well as non-prehensile manipulation. The controller is a recurrent neural network using raw images as input and generating robot arm trajectories, with the parameters shared across the tasks. The controller also combines VAE-GAN-based reconstruction with autoregressive multimodal action prediction. Our results demonstrate that it is possible to learn complex manipulation tasks, such as picking up a towel, wiping an object, and depositing the towel to its previous position, entirely from raw images with direct behavior cloning. We show that weight sharing and reconstruction-based regularization substantially improve generalization and robustness, and training on multiple tasks simultaneously increases the success rate on all tasks.

引用

页码：3758 / 3765

页数：8

共 50 条

[31] A Time-domain End-to-End Method for Sound Source Localization Using Multi-Task Learning
Huang, Yankun
Wu, Xihong
Qu, Tianshu
2019 2ND IEEE INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND SIGNAL PROCESSING (ICICSP), 2019, : 52 - 56
[32] A Vision-Based End-to-End Reinforcement Learning Framework for Drone Target Tracking
Zhao, Xun
Huang, Xinjian
Cheng, Jianheng
Xia, Zhendong
Tu, Zhiheng
DRONES, 2024, 8 (11)
[33] VGAI: END-TO-END LEARNING OF VISION-BASED DECENTRALIZED CONTROLLERS FOR ROBOT SWARMS
Hu, Ting-Kuei
Gama, Fernando
Chen, Tianlong
Wang, Zhangyang
Ribeiro, Alejandro
Sadler, Brian M.
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 4900 - 4904
[34] End-to-End Multi-task Learning Regression Network for Fovea Localization in Fundus Images
Huang, Limin
Lei, Haijun
Liu, Weixin
Li, Zhen
Xie, Hai
Lei, Baiying
2022 IEEE 35TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS (CBMS), 2022, : 389 - 393
[35] Multi-Task Learning for End-to-End ASR Word and Utterance Confidence with Deletion Prediction
Qiu, David
He, Yanzhang
Li, Qiujia
Zhang, Yu
Gao, Liangliang
McGraw, Ian
INTERSPEECH 2021, 2021, : 4074 - 4078
[36] SPEECH ENHANCEMENT AIDED END-TO-END MULTI-TASK LEARNING FOR VOICE ACTIVITY DETECTION
Tan, Xu
Zhang, Xiao-Lei
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6823 - 6827
[37] End-to-End Speech Translation With Transcoding by Multi-Task Learning for Distant Language Pairs
Kano, Takatomo
Sakti, Sakriani
Nakamura, Satoshi
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 1342 - 1355
[38] End-to-end Japanese Multi-dialect Speech Recognition and Dialect Identification with Multi-task Learning
Imaizumi, Ryo
Masumura, Ryo
Shiota, Sayaka
Kiya, Hitoshi
APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2022, 11 (01)
[39] Age-Invariant Training for End-to-End Child Speech Recognition using Adversarial Multi-Task Learning
Rumberg, Lars
Ehlert, Hanna
Luedtke, Ulrike
Ostermann, Joern
INTERSPEECH 2021, 2021, : 3850 - 3854
[40] An End-to-end Multi-task Object Detection using Embedded GPU in Autonomous Driving
Zhou, Shanglin
Xie, Mimi
Jin, Yufang
Miao, Fei
Ding, Caiwen
PROCEEDINGS OF THE 2021 TWENTY SECOND INTERNATIONAL SYMPOSIUM ON QUALITY ELECTRONIC DESIGN (ISQED 2021), 2021, : 122 - 128

← 1 2 3 4 5 →