Tactile-Visual Fusion Based Robotic Grasp Detection Method with a Reproducible Sensor

Citations: 0
Authors
Song, Yaoxian [1 ,2 ,3 ]
Luo, Yun [3 ]
Yu, Changbin [4 ,5 ,6 ]
Affiliations
[1] Fudan Univ, Sch Comp Sci, Shanghai, Peoples R China
[2] Fudan Univ, Inst Intelligent Robots, Shanghai, Peoples R China
[3] Westlake Univ, Sch Engn, Hangzhou, Peoples R China
[4] Shandong First Med Univ, Coll Artificial Intelligence & Big Data, Tai An, Shandong, Peoples R China
[5] Shandong Acad Med Sci, Tai An, Shandong, Peoples R China
[6] Univ Johannesburg, Fac Engn & Built Environm, Johannesburg, South Africa
Keywords
Tactile sensor; Tactile-visual dataset; Multi-modal fusion; Deep learning; Grasp detection; Vision; Model
DOI
10.2991/ijcis.d.210531.001
CLC Classification
TP18 [Artificial Intelligence Theory]
Discipline Codes
081104; 0812; 0835; 1405
Abstract
Robotic grasp detection is a fundamental problem in robotic manipulation. Conventional grasp methods, which use vision information only, can cause damage in force-sensitive tasks. In this paper, we propose a tactile-visual fusion method using a reproducible sensor to realize fine-grained, haptic grasping. Although several tactile-based methods exist, they require expensive custom sensors paired with their own specific datasets. To overcome these limitations, we introduce a low-cost, reproducible tactile fingertip and build a general tactile-visual fusion grasp dataset comprising 5,110 grasping trials. We further propose a hierarchical encoder-decoder neural network to predict grasp points and force in an end-to-end manner. We then compare our method with state-of-the-art methods on the benchmark under both vision-based and tactile-visual fusion schemes, and our method outperforms them in most scenarios. Furthermore, we compare our fusion method against a vision-only method in a physical experiment, and the results indicate that our end-to-end method gives the robot a more fine-grained grasping ability, reducing force redundancy by 41%. Our project is available at https://sites.google.com/view/tvgd (c) 2021 The Authors. Published by Atlantis Press B.V.
Pages: 1753-1762 (10 pages)