Attention Based Visual Analysis for Fast Grasp Planning With a Multi-Fingered Robotic Hand

被引:9
作者
Deng, Zhen [1 ]
Gao, Ge [1 ]
Frintrop, Simone [1 ]
Sun, Fuchun [2 ]
Zhang, Changshui [2 ]
Zhang, Jianwei [1 ]
机构
[1] Univ Hamburg, Dept Informat, Hamburg, Germany
[2] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China
基金
美国国家科学基金会;
关键词
grasp planning; grasp type; visual attention; deep learning; multi-fingered robotic hand; OBJECT; SYSTEM; MODEL;
D O I
10.3389/fnbot.2019.00060
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present an attention based visual analysis framework to compute grasp-relevant information which helps to guide grasp planning using a multi-fingered robotic hand. Our approach uses a computational visual attention model to locate regions of interest in a scene and employ a deep convolutional neural network to detect grasp type and grasp attention point for a sub-region of the object in a region of interest. We demonstrate the proposed framework with object grasping tasks, in which the information generated from the proposed framework is used as prior information to guide grasp planning. The effectiveness of the proposed approach is evaluated in both simulation experiments and real-world experiments. Experimental results show that the proposed framework can not only speed up grasp planning with more stable configurations, but also handle unknown objects. Furthermore, our framework can handle cluttered scenarios. A new Grasp Type Dataset (GTD) which includes six commonly used grasp types and covers 12 household objects is also presented.
引用
收藏
页数:12
相关论文
共 40 条
[1]   A 3D shape segmentation approach for robot grasping by parts [J].
Aleotti, Jacopo ;
Caselli, Stefano .
ROBOTICS AND AUTONOMOUS SYSTEMS, 2012, 60 (03) :358-366
[2]  
[Anonymous], INT C LEARN REPR SAN
[3]  
[Anonymous], 2015, ICLR
[4]   Top-down versus bottom-up attentional control: a failed theoretical dichotomy [J].
Awh, Edward ;
Belopolsky, Artem V. ;
Theeuwes, Jan .
TRENDS IN COGNITIVE SCIENCES, 2012, 16 (08) :437-443
[5]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[6]   The Yale human grasping dataset: Grasp, object, and task data in household and machine shop environments [J].
Bullock, Ian M. ;
Feix, Thomas ;
Dollar, Aaron M. .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2015, 34 (03) :251-255
[7]   An Ego-Vision System for Hand Grasp Analysis [J].
Cai, Minjie ;
Kitani, Kris M. ;
Sato, Yoichi .
IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 2017, 47 (04) :524-535
[8]  
Cai MJ, 2015, IEEE INT CONF ROBOT, P1360, DOI 10.1109/ICRA.2015.7139367
[9]   Benchmarking in Manipulation Research Using the Yale-CMU-Berkeley Object and Model Set [J].
Calli, Berk ;
Walsman, Aaron ;
Singh, Arjun ;
Srinivasa, Siddhartha ;
Abbeel, Pieter ;
Dollar, Aaron M. .
IEEE ROBOTICS & AUTOMATION MAGAZINE, 2015, 22 (03) :36-52
[10]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848