CCAN: Constraint Co-Attention Network for Instance Grasping

被引:0
|
作者
Cai, Junhao [1 ]
Tao, Xuefeng [1 ]
Cheng, Hui [1 ]
Zhang, Zhanpeng [2 ]
机构
[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou, Peoples R China
[2] Sensetime Grp Ltd, Shenzhen, Peoples R China
关键词
D O I
10.1109/icra40945.2020.9197182
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Instance grasping is a challenging robotic grasping task when a robot aims to grasp a specified target object in cluttered scenes. In this paper, we propose a novel end-to-end instance grasping method using only monocular workspace and query images, where the workspace image includes several objects and the query image only contains the target object. To effectively extract discriminative features and facilitate the training process, a learning-based method, referred to as Constraint Co-Attention Network (CCAN), is proposed which consists of a constraint co-attention module and a grasp affordance predictor. An effective co-attention module is presented to construct the features of a workspace image from the extracted features of the query image. By introducing soft constraints into the co-attention module, it highlights the target object's features while trivializes other objects' features in the workspace image. Using the features extracted from the co-attention module, the cascaded grasp affordance interpreter network only predicts the grasp configuration for the target object. The training of the CCAN is totally based on simulated self-supervision. Extensive qualitative and quantitative experiments show the effectiveness of our method both in simulated and real-world environments even for totally unseen objects.
引用
收藏
页码:8353 / 8359
页数:7
相关论文
共 50 条
  • [31] Progressive Co-Attention Network for Fine-Grained Visual Classification
    Zhang, Tian
    Chang, Dongliang
    Ma, Zhanyu
    Guo, Jun
    2021 INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2021,
  • [32] Multimodal Fusion with Co-attention Mechanism
    Li, Pei
    Li, Xinde
    PROCEEDINGS OF 2020 23RD INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION 2020), 2020, : 607 - 614
  • [33] Co-Attention for Conditioned Image Matching
    Wiles, Olivia
    Ehrhardt, Sebastien
    Zisserman, Andrew
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 15915 - 15924
  • [34] Multimodal Emotion Recognition Using a Modified Dense Co-Attention Symmetric Network
    Zhao, Zhi-Wei
    Liu, Wei
    Lu, Bao-Liang
    2021 10TH INTERNATIONAL IEEE/EMBS CONFERENCE ON NEURAL ENGINEERING (NER), 2021, : 73 - 76
  • [35] Multi-Modal Co-Attention Capsule Network for Fake News Detection
    Yin, Chunyan
    Chen, Yongheng
    OPTICAL MEMORY AND NEURAL NETWORKS, 2024, 33 (01) : 13 - 27
  • [36] Co-attention Based Feature Fusion Network for Spam Review Detection on Douban
    Cai, Huanyu
    Yu, Ke
    Zhou, Yuhao
    Wu, Xiaofei
    NEURAL PROCESSING LETTERS, 2022, 54 (06) : 5251 - 5271
  • [37] Deep Interleaved Network for Image Super-Resolution With Asymmetric Co-Attention
    Li, Feng
    Cong, Runming
    Bai, Huihui
    He, Yifan
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 537 - 543
  • [38] Context-aware and co-attention network based image captioning model
    Sharma, Himanshu
    Srivastava, Swati
    IMAGING SCIENCE JOURNAL, 2023, 71 (03): : 244 - 256
  • [39] Spatio-temporal co-attention fusion network for video splicing localization
    Lin, Man
    Cao, Gang
    Lou, Zijie
    Zhang, Chi
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (03) : 33027
  • [40] Multi-Modal Co-Attention Capsule Network for Fake News Detection
    Optical Memory and Neural Networks, 2024, 33 : 13 - 27