Category-Level 6D Object Pose Estimation With Structure Encoder and Reasoning Attention

被引:10
|
作者
Liu, Jierui [1 ,2 ]
Cao, Zhiqiang [1 ,2 ]
Tang, Yingbo [1 ,2 ]
Liu, Xilong [1 ,2 ]
Tan, Min [1 ,2 ]
机构
[1] Chinese Acad Sci, State Key Lab Management & Control Complex Syst, Inst Automat, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
Shape; Three-dimensional displays; Cognition; Pose estimation; Feature extraction; Decoding; Solid modeling; Category-level; 6D object pose estimation; structure encoder; reasoning attention;
D O I
10.1109/TCSVT.2022.3169144
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Category-level 6D object pose estimation has gained popularity and it is still challenging due to the diversity of different instances within the same category. In this paper, a novel category-level 6D object pose estimation framework with structure encoder and reasoning attention is proposed. A structure autoencoder is introduced to mine the shared structure features in the color images within the same category, via a distinct learning strategy that recovers the image of another instance but with the most similar pose to the input. On this basis, a reasoning attention decoder and full connected layers are stacked to form a rotation prediction network, where the structure features and 3D shape features are integrated and projected to a semantic space. The semantic space includes observed patterns and learnable patterns, which are better learned by adding a shortcut connection branch parallel to reasoning attention decoder with gradient decouple. Further reasoning based on these patterns endows the decoder with powerful feature representation. Without 3D object models, the proposed method models the attributes of category implicitly in the semantic space and better performance of 6D object pose estimation is guaranteed by reasoning on this space. The effectiveness of the proposed method is verified by the results on public datasets and actual experiments.
引用
收藏
页码:6728 / 6740
页数:13
相关论文
共 50 条
  • [1] An efficient network for category-level 6D object pose estimation
    Sun, Shantong
    Liu, Rongke
    Sun, Shuqiao
    Yang, Xinxin
    Lu, Guangshan
    SIGNAL IMAGE AND VIDEO PROCESSING, 2021, 15 (07) : 1643 - 1651
  • [2] CatFormer: Category-Level 6D Object Pose Estimation with Transformer
    Yu, Sheng
    Zhai, Di-Hua
    Xia, Yuanqing
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 7, 2024, : 6808 - 6816
  • [3] RANSAC Optimization for Category-level 6D Object Pose Estimation
    Chen, Ying
    Kang, Guixia
    Wang, Yiping
    2020 5TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2020), 2020, : 50 - 56
  • [4] An efficient network for category-level 6D object pose estimation
    Shantong Sun
    Rongke Liu
    Shuqiao Sun
    Xinxin Yang
    Guangshan Lu
    Signal, Image and Video Processing, 2021, 15 : 1643 - 1651
  • [5] Normalized Object Coordinate Space for Category-Level 6D Object Pose and Size Estimation
    Wang, He
    Sridhar, Srinath
    Huang, Jingwei
    Valentin, Julien
    Song, Shuran
    Guibas, Leonidas J.
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 2637 - 2646
  • [6] SGPA: Structure-Guided Prior Adaptation for Category-Level 6D Object Pose Estimation
    Chen, Kai
    Dou, Qi
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 2753 - 2762
  • [7] Attention-guided RGB-D Fusion Network for Category-level 6D Object Pose Estimation
    Wang, Hao
    Li, Weiming
    Kim, Jiyeon
    Wang, Qiang
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 10651 - 10658
  • [8] Category-Level 6D Object Pose Recovery in Depth Images
    Sahin, Caner
    Kim, Tae-Kyun
    COMPUTER VISION - ECCV 2018 WORKSHOPS, PT I, 2019, 11129 : 665 - 681
  • [9] SD-Pose: Structural Discrepancy Aware Category-Level 6D Object Pose Estimation
    Li, Guowei
    Zhu, Dongchen
    Zhang, Guanghui
    Shi, Wenjun
    Zhang, Tianyu
    Zhang, Xiaolin
    Li, Jiamao
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 5674 - 5683
  • [10] GSNet: Model Reconstruction Network for Category-level 6D Object Pose and Size Estimation
    Liu, Penglei
    Zhang, Qieshi
    Cheng, Jun
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 2898 - 2904