Deep Learning-Based 6-DoF Object Pose Estimation Considering Synthetic Dataset

被引:0
作者
Zheng, Tianyu [1 ]
Zhang, Chunyan [1 ]
Zhang, Shengwen [1 ]
Wang, Yanyan [1 ]
机构
[1] Jiangsu Univ Sci & Technol, Sch Mech Engineer, Zhenjiang 212100, Peoples R China
关键词
6-DoF object pose estimation; synthetic dataset; deep learning; bilateral filtering; CBAM-CDAE;
D O I
10.3390/s23249854
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Due to the difficulty in generating a 6-Degree-of-Freedom (6-DoF) object pose estimation dataset, and the existence of domain gaps between synthetic and real data, existing pose estimation methods face challenges in improving accuracy and generalization. This paper proposes a methodology that employs higher quality datasets and deep learning-based methods to reduce the problem of domain gaps between synthetic and real data and enhance the accuracy of pose estimation. The high-quality dataset is obtained from Blenderproc and it is innovatively processed using bilateral filtering to reduce the gap. A novel attention-based mask region-based convolutional neural network (R-CNN) is proposed to reduce the computation cost and improve the model detection accuracy. Meanwhile, an improved feature pyramidal network (iFPN) is achieved by adding a layer of bottom-up paths to extract the internalization of features of the underlying layer. Consequently, a novel convolutional block attention module-convolutional denoising autoencoder (CBAM-CDAE) network is proposed by presenting channel attention and spatial attention mechanisms to improve the ability of AE to extract images' features. Finally, an accurate 6-DoF object pose is obtained through pose refinement. The proposed approach is compared to other models using the T-LESS and LineMOD datasets. Comparison results demonstrate the proposed approach outperforms the other estimation models.
引用
收藏
页数:24
相关论文
共 44 条
  • [1] YOLACT Real-time Instance Segmentation
    Bolya, Daniel
    Zhou, Chong
    Xiao, Fanyi
    Lee, Yong Jae
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9156 - 9165
  • [2] 6IMPOSE: bridging the reality gap in 6D pose estimation for robotic grasping
    Cao, Hongpeng
    Dirnberger, Lukas
    Bernardini, Daniele
    Piazza, Cristina
    Caccamo, Marco
    [J]. FRONTIERS IN ROBOTICS AND AI, 2023, 10
  • [3] TensorMask: A Foundation for Dense Object Segmentation
    Chen, Xinlei
    Girshick, Ross
    He, Kaiming
    Dollar, Piotr
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 2061 - 2069
  • [4] Marker-Less 3d Object Recognition and 6d Pose Estimation for Homogeneous Textureless Objects: An RGB-D Approach
    Hajari, Nasim
    Bustillo, Gabriel Lugo
    Sharma, Harsh
    Cheng, Irene
    [J]. SENSORS, 2020, 20 (18) : 1 - 22
  • [5] He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/ICCV.2017.322, 10.1109/TPAMI.2018.2844175]
  • [6] He Y, 2020, IEEE C COMP VIS PATT, P11629, DOI [10.1109/CVPR42600.2020.01165, DOI 10.1109/CVPR42600.2020.01165]
  • [7] Hinterstoisser S, 2012, LECT NOTES COMPUT SC, V7585, P593, DOI 10.1007/978-3-642-33885-4_60
  • [8] Hodan T, 2019, IEEE IMAGE PROC, P66, DOI [10.1109/ICIP.2019.8803821, 10.1109/icip.2019.8803821]
  • [9] BOP: Benchmark for 6D Object Pose Estimation
    Hodan, Tomas
    Michel, Frank
    Brachmann, Eric
    Kehl, Wadim
    Buch, Anders Glent
    Kraft, Dirk
    Drost, Bertram
    Vidal, Joel
    Ihrke, Stephan
    Zabulis, Xenophon
    Sahin, Caner
    Manhardt, Fabian
    Tombari, Federico
    Kim, Tae-Kyun
    Matas, Jiri
    Rother, Carsten
    [J]. COMPUTER VISION - ECCV 2018, PT X, 2018, 11214 : 19 - 35
  • [10] T-LESS: An RGB-D Dataset for 6D Pose Estimation of Texture-less Objects
    Hodan, Tomas
    Haluza, Pavel
    Obdrzalek, Stepan
    Matas, Jiri
    Lourakis, Manolis
    Zabulis, Xenophon
    [J]. 2017 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2017), 2017, : 880 - 888