Deep Learning-Based 6-DoF Object Pose Estimation Considering Synthetic Dataset

Cited by: 0

Authors:

Zheng, Tianyu [1]

Zhang, Chunyan [1]

Zhang, Shengwen [1]

Wang, Yanyan [1]

Affiliations:

[1] Jiangsu Univ Sci & Technol, Sch Mech Engineer, Zhenjiang 212100, Peoples R China

Source:

SENSORS | 2023 / Volume 23 / Issue 24

Keywords:

6-DoF object pose estimation; synthetic dataset; deep learning; bilateral filtering; CBAM-CDAE;

DOI:

10.3390/s23249854

Chinese Library Classification (CLC) number:

O65 [Analytical Chemistry];

Subject classification codes:

070302 ; 081704 ;

Abstract:

Because 6-degree-of-freedom (6-DoF) object pose estimation datasets are difficult to generate and a domain gap exists between synthetic and real data, existing pose estimation methods struggle to improve accuracy and generalization. This paper proposes a methodology that combines a higher-quality dataset with deep learning-based methods to narrow the domain gap between synthetic and real data and improve the accuracy of pose estimation. The dataset is generated with BlenderProc and then processed with bilateral filtering to reduce the gap. A novel attention-based Mask region-based convolutional neural network (Mask R-CNN) is proposed to reduce computation cost and improve detection accuracy. In addition, an improved feature pyramid network (iFPN) is obtained by adding a bottom-up path layer to better extract features from the underlying layers. A novel convolutional block attention module-convolutional denoising autoencoder (CBAM-CDAE) network is then proposed, which introduces channel attention and spatial attention mechanisms to improve the autoencoder's ability to extract image features. Finally, an accurate 6-DoF object pose is obtained through pose refinement. The proposed approach is compared with other models on the T-LESS and LineMOD datasets, and the comparison results show that it outperforms them.
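
The bilateral filtering step mentioned in the abstract can be illustrated with a minimal sketch. This record does not give the paper's filter parameters or implementation, so everything below is an assumption for illustration only: the function name `bilateral_filter`, the parameters `radius`, `sigma_space`, and `sigma_range`, and the use of a grayscale image stored as a list of rows. The sketch uses the standard bilateral filter formulation, where each output pixel is a weighted average of its neighbours and the weight is the product of a spatial Gaussian and an intensity (range) Gaussian.

```python
import math


def bilateral_filter(image, radius=2, sigma_space=1.5, sigma_range=25.0):
    """Edge-preserving smoothing of a 2-D grayscale image (list of rows).

    Hypothetical helper for illustration; parameter values are assumptions,
    not the settings used in the paper.
    """
    height, width = len(image), len(image[0])
    output = [[0.0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            center = image[y][x]
            weighted_sum = 0.0
            weight_total = 0.0
            for dy in range(-radius, radius + 1):
                for dx in range(-radius, radius + 1):
                    ny, nx = y + dy, x + dx
                    if not (0 <= ny < height and 0 <= nx < width):
                        continue  # skip neighbours that fall outside the image
                    neighbour = image[ny][nx]
                    # Spatial term: nearby pixels count more.
                    spatial = math.exp(-(dx * dx + dy * dy) / (2 * sigma_space ** 2))
                    # Range term: pixels with similar intensity count more,
                    # which is what keeps edges sharp.
                    tonal = math.exp(-((neighbour - center) ** 2) / (2 * sigma_range ** 2))
                    weight = spatial * tonal
                    weighted_sum += weight * neighbour
                    weight_total += weight
            # The centre pixel always contributes weight 1, so the total is never 0.
            output[y][x] = weighted_sum / weight_total
    return output
```

A small `sigma_range` leaves strong edges almost untouched while still averaging out low-amplitude noise, which is plausibly why such a filter helps bring rendered images closer to camera images. For real image sizes a library routine such as OpenCV's `cv2.bilateralFilter` would be the practical choice; the pure-Python loop above is only meant to show the weighting.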


Pages: 24

References: 44 in total

