Towards Deep Learning-based 6D Bin Pose Estimation in 3D Scans

被引：1

作者：

Gajdosech, Lukas ^{[1
,2
]}

Kocur, Viktor ^{[2
,4
]}

Stuchlik, Martin ^{[1
]}

Hudec, Lukas ^{[3
]}

Madaras, Martin ^{[1
,2
]}

机构：

[1] Skeletex Res, Karlova Ves, Slovakia

[2] Comenius Univ, Fac Math Phys & Informat, Bratislava, Slovakia

[3] Slovak Tech Univ Bratislava, Fac Informat & Informat Technol, Bratislava, Slovakia

[4] Brno Univ Technol, Fac Informat Technol, Brno, Czech Republic

来源：

PROCEEDINGS OF THE 17TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 4 | 2022年

关键词：

Computer Vision; Bin Pose Estimation; 6D Pose Estimation; Deep Learning; Point Clouds;

D O I：

10.5220/0010878200003124

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

An automated robotic system needs to be as robust as possible and fail-safe in general while having relatively high precision and repeatability. Although deep learning-based methods are becoming research standard on how to approach 3D scan and image processing tasks, the industry standard for processing this data is still analytically-based. Our paper claims that analytical methods are less robust and harder for testing, updating, and maintaining. This paper focuses on a specific task of 6D pose estimation of a bin in 3D scans. Therefore, we present a high-quality dataset composed of synthetic data and real scans captured by a structured-light scanner with precise annotations. Additionally, we propose two different methods for 6D bin pose estimation, an analytical method as the industrial standard and a baseline data-driven method. Both approaches are cross-evaluated, and our experiments show that augmenting the training on real scans with synthetic data improves our proposed data-driven neural model. This position paper is preliminary, as proposed methods are trained and evaluated on a relatively small initial dataset which we plan to extend in the future.

引用

页码：545 / 552

页数：8

共 25 条

[1] A METHOD FOR REGISTRATION OF 3-D SHAPES [J].

BESL, PJ ;

MCKAY, ND .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1992, 14 (02) :239-256

[2]

Bukschat Yannick, 2020, Efficientpose: An efficient, accurate and scalable end-to-end 6d multi object pose estimation approach

[3] Model Globally, Match Locally: Efficient and Robust 3D Object Recognition [J].

Drost, Bertram ;

Ulrich, Markus ;

Navab, Nassir ;

Ilic, Slobodan .

2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, :998-1005

[4] Efficient Center Voting for Object Detection and 6D Pose Estimation in 3D Point Cloud [J].

Guo, Jianwei ;

Xing, Xuejun ;

Quan, Weize ;

Yan, Dong-Ming ;

Gu, Qingyi ;

Liu, Yang ;

Zhang, Xiaopeng .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 :5072-5084

[5] Pose estimation and adaptable grasp configuration with point cloud registration and geometry understanding for fruit grasp planning [J].

Guo, Ning ;

Zhang, Baohua ;

Zhou, Jun ;

Zhan, Ketian ;

Lai, Shuang .

COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2020, 179

[6] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

[7]

He Y., 2020, Pvn3d: A deep point-wise 3d keypoints voting network for 6dof pose estimation

[8]

Hinterstoisser S., 2013, ACCV, P548, DOI [DOI 10.1007/978-3-642-37331-242, 10.1007/978-3- 642-37331-2_42, DOI 10.1007/978-3-642-37331-2_42]

[9] BOP Challenge 2020 on 6D Object Localization [J].

Hodan, Tomas ;

Sundermeyer, Martin ;

Drost, Bertram ;

Labbe, Yann ;

Brachmann, Eric ;

Michel, Frank ;

Rother, Carsten ;

Matas, Jiri .

COMPUTER VISION - ECCV 2020 WORKSHOPS, PT II, 2020, 12536 :577-594

[10] EPOS: Estimating 6D Pose of Objects with Symmetries [J].

Hodan, Tomas ;

Barath, Daniel ;

Matas, Jiri .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :11700-11709

← 1 2 3 →