Towards Deep Learning-based 6D Bin Pose Estimation in 3D Scans

被引:1
作者
Gajdosech, Lukas [1 ,2 ]
Kocur, Viktor [2 ,4 ]
Stuchlik, Martin [1 ]
Hudec, Lukas [3 ]
Madaras, Martin [1 ,2 ]
机构
[1] Skeletex Res, Karlova Ves, Slovakia
[2] Comenius Univ, Fac Math Phys & Informat, Bratislava, Slovakia
[3] Slovak Tech Univ Bratislava, Fac Informat & Informat Technol, Bratislava, Slovakia
[4] Brno Univ Technol, Fac Informat Technol, Brno, Czech Republic
来源
PROCEEDINGS OF THE 17TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS (VISAPP), VOL 4 | 2022年
关键词
Computer Vision; Bin Pose Estimation; 6D Pose Estimation; Deep Learning; Point Clouds;
D O I
10.5220/0010878200003124
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An automated robotic system needs to be as robust as possible and fail-safe in general while having relatively high precision and repeatability. Although deep learning-based methods are becoming research standard on how to approach 3D scan and image processing tasks, the industry standard for processing this data is still analytically-based. Our paper claims that analytical methods are less robust and harder for testing, updating, and maintaining. This paper focuses on a specific task of 6D pose estimation of a bin in 3D scans. Therefore, we present a high-quality dataset composed of synthetic data and real scans captured by a structured-light scanner with precise annotations. Additionally, we propose two different methods for 6D bin pose estimation, an analytical method as the industrial standard and a baseline data-driven method. Both approaches are cross-evaluated, and our experiments show that augmenting the training on real scans with synthetic data improves our proposed data-driven neural model. This position paper is preliminary, as proposed methods are trained and evaluated on a relatively small initial dataset which we plan to extend in the future.
引用
收藏
页码:545 / 552
页数:8
相关论文
共 25 条
  • [1] A METHOD FOR REGISTRATION OF 3-D SHAPES
    BESL, PJ
    MCKAY, ND
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1992, 14 (02) : 239 - 256
  • [2] Bukschat Y., 2020, EFFICIENTPOSE EFFICI
  • [3] Model Globally, Match Locally: Efficient and Robust 3D Object Recognition
    Drost, Bertram
    Ulrich, Markus
    Navab, Nassir
    Ilic, Slobodan
    [J]. 2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 998 - 1005
  • [4] Efficient Center Voting for Object Detection and 6D Pose Estimation in 3D Point Cloud
    Guo, Jianwei
    Xing, Xuejun
    Quan, Weize
    Yan, Dong-Ming
    Gu, Qingyi
    Liu, Yang
    Zhang, Xiaopeng
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 5072 - 5084
  • [5] Pose estimation and adaptable grasp configuration with point cloud registration and geometry understanding for fruit grasp planning
    Guo, Ning
    Zhang, Baohua
    Zhou, Jun
    Zhan, Ketian
    Lai, Shuang
    [J]. COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2020, 179
  • [6] Deep Residual Learning for Image Recognition
    He, Kaiming
    Zhang, Xiangyu
    Ren, Shaoqing
    Sun, Jian
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 770 - 778
  • [7] He Y., 2020, Pvn3d: A deep point-wise 3d keypoints voting network for 6dof pose estimation
  • [8] Hinterstoisser S, 2012, P AS C COMP VIS, P548, DOI DOI 10.1007/978-3-642-37331-2_42
  • [9] Hodan Tomas, 2020, Computer Vision - ECCV 2020 Workshops. Proceedings. Lecture Notes in Computer Science (LNCS 12536), P577, DOI 10.1007/978-3-030-66096-3_39
  • [10] Hodan T, 2020, PROC CVPR IEEE, P11700, DOI 10.1109/CVPR42600.2020.01172