Learning-Based Point Cloud Registration for 6D Object Pose Estimation in the Real World

Cited by: 9
Authors
Dang, Zheng [1]
Wang, Lizhou [2]
Guo, Yu [2]
Salzmann, Mathieu [1, 3]
Affiliations
[1] Ecole Polytech Fed Lausanne, CVLab, Lausanne, Switzerland
[2] Xi An Jiao Tong Univ, Xian, Shaanxi, Peoples R China
[3] Clearspace, Renens, Switzerland
Source
COMPUTER VISION - ECCV 2022, PT I | 2022 / Vol. 13661
Keywords
6D object pose estimation; Point cloud registration;
DOI
10.1007/978-3-031-19769-7_2
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this work, we tackle the task of estimating the 6D pose of an object from point cloud data. While recent learning-based approaches to addressing this task have shown great success on synthetic datasets, we have observed them to fail in the presence of real-world data. We thus analyze the causes of these failures, which we trace back to the difference between the feature distributions of the source and target point clouds, and the sensitivity of the widely-used SVD-based loss function to the range of rotation between the two point clouds. We address the first challenge by introducing a new normalization strategy, Match Normalization, and the second via the use of a loss function based on the negative log likelihood of point correspondences. Our two contributions are general and can be applied to many existing learning-based 3D object registration frameworks, which we illustrate by implementing them in two of them, DCP and IDAM. Our experiments on the real-scene TUD-L [26], LINEMOD [23] and Occluded-LINEMOD [7] datasets evidence the benefits of our strategies. They allow for the first time learning-based 3D object registration methods to achieve meaningful results on real-world data. We therefore expect them to be key to the future development of point cloud registration methods.
Pages: 19-37
Page count: 19
Related papers
76 entries in total
[41]   SUPER 4PCS Fast Global Pointcloud Registration via Smart Indexing [J].
Mellado, Nicolas ;
Aiger, Dror ;
Mitra, Niloy J. .
COMPUTER GRAPHICS FORUM, 2014, 33 (05) :205-215
[42]   Super Generalized 4PCS for 3D Registration [J].
Mohamad, Mustafa ;
Ahmed, Mirza Tahir ;
Rappaport, David ;
Greenspan, Michael .
2015 INTERNATIONAL CONFERENCE ON 3D VISION, 2015, :598-606
[43]   Pix2Pose: Pixel-Wise Coordinate Regression of Objects for 6D Pose Estimation [J].
Park, Kiru ;
Patten, Timothy ;
Vincze, Markus .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :7667-7676
[44]  
Paszke Adam, 2017, P INT C NEURAL INFOR
[45]   PVNet: Pixel-wise Voting Network for 6DoF Pose Estimation [J].
Peng, Sida ;
Liu, Yuan ;
Huang, Qixing ;
Zhou, Xiaowei ;
Bao, Hujun .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :4556-4565
[46]  
Qi CR, 2017, ADV NEUR IN, V30
[47]   PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation [J].
Qi, Charles R. ;
Su, Hao ;
Mo, Kaichun ;
Guibas, Leonidas J. .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :77-85
[48]   BB8: A Scalable, Accurate, Robust to Partial Occlusion Method for Predicting the 3D Poses of Challenging Objects without Using Depth [J].
Rad, Mahdi ;
Lepetit, Vincent .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :3848-3856
[49]  
Raposo Carolina, 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA), P5652, DOI 10.1109/ICRA.2017.7989664
[50]   SE-Sync: A certifiably correct algorithm for synchronization over the special Euclidean group [J].
Rosen, David M. ;
Carlone, Luca ;
Bandeira, Afonso S. ;
Leonard, John J. .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2019, 38 (2-3) :95-125