Towards Real-time Object Recognition and Pose Estimation in Point Clouds

被引:1
作者
Marcon, Marlon [1 ]
Pereira Bellon, Olga Regina [2 ]
Silva, Luciano [2 ]
机构
[1] Univ Tecnol Fed Parana, Dapartment Software Engn, Dois Vizinhos, Brazil
[2] Univ Fed Parana, Dept Comp Sci, Curitiba, Parana, Brazil
来源
VISAPP: PROCEEDINGS OF THE 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS - VOL. 5: VISAPP | 2021年
关键词
Transfer Learning; 3D Computer Vision; Feature-based Registration; ICP Dense Registration; RGB-D Images; HISTOGRAMS;
D O I
10.5220/0010265601640174
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Object recognition and 6DoF pose estimation are quite challenging tasks in computer vision applications. Despite efficiency in such tasks, standard methods deliver far from real-time processing rates. This paper presents a novel pipeline to estimate a fine 6DoF pose of objects, applied to realistic scenarios in real-time. We split our proposal into three main parts. Firstly, a Color feature classification leverages the use of pre-trained CNN color features trained on the ImageNet for object detection. A Feature-based registration module conducts a coarse pose estimation, and finally, a Fine-adjustment step performs an ICP-based dense registration. Our proposal achieves, in the best case, an accuracy performance of almost 83% on the RGB-D Scenes dataset. Regarding processing time, the object detection task is done at a frame processing rate up to 90 FPS, and the pose estimation at almost 14 FPS in a full execution strategy. We discuss that due to the proposal's modularity, we could let the full execution occurs only when necessary and perform a scheduled execution that unlocks real-time processing, even for multitask situations.
引用
收藏
页码:164 / 174
页数:11
相关论文
共 37 条
[1]  
Agrawal P, 2014, LECT NOTES COMPUT SC, V8695, P329, DOI 10.1007/978-3-319-10584-0_22
[2]   Tutorial Point Cloud Library Three-Dimensional Object Recognition and 6 DOF Pose Estimation [J].
Aldoma, Aitor ;
Marton, Zoltan-Csaba ;
Tombari, Federico ;
Wohlkinger, Walter ;
Potthast, Christian ;
Zeisl, Bernhard ;
Rusu, Radu Bogdan ;
Gedikli, Suat ;
Vincze, Markus .
IEEE ROBOTICS & AUTOMATION MAGAZINE, 2012, 19 (03) :80-91
[3]   A Multi-Modal, Discriminative and Spatially Invariant CNN for RGB-D Object Labeling [J].
Asif, Umar ;
Bennamoun, Mohammed ;
Sohel, Ferdous A. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (09) :2051-2065
[4]  
BESL PJ, 1992, SENSOR FUSION 4 CONT, P586, DOI DOI 10.1117/12.57955
[5]  
Bo L., 2013, EXPT ROBOTICS, P387, DOI DOI 10.1007/978-3-319-00065-727
[6]  
Caglayan A., 2020, ARXIV PREPRINT ARXIV
[7]   OBJECT MODELING BY REGISTRATION OF MULTIPLE RANGE IMAGES [J].
CHEN, Y ;
MEDIONI, G .
IMAGE AND VISION COMPUTING, 1992, 10 (03) :145-155
[8]  
Choi S, 2015, PROC CVPR IEEE, P5556, DOI 10.1109/CVPR.2015.7299195
[9]   Deep Global Registration [J].
Choy, Christopher ;
Dong, Wei ;
Koltun, Vladlen .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :2511-2520
[10]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848