Towards Real-time Object Recognition and Pose Estimation in Point Clouds

被引：1

作者：

Marcon, Marlon ^{[1
]}

Pereira Bellon, Olga Regina ^{[2
]}

Silva, Luciano ^{[2
]}

机构：

[1] Univ Tecnol Fed Parana, Dapartment Software Engn, Dois Vizinhos, Brazil

[2] Univ Fed Parana, Dept Comp Sci, Curitiba, Parana, Brazil

来源：

VISAPP: PROCEEDINGS OF THE 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS - VOL. 5: VISAPP | 2021年

关键词：

Transfer Learning; 3D Computer Vision; Feature-based Registration; ICP Dense Registration; RGB-D Images; HISTOGRAMS;

D O I：

10.5220/0010265601640174

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Object recognition and 6DoF pose estimation are quite challenging tasks in computer vision applications. Despite efficiency in such tasks, standard methods deliver far from real-time processing rates. This paper presents a novel pipeline to estimate a fine 6DoF pose of objects, applied to realistic scenarios in real-time. We split our proposal into three main parts. Firstly, a Color feature classification leverages the use of pre-trained CNN color features trained on the ImageNet for object detection. A Feature-based registration module conducts a coarse pose estimation, and finally, a Fine-adjustment step performs an ICP-based dense registration. Our proposal achieves, in the best case, an accuracy performance of almost 83% on the RGB-D Scenes dataset. Regarding processing time, the object detection task is done at a frame processing rate up to 90 FPS, and the pose estimation at almost 14 FPS in a full execution strategy. We discuss that due to the proposal's modularity, we could let the full execution occurs only when necessary and perform a scheduled execution that unlocks real-time processing, even for multitask situations.

引用

页码：164 / 174

页数：11

共 37 条

[1]

Agrawal P, 2014, LECT NOTES COMPUT SC, V8695, P329, DOI 10.1007/978-3-319-10584-0_22

[2] Tutorial Point Cloud Library Three-Dimensional Object Recognition and 6 DOF Pose Estimation [J].

Aldoma, Aitor ;

Marton, Zoltan-Csaba ;

Tombari, Federico ;

Wohlkinger, Walter ;

Potthast, Christian ;

Zeisl, Bernhard ;

Rusu, Radu Bogdan ;

Gedikli, Suat ;

Vincze, Markus .

IEEE ROBOTICS & AUTOMATION MAGAZINE, 2012, 19 (03) :80-91

[3] A Multi-Modal, Discriminative and Spatially Invariant CNN for RGB-D Object Labeling [J].

Asif, Umar ;

Bennamoun, Mohammed ;

Sohel, Ferdous A. .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (09) :2051-2065

[4]

BESL PJ, 1992, SENSOR FUSION 4 CONT, P586, DOI DOI 10.1117/12.57955

[5]

Bo L., 2013, EXPT ROBOTICS, P387, DOI DOI 10.1007/978-3-319-00065-727

[6]

Caglayan A., 2020, ARXIV PREPRINT ARXIV

[7] OBJECT MODELING BY REGISTRATION OF MULTIPLE RANGE IMAGES [J].

CHEN, Y ;

MEDIONI, G .

IMAGE AND VISION COMPUTING, 1992, 10 (03) :145-155

[8]

Choi S, 2015, PROC CVPR IEEE, P5556, DOI 10.1109/CVPR.2015.7299195

[9] Deep Global Registration [J].

Choy, Christopher ;

Dong, Wei ;

Koltun, Vladlen .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :2511-2520

[10]

Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848

← 1 2 3 4 →