Speedup 3-D Texture-Less Object Recognition Against Self-Occlusion for Intelligent Manufacturing

被引：40

作者：

Cong, Yang ^{[1
]}

Tian, Dongying ^{[1
]}

Feng, Yun ^{[1
]}

Fan, Baojie ^{[2
]}

Yu, Haibin ^{[1
]}

机构：

[1] Chinese Acad Sci, Shenyang Inst Automat, State Key Lab Robot, Shenyang 110016, Liaoning, Peoples R China

[2] Nanjing Univ Posts & Telecommun, Coll Automat, Nanjing 210042, Jiangsu, Peoples R China

来源：

IEEE TRANSACTIONS ON CYBERNETICS | 2019年 / 49卷 / 11期

关键词：

Hough voting; hypothesis generation; k-d tree; local reference frame (LRF); object recognition; pose estimation; 3D; EFFICIENT; FEATURES; IMAGES; MODEL;

D O I：

10.1109/TCYB.2018.2851666

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Realtime 3-D object detection and 6-DOF pose estimation in clutter background is crucial for intelligent manufacturing, for example, robot feeding and assembly, where robustness and efficiency are the two most desirable goals. Especially for various metal parts with a textless surface, it is hard for most state of the arts to extract robust feature from the clutter background with various occlusions. To overcome this, in this paper, we propose an online 3-D object detection and pose estimation method to overcome self-occlusion for textureless objects. For feature representation, we only adopt the raw 3-D point clouds with normal cues to define our local reference frame and we automatically learn the compact 3-D feature from the simple local normal statistics via autoencoder. For a similarity search, a new basis buffer k-d tree method is designed without suffering branch divergence; therefore, ours can maximize the GPU parallel processing capabilities especially in practice. We then generate the hypothesis candidates via the hough voting, filter the false hypotheses, and refine the pose estimation via the iterative closest point strategy. For the experiments, we build a new 3-D dataset including industrial objects with heavy self-occlusions and conduct various comparisons with the state of the arts to justify the effectiveness and efficiency of our method.

引用

页码：3887 / 3897

页数：11

共 47 条

[1] A Novel Multi-Purpose Matching Representation of Local 3D Surfaces: A Rotationally Invariant, Efficient, and Highly Discriminative Approach With an Adjustable Sensitivity [J].

Al-Osaimi, Faisal R. .

IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (02) :658-672

[2]

Aldoma A, 2013, IEEE INT CONF ROBOT, P2104, DOI 10.1109/ICRA.2013.6630859

[3]

Aldoma A, 2012, LECT NOTES COMPUT SC, V7574, P511, DOI 10.1007/978-3-642-33712-3_37

[4]

[Anonymous], BMVC

[5] Seeing 3D chairs: exemplar part-based 2D-3D alignment using a large dataset of CAD models [J].

Aubry, Mathieu ;

Maturana, Daniel ;

Efros, Alexei A. ;

Russell, Bryan C. ;

Sivic, Josef .

2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :3762-3769

[6] 3D Geometric Scale Variability in Range Images: Features and Descriptors [J].

Bariya, Prabin ;

Novatnack, John ;

Schwartz, Gabriel ;

Nishino, Ko .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2012, 99 (02) :232-255

[7] A METHOD FOR REGISTRATION OF 3-D SHAPES [J].

BESL, PJ ;

MCKAY, ND .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1992, 14 (02) :239-256

[8]

Brachmann E, 2014, LECT NOTES COMPUT SC, V8690, P536, DOI 10.1007/978-3-319-10605-2_35

[9] In Search of Inliers: 3D Correspondence by Local and Global Voting [J].

Buch, Anders Glent ;

Yang, Yang ;

Kruger, Norbert ;

Petersen, Henrik Gordon .

2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :2075-2082

[10] Towards Scalable Summarization of Consumer Videos Via Sparse Dictionary Selection [J].

Cong, Yang ;

Yuan, Junsong ;

Luo, Jiebo .

IEEE TRANSACTIONS ON MULTIMEDIA, 2012, 14 (01) :66-75

← 1 2 3 4 5 →