Fast and automatic object pose estimation for range images on the GPU

被引:25
作者
Park, In Kyu [1 ]
Germann, Marcel [2 ]
Breitenstein, Michael D. [3 ]
Pfister, Hanspeter [4 ]
机构
[1] Inha Univ, Sch Informat & Commun Engn, Inchon 402751, South Korea
[2] ETH, Swiss Fed Inst Technol, Comp Graph Lab, CH-8092 Zurich, Switzerland
[3] ETH, Swiss Fed Inst Technol, Comp Vis Lab, CH-8092 Zurich, Switzerland
[4] Harvard Univ, Sch Engn & Appl Sci, Cambridge, MA 02138 USA
关键词
Object pose estimation; Bin picking; Range image processing; General purpose GPU programming; Iterative closest point; Euclidean distance transform; Downhill simplex; CUDA; RECOGNITION; REGISTRATION; MODEL; SEGMENTATION;
D O I
10.1007/s00138-009-0209-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a pose estimation method for rigid objects from single range images. Using 3D models of the objects, many pose hypotheses are compared in a data-parallel version of the downhill simplex algorithm with an image-based error function. The pose hypothesis with the lowest error value yields the pose estimation (location and orientation), which is refined using ICP. The algorithm is designed especially for implementation on the GPU. It is completely automatic, fast, robust to occlusion and cluttered scenes, and scales with the number of different object types. We apply the system to bin picking, and evaluate it on cluttered scenes. Comprehensive experiments on challenging synthetic and real-world data demonstrate the effectiveness of our method.
引用
收藏
页码:749 / 766
页数:18
相关论文
共 34 条
[1]   RANGE IMAGE SEGMENTATION WITH APPLICATIONS TO ROBOT BIN-PICKING USING VACUUM GRIPPER [J].
ALHUJAZI, E ;
SOOD, A .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1990, 20 (06) :1313-1325
[2]  
[Anonymous], COMP UN DEV ARCH CUD
[3]  
[Anonymous], 2003, TR200396 MITS EL RES
[4]   Properties of plant plasma membrane lipid models - bilayer permeability and monolayer behaviour of glucosylceramide and phosphatidic acid in phospholipid mixtures [J].
Berglund, AH ;
Norberg, P ;
Quartacci, MF ;
Nilsson, R ;
Liljenberg, C .
PHYSIOLOGIA PLANTARUM, 2000, 109 (02) :117-122
[5]   A METHOD FOR REGISTRATION OF 3-D SHAPES [J].
BESL, PJ ;
MCKAY, ND .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1992, 14 (02) :239-256
[6]  
Blanz Volker., 1999, P 26 ANN C COMPUTER, P187, DOI DOI 10.1145/311535.311556
[7]   Laser ranging and video imaging for bin picking [J].
Boughorbel, F ;
Zhang, Y ;
Kang, SK ;
Chidambaram, U ;
Abidi, B ;
Koschan, A ;
Abidi, M .
ASSEMBLY AUTOMATION, 2003, 23 (01) :53-59
[8]  
Breitenstein MD, 2008, PROC CVPR IEEE, P3613
[9]   OBJECT MODELING BY REGISTRATION OF MULTIPLE RANGE IMAGES [J].
CHEN, Y ;
MEDIONI, G .
IMAGE AND VISION COMPUTING, 1992, 10 (03) :145-155
[10]   COSMOS - A representation scheme for 3D free-form objects [J].
Dorai, C ;
Jain, AK .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (10) :1115-1130