3D Point Cloud Object Detection with Multi-View Convolutional Neural Network

被引:0
作者
Pang, Guan [1 ]
Neumann, Ulrich [1 ]
机构
[1] Univ Southern Calif, Dept Comp Sci, Los Angeles, CA 90089 USA
来源
2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) | 2016年
关键词
RECOGNITION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Efficient detection of three dimensional (3D) objects in point clouds is a challenging problem. Performing 3D descriptor matching or 3D scanning-window search with detector are both time-consuming due to the 3-dimensional complexity. One solution is to project 3D point cloud into 2D images and thus transform the 3D detection problem into 2D space, but projection at multiple viewpoints and rotations produce a large amount of 2D detection tasks, which limit the performance and complexity of the 2D detection algorithm choice. We propose to use convolutional neural network (CNN) for the 2D detection task, because it can handle all viewpoints and rotations for the same class of object together, as well as predicting multiple classes of objects with the same network, without the need for individual detector for each object class. We further improve the detection efficiency by concatenating two extra levels of early rejection networks with binary outputs before the multi-class detection network. Experiments show that our method has competitive overall performance with at least one-order of magnitude speedup comparing with latest 3D point cloud detection methods.
引用
收藏
页码:585 / 590
页数:6
相关论文
共 28 条
[1]  
[Anonymous], INT C 3D VIS 3DV
[2]  
[Anonymous], 2010, P 10 AS C COMP VIS A
[3]  
[Anonymous], 2010, IEEE T NEURAL NETWOR, V21, P858
[4]  
[Anonymous], 2009, ICCV
[5]   Seeing 3D chairs: exemplar part-based 2D-3D alignment using a large dataset of CAD models [J].
Aubry, Mathieu ;
Maturana, Daniel ;
Efros, Alexei A. ;
Russell, Bryan C. ;
Sivic, Josef .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :3762-3769
[6]   On visual similarity based 3D model retrieval [J].
Chen, DY ;
Tian, XP ;
Shen, YT ;
Ming, OY .
COMPUTER GRAPHICS FORUM, 2003, 22 (03) :223-232
[7]  
Frome A, 2004, LECT NOTES COMPUT SC, V3023, P224
[8]  
Girshick R., 2014, IEEE C COMP VIS PATT, DOI [DOI 10.1109/CVPR.2014.81, 10.1109/CVPR.2014.81]
[9]   Learning Rich Features from RGB-D Images for Object Detection and Segmentation [J].
Gupta, Saurabh ;
Girshick, Ross ;
Arbelaez, Pablo ;
Malik, Jitendra .
COMPUTER VISION - ECCV 2014, PT VII, 2014, 8695 :345-360
[10]  
Habermann D., 2013, BRAZ C INT SYST BRAC