MORE FOR LESS: INSIGHTS INTO CONVOLUTIONAL NETS FOR 3D POINT CLOUD RECOGNITION

Citations: 0
Authors
Shafiq, Usama [1]
Taj, Murtaza [1]
Ali, Mohsen [2]
Affiliations
[1] LUMS Syed Babar Ali Sch Sci & Engn, Lahore, Pakistan
[2] Informat Technol Univ, Lahore, Pakistan
Source
2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2017
Keywords
Point cloud; 3DOR; recognition; deep learning;
DOI
Not available
Chinese Library Classification number
TB8 [Photographic technology];
Discipline classification code
0804;
Abstract
With the recent breakthroughs in commodity 3D imaging solutions such as depth sensing, photogrammetry, stereoscopic vision, and structured light, 3D shape recognition is becoming an increasingly important problem. A longstanding question is which format should be used for the 3D shape (voxel, mesh, point cloud, etc.) and what would be a good generic feature representation for shape recognition. This question is particularly important in the context of convolutional neural networks (CNNs), whose efficacy and complexity depend on the choice of input shape format and the design of the network. It has been seen that both 3D voxel representations and collections of rendered views as 2D images produce competitive results. Similarly, it has been seen that networks with a few million parameters and networks with several hundred million parameters have similar performance. In this work we compare these solutions and analyze the factors that increase the number of parameters without significantly improving accuracy. On the basis of this analysis we propose a representation method (point cloud to 2D grid) and an architecture that result in far fewer parameters for the CNN while retaining competitive accuracy.
Pages: 1607-1611
Number of pages: 5