RGB-D OBJECT RECOGNITION WITH MULTIMODAL DEEP CONVOLUTIONAL NEURAL NETWORKS

被引:0
作者
Rahman, Mohammad Muntasir [1 ]
Tan, Yanhao [1 ]
Xue, Jian [1 ]
Lu, Ke [1 ]
机构
[1] Univ Chinese Acad Sci, Sch Engn Sci, Beijing 100049, Peoples R China
来源
2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME) | 2017年
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Object recognition; RGB-D data; Deep neural networks; Multi-modal feature learning;
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Object recognition from RGB-D images has become a hot topic and gained a significant popularity in recent years due to its numerous applications. In this paper, we propose a novel multimodal deep convolutional neural networks architecture for RGB-D object recognition which composed of three streams with two different types of deep CNNs, where each stream can separately learn from each modality. Finally, we propose a combined architecture of joint network of these three streams to classify the objects. Compared to RGB data, RGB-D images provide additional depth information that can be represented as depth colorization methods or surface normals. Our goal is to exploit both colorization and surface normals information to encode depth images. We show that by utilizing both colorization and surface normals of depth images combined with RGB significantly can improves the classification accuracy. We evaluate our model on one of the most challenging RGB-D object dataset and achieves comparable performance to state-of-the-art methods.
引用
收藏
页码:991 / 996
页数:6
相关论文
共 24 条
[1]  
[Anonymous], 2013, INT C LEARN REPR ICL
[2]  
[Anonymous], CAFFENET
[3]  
[Anonymous], 2012, Advances in Neural Information Processing Systems
[4]  
[Anonymous], 2014, P AS C COMP VIS
[5]  
Asif U, 2015, IEEE INT CONF ROBOT, P1295, DOI 10.1109/ICRA.2015.7139358
[6]  
Blum M, 2012, IEEE INT CONF ROBOT, P1298, DOI 10.1109/ICRA.2012.6225188
[7]  
Bo L., 2013, EXPT ROBOTICS, P387, DOI DOI 10.1007/978-3-319-00065-7
[8]  
Bo LF, 2011, IEEE INT C INT ROBOT, P821, DOI 10.1109/IROS.2011.6048717
[9]  
Bo LF, 2011, PROC CVPR IEEE, P1729, DOI 10.1109/CVPR.2011.5995719
[10]   Convolutional Fisher Kernels for RGB-D Object Recognition [J].
Cheng, Yanhua ;
Cai, Rui ;
Zhao, Xin ;
Huang, Kaiqi .
2015 INTERNATIONAL CONFERENCE ON 3D VISION, 2015, :135-143