A Kinect-Based 3D Object Detection and Recognition System with Enhanced Depth Estimation Algorithm

被引:0
作者
Elaraby, Ahmed Fawzy [1 ]
Hamdy, Ayman [1 ]
Rehan, Mohamed [2 ]
机构
[1] Cairo Univ, Aerosp Engn Dept, Cairo, Egypt
[2] AvidBeam Technol, Cairo, Egypt
来源
2018 IEEE 9TH ANNUAL INFORMATION TECHNOLOGY, ELECTRONICS AND MOBILE COMMUNICATION CONFERENCE (IEMCON) | 2018年
关键词
kinect; 3D; computer vision; depth estimation; DNN; object detection; object recognition; image processing; ROS;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we present a system for 3D object detection and recognition with an enhanced depth estimation algorithm. The system is based on Microsoft Kinect and it uses state of the art Deep Neural Networks (DNN) for object detection and recognition. In addition, a robust algorithm for depth estimation was developed to overcome Kinect depth image accuracy limitations which are attributed to the presence of noisy pixel values, the large variation of pixel values for the same object and the inaccurate identification of objects bounding box. The proposed depth estimation algorithm uses statistical calculations (i. e. Mean, Median, etc.), to refine the depth image and remove or reduce the effect of the noisy pixels and other limitations. Experimental results validate the behavior and performance of the complete system and showed that the proposed depth estimation algorithm has increased the depth estimation accuracy to 88% from 82% in comparison with traditional algorithms (approximate to 5% accuracy enhancement).
引用
收藏
页码:247 / 252
页数:6
相关论文
共 19 条
[1]  
[Anonymous], 2018, P IEEE C COMP VIS PA
[2]  
[Anonymous], 2014, KINECT ONE DEPTH RAN
[3]  
[Anonymous], 2012, Kinect Sensor
[4]  
[Anonymous], 2014, KINECT COMP
[5]  
Baoliang Wang, 2013, 2013 5th International Conference on Intelligent Human-Machine Systems and Cybernetics (IHMSC), P418, DOI 10.1109/IHMSC.2013.106
[6]  
Camplani M., 2012, 2012 IEEE International Conference on Emerging Signal Processing Applications, P33, DOI 10.1109/ESPA.2012.6152439
[7]  
Chen L, 2012, INT C PATT RECOG, P3070
[8]  
El-laithy RA, 2012, IEEE POSITION LOCAT, P1280, DOI 10.1109/PLANS.2012.6236985
[9]  
Eric N, 2017, INT CONF UBIQ FUTUR, P531
[10]   The Pascal Visual Object Classes (VOC) Challenge [J].
Everingham, Mark ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338