RGB-D Image Multi-Target Detection Method Based on 3D DSF R-CNN

Cited by: 19
Authors
Hu, Qi [1, 2]
Zhai, Lang [2 ]
Affiliations
[1] Changchun Univ Sci & Technol, Weixing Rd 7089, Changchun, Jilin, Peoples R China
[2] Jilin Business & Technol Coll, Coll Engn, Jiutai Econ Dev Area Kalunhu St 1666, Changchun, Jilin, Peoples R China
Keywords
Multi-target detection; depth learning; candidate region; convolution neural network; RGB-D; optimal fusion weight; CONVOLUTIONAL NETWORKS; RECOGNITION ALGORITHM;
DOI
10.1142/S0218001419540260
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Deep learning algorithms for two-dimensional color image detection continue to advance and achieve breakthroughs. With the growing popularity of depth cameras, color image detection methods need to be upgraded to make use of depth information. To address this, the paper proposes a multi-target detection algorithm based on 3D DSF R-CNN (Double Stream Faster R-CNN, a region-proposal-based convolutional neural network). The RGB information and the depth information of the image are fed to two input streams of a convolutional network that have the same structure and share weights. An optimal fusion weight algorithm sets the fusion weight of each target according to the recognition accuracy achieved for that target with each single modality, so as to obtain the most effective fusion result. After several convolution operations, independent features are extracted from each stream, and the two networks are fused in a convolution layer according to the optimal weights. Further convolutions then extract the fused features, and the final output is produced by the fully connected layers. Compared with previous two-dimensional convolutional network algorithms, this algorithm improves the detection rate and success rate while maintaining detection time. The experimental results show that the method is robust to complex illumination and partial occlusion, and that it detects well under unconstrained conditions.
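The two-stream pipeline described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the abstract does not state the fusion-weight formula, so normalizing the two single-modality accuracies to sum to 1 is an assumption, and all function names (`conv2d_valid`, `fusion_weights`, `fuse`) are hypothetical.

```python
def conv2d_valid(image, kernel):
    """Slide `kernel` over a 2D `image` (no padding, stride 1).

    Calling this with the same kernel for both the RGB and the depth
    stream is what "weight sharing" means in this sketch.
    """
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [
        [
            sum(
                image[i + di][j + dj] * kernel[di][dj]
                for di in range(kh)
                for dj in range(kw)
            )
            for j in range(out_w)
        ]
        for i in range(out_h)
    ]


def fusion_weights(acc_rgb, acc_depth):
    """Turn two single-modality accuracies into weights that sum to 1.

    ASSUMPTION: simple normalization; the paper's actual rule may differ.
    """
    total = acc_rgb + acc_depth
    if total <= 0:
        return 0.5, 0.5  # no information: weight both streams equally
    return acc_rgb / total, acc_depth / total


def fuse(feat_rgb, feat_depth, w_rgb, w_depth):
    """Element-wise weighted sum of two same-shaped 2D feature maps."""
    if len(feat_rgb) != len(feat_depth) or any(
        len(a) != len(b) for a, b in zip(feat_rgb, feat_depth)
    ):
        raise ValueError("feature maps must have the same shape")
    return [
        [w_rgb * a + w_depth * b for a, b in zip(row_a, row_b)]
        for row_a, row_b in zip(feat_rgb, feat_depth)
    ]


# Usage: one shared kernel for both streams, then accuracy-weighted fusion.
shared_kernel = [[1.0, 0.0], [0.0, 1.0]]
rgb_features = conv2d_valid([[1.0, 2.0], [3.0, 4.0]], shared_kernel)
depth_features = conv2d_valid([[0.0, 1.0], [1.0, 0.0]], shared_kernel)
w_rgb, w_depth = fusion_weights(0.8, 0.6)
fused = fuse(rgb_features, depth_features, w_rgb, w_depth)
```

In a real detector the fused map would pass through further convolution layers and the fully connected layers; that part is omitted here because the abstract gives no architectural detail.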
Pages: 15