RGB-D Image Multi-Target Detection Method Based on 3D DSF R-CNN

被引：19

作者：

Hu, Qi ^{[1
,2
]}

Zhai, Lang ^{[2
]}

机构：

[1] Changchun Univ Sci & Technol, Weixing Rd 7089, Changchun, Jilin, Peoples R China

[2] Jilin Business & Technol Coll, Coll Engn, Jiutai Econ Dev Area Kalunhu St 1666, Changchun, Jilin, Peoples R China

来源：

INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE | 2019年 / 33卷 / 08期

关键词：

Multi-target detection; depth learning; candidate region; convolution neural network; RGB-D; optimal fusion weight; CONVOLUTIONAL NETWORKS; RECOGNITION ALGORITHM;

D O I：

10.1142/S0218001419540260

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

At present, the application of deep learning algorithms in two-dimensional color image detection is being continuously innovated and broken. With the popularity of depth cameras, color image detection methods with depth information need to be upgraded. To solve this problem, a multi-target detection algorithm based on 3D DSF R-CNN (Double Stream Faster R-CNN, Convolution Neural Network based on Candidate Region) is proposed in this paper. The RGB information and the depth information of the image are given to two input elements of the convolution network with the same structure and weight sharing, and an optimal fusion weight algorithm is used to determine the weight of the fusion target in accordance with the recognition accuracy of the recognition targets under the single modal information, so as to ensure the most efficient fusion result. After several convolution operations, the independent features are extracted and the two networks are fused according to the optimal weights in the convolution layer. With the conducting of convolution and extract the fused features, and finally get the output through the full link layer. Compared with the previous two-dimensional convolution network algorithm, this algorithm improves the detection rate and success rate while ensuring the detection time. The experimental result shows that this method has strong robustness for complex illumination and partial occlusion, and has excellent detection results under non-restrictive conditions.

引用

页数：15

共 29 条

[1]

[Anonymous], IEEE T INFORM THEORY

[2]

[Anonymous], 2015, Nature, DOI [10.1038/nature14539, DOI 10.1038/NATURE14539]

[3]

[Anonymous], DEEP LEARNING

[4]

[Anonymous], PROC CVPR IEEE

[5]

[Anonymous], ARXIV13013572V2

[6]

[Anonymous], PLOS COMPUTATIONAL B

[7]

[Anonymous], ARXIV160608677

[8]

[Anonymous], ARXIV150205082

[9] Representation Learning: A Review and New Perspectives [J].

Bengio, Yoshua ;

Courville, Aaron ;

Vincent, Pascal .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1798-1828

[10] Sparse components of images and optimal atomic decompositions [J].

Donoho, DL .

CONSTRUCTIVE APPROXIMATION, 2001, 17 (03) :353-382

← 1 2 3 →