A multi-scale feature fusion target detection algorithm

被引:0
作者
Dong, Chong [1 ]
Li, Jingmei [1 ]
Wang, Jiaxiang [1 ]
机构
[1] Harbin Engn Univ, Grad Sch Comp, Harbin 150001, Heilongjiang, Peoples R China
来源
2018 INTERNATIONAL CONFERENCE ON IMAGE AND VIDEO PROCESSING, AND ARTIFICIAL INTELLIGENCE | 2018年 / 10836卷
关键词
Target detection; convolutional neural network; Deep learning; Feature extraction;
D O I
10.1117/12.2514046
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For existing Faster R-CNN and single shot multibox detector (SSD) target detection algorithms, they all have the problem of low object detection accuracy under small target conditions. This paper proposes a general and effective target detection algorithm and the detection accuracy has greatly improved for smaller targets. The algorithm is divided into two parts. In the first part, in the feature extraction process, the feature map extracted by the basic feature extraction network is deconvoluted and merged with the previous layer feature map to generate Multi-scale feature maps with rich semantics and high resolution. Using proposed multi-scale feature maps to generate proposals. The second part uses the generated proposals to be sent to the Faster R-CNN network for classification and detection. Experiments show that using this algorithm for target detection can not only improve the recall of proposals, but also improve the accuracy of target detection, especially for small targets. The algorithm provides a new idea for small target detection.
引用
收藏
页数:8
相关论文
共 9 条
[1]  
[Anonymous], 2015, CVPR
[2]  
[Anonymous], 2015, REAL TIME OBJECT DET, DOI DOI 10.1109/CVPR.2016.91
[3]  
[Anonymous], 2000, NATURE STAT LEARNING, DOI DOI 10.1007/978-1-4757-3264-1
[4]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[5]  
Liu W, 2015, EUR C COMP VIS, V9905, P21
[6]   Distinctive image features from scale-invariant keypoints [J].
Lowe, DG .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 60 (02) :91-110
[7]   Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks [J].
Ren, Shaoqing ;
He, Kaiming ;
Girshick, Ross ;
Sun, Jian .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (06) :1137-1149
[8]  
Simonyan K., 2015, ICLR
[9]   Selective Search for Object Recognition [J].
Uijlings, J. R. R. ;
van de Sande, K. E. A. ;
Gevers, T. ;
Smeulders, A. W. M. .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2013, 104 (02) :154-171