Target detection under complex background based on deep learning

被引：0

作者：

Wang H.-M. ^{[1
]}

Wang X.-G. ^{[2
]}

Wang X.-Y. ^{[1
]}

机构：

[1] School of Astronautics, Northwestern Polytechnical University, Xi’an

[2] AVIC Computing Technique Research Institute, Xi’an

来源：

Kongzhi yu Juece/Control and Decision | 2022年 / 37卷 / 12期

关键词：

complex background; deep learning; feature fusion; hard example; SSD algorithm; target detection;

D O I：

10.13195/j.kzyjc.2021.0686

中图分类号：

学科分类号：

摘要：

Object detection is an important research in the field of computer vision. Traditional target detection methods spend a lot of time on feature extraction, and manual features are not robust to the problem of diverse targets. Deep learning technology has gradually become a breakthrough in computer vision in recent years. By using the classical convolutional neural network VGGNet as a basic network, a network framework for target detection is built by adding some deep networks and combining with the SSD (single shot multi-box detector) algorithm, and an algorithm of feature fusion based SSD (FF-SSD) is proposed. Aiming at the problem of sample imbalance during the training of the model, the original loss function is modified according to the principle of hard example mining. The background is regarded as a simple sample and a modulation factor is introduced to reduce the proportion of background loss to the confidence loss, which makes the model be trained more fully and converge faster, and the target detection accuracy under the complex background is promoted as a result. Meanwhile, for poor detection effect to small targets of the SSD algorithm, the feature pyramid is constructed according to the feature maps extracted from each convolutional layer. Appropriate feature maps are selected and fused to form a new feature map for the prediction. The semantic information fusion is strengthened to enhance the detection accuracy of small targets in order to improve the overall detection accuracy. Experimental results show that the proposed target detection algorithm can achieve high detection accuracy for all kinds of targets in the complex background. © 2022 Northeast University. All rights reserved.

引用

页码：3115 / 3121

页数：6

共 26 条

[1]

Wu X W, Sahoo D, Hoi S C H., Recent advances in deep learning for object detection, Neurocomputing, 396, pp. 39-64, (2020)

[2]

Girshick R, Donahue J, Darrell T, Et al., Region-based convolutional networks for accurate object detection and segmentation, IEEE Transactions on Pattern Analysis and Machine Intelligence, 38, 1, pp. 142-158, (2016)

[3]

Girshick R., Fast R-CNN, IEEE International Conference on Computer Vision, pp. 1440-1448, (2015)

[4]

Uijlings J R R, Sande K, Gevers T, Et al., Selective search for object recognition, International Journal of Computer Vision, 104, 2, pp. 154-171, (2013)

[5]

Ren S Q, He K M, Girshick R, Et al., Faster R-CNN: Towards real-time object detection with region proposal networks, IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 6, pp. 1137-1149, (2017)

[6]

Redmon J, Divvala S, Girshick R, Et al., You only look once: Unified, real-time object detection, IEEE Conference on Computer Vision and Pattern Recognition, pp. 779-788, (2016)

[7]

Redmon J, Farhadi A., YOLO9000: Better, faster, stronger, IEEE Conference on Computer Vision and Pattern Recognition, pp. 6517-6525, (2017)

[8]

Ju M R, Luo H B, Wang Z B, Et al., The application of improved YOLO v3 in multi-scale target detection, Applied Sciences, 9, 18, (2019)

[9]

Redmon J, Farhadi A., YOLO v3: An incremental improvement, IEEE Conference on Computer Vision and Pattern Recognition, pp. 1-6, (2018)

[10]

He W P, Huang Z, Wei Z F, Et al., TF-YOLO: An improved incremental network for real-time object detection, Applied Sciences, 9, 16, (2019)

← 1 2 3 →