An enhanced SSD with feature cross-reinforcement for small-object detection

Cited by: 0
Authors
Lixiong Gong
Xiao Huang
Yinkang Chao
Jialin Chen
Binwen Lei
Affiliations
[1] Hubei University of Technology, School of Mechanical Engineering
Source
Applied Intelligence | 2023 / Vol. 53
Keywords
Object detection; SSD; Attention mechanism; Feature cross reinforcement; Adaptive threshold
DOI
Not available
Chinese Library Classification number
Subject classification number
Abstract
Due to the limited feature information possessed by small objects in images, it is difficult for a single-shot multibox detector (SSD) to quickly notice the important regions of these small image objects. We propose an enhanced SSD based on feature cross-reinforcement (FCR-SSD). For shallow sampling, an improved group shuffling-efficient channel attention (GS-ECA) mechanism is used to make the model focus on the object areas rather than the background. Then, an FCR module allows the multiscale information from the shallow layer to be passed to the subsequent layer and fused to generate an enhanced feature map, which improves the utilization of the context information associated with small objects. We develop an adaptive algorithm for calculating positive and negative candidate box selection thresholds to select positive and negative samples, determine the intersection over union (IOU) thresholds of candidate boxes and ground-truth boxes, and adaptively determine the threshold for each ground-truth box. The proposed FCR-SSD algorithm achieves 79.6% mean average precision (mAP) for the PASCAL VOC 2007 dataset and 30.1% mAP for the MS COCO dataset at 34.2 frames per second (FPS) when run on an RTX 3080Ti GPU. The experimental results show that the FCR-SSD model yields high accuracy and a good detection speed in small-target detection tasks.
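The abstract describes choosing an IoU threshold per ground-truth box but does not give the formula. As a rough illustration only, the sketch below computes IoU between candidate boxes and one ground-truth box, then splits the candidates with a threshold derived from the IoU statistics. The mean-plus-standard-deviation rule and the names `iou`, `adaptive_threshold`, and `assign_samples` are assumptions made for this example, not the method used in FCR-SSD.

```python
from statistics import mean, pstdev


def iou(box_a, box_b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1 = max(box_a[0], box_b[0])
    iy1 = max(box_a[1], box_b[1])
    ix2 = min(box_a[2], box_b[2])
    iy2 = min(box_a[3], box_b[3])
    # Clamp at zero so disjoint boxes give an empty intersection.
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def adaptive_threshold(ious):
    """Hypothetical rule (assumption): mean plus population std of the IoUs."""
    return mean(ious) + pstdev(ious)


def assign_samples(candidates, gt_box):
    """Split candidate indices into positives and negatives for one ground-truth box."""
    ious = [iou(c, gt_box) for c in candidates]
    thr = adaptive_threshold(ious)
    positives = [i for i, v in enumerate(ious) if v >= thr]
    negatives = [i for i, v in enumerate(ious) if v < thr]
    return thr, positives, negatives
```

Because the threshold is recomputed from each ground-truth box's own candidates, a small object whose candidates all overlap it weakly can still receive positive samples, which a single fixed IoU cutoff might deny it.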
Pages: 19449-19465
Number of pages: 16