Small Object Detection of High-Resolution Images Based on Feature Fusion and Learnable Anchor

被引:0
|
作者
Li C. [1 ]
Huang X.-Y. [1 ]
Wang K. [1 ]
机构
[1] School of Computer, Hubei University of Technology, Hubei, Wuhan
来源
Tien Tzu Hsueh Pao/Acta Electronica Sinica | 2022年 / 50卷 / 07期
关键词
feature fusion; high resolution images; learnable anchor; small object detection;
D O I
10.12263/DZXB.20200917
中图分类号
学科分类号
摘要
Small object detection of high-resolution images presents significant challenges. To solve the problem that downsampling and cropping of high-resolution images result in missed detections and false detections due to the loss of fine details and contextual information, an algorithm based on feature fusion and learnable anchor is proposed for small object detection of high-resolution images. Contextual and detailed features are extracted from downsampled images and cropped patches respectively, which are then fused layer-wise. The fused features are further combined with smoothed features to strengthen both fine details and contextual information. To mitigate the feature inconsistency, learnable anchor is applied to make the fused features accommodative to the location and shape of anchors. The proposed method is tested from the perspective of global inference and local inference compared to state-of-the-art detectors. The experimental results show the accuracy and effectiveness of the proposed method. © 2022 Chinese Institute of Electronics. All rights reserved.
引用
收藏
页码:1684 / 1695
页数:11
相关论文
共 32 条
  • [1] KISANTAL M, WOJNA Z, MURAWSKI J, Et al., Augmentation for small object detection, 9th International Conference on Advances in Computing and Information Technology(ACITY 2019), pp. 119-133, (2019)
  • [2] LIU Y, LIU H Y, FAN J L, Et al., A survey of research and application of small object detection based on deep learning, Acta Electronica Sinica, 48, 3, pp. 590-601, (2020)
  • [3] LIN T Y, MAIRE M, BELONGIE S, Et al., Microsoft COCO: Common Objects in Context, European Conference on Computer Vision, pp. 740-755, (2014)
  • [4] BODLA N, SINGH B, CHELLAPPA R, Et al., Soft-NMS-Improving object detection with one line of code, 2017 IEEE International Conference on Computer Vision, pp. 5562-5570, (2017)
  • [5] LI B Q, HE Y Y, QIANG W, Et al., SSD with parallel additional feature extraction network for ground small target detection, Acta Electronica Sinica, 48, 1, pp. 84-91, (2020)
  • [6] HE K M, ZHANG X Y, REN S Q, Et al., Deep residual learning for image recognition, 2016 IEEE Conference on Computer Vision and Pattern Recognition, pp. 770-778, (2016)
  • [7] LIN T Y, DOLLAR P, GIRSHICK R, Et al., Feature pyramid networks for object detection, 2017 IEEE Conference on Computer Vision and Pattern Recognition, pp. 936-944, (2017)
  • [8] PANG J M, CHEN K, SHI J P, Et al., Libra R-CNN: Towards balanced learning for object detection, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR), pp. 821-830, (2019)
  • [9] PEI W, XU Y M, ZHU Y Y, Et al., The target detection method of aerial photography images with improved SSD, Journal of Software, 30, 3, pp. 738-758, (2019)
  • [10] HUANG J P, SHI Y H, GAO Y., Multi-scale faster-RCNN algorithm for small object detection, Journal of Computer Research and Development, 56, 2, pp. 319-327, (2019)