An attention-based feature pyramid network for single-stage small object detection

被引：0

作者：

Lin Jiao

Chenrui Kang

Shifeng Dong

Peng Chen

Gaoqiang Li

Rujing Wang

机构：

[1] Anhui Unviersity,School of Internet

[2] Hefei Institutes of Physical Science,Institute of Intelligent Machines

[3] Chinese Academy of Science,undefined

[4] Southwest University of Science and Technology,undefined

[5] University of Science and Technology of China,undefined

来源：

Multimedia Tools and Applications | 2023年 / 82卷

关键词：

Object detection; Feature pyramid network; Feature fusion; Single-stage; Small object;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Recently, single-stage detection methods have made great progress, achieving comparable accuracy to two-stage detection methods. However, they have poor performance over small object detection. In this work, we improve the performance of the single-stage detector for detecting objects of small sizes. The proposed method makes two major novel contributions. The first is to devise an attention-based feature pyramid network (aFPN) by introducing a learnable fusion factor for controlling feature information that deep layers deliver to shallow layers. The design of a learnable fusion factor could adapt a feature pyramid network to small object detection. The second contribution is to propose a soft-weighted loss function, which reduces the false attention during network training. To be specify, we reweight the contribution of training samples to the network loss according to their distances with the boundaries of the ground-truth box, leading to fewer false-positive detections. To verify the performance of the proposed method, we conduct extensive experiments on different datasets by comparing including RetinaNet, ATSS, FCOS, FreeAnchor, et al. Experimental results show that our method can achieve 44.2% AP on MS COCO dataset, 23.0% AP on VisDrone dataset, which significantly gains improvements with nearly no computation overhead.

引用

页码：18529 / 18544

页数：15

共 56 条

[1]

Hu J(2020)Squeeze-and-excitation networks IEEE Trans Pattern Anal Mach Intell 42 2011-2023

[2]

Shen L(2020)Foveabox: beyound anchor-based object detection IEEE Trans Image Process 29 7389-7398

[3]

Albanie S(2021)Detecting handcrafted facial image manipulations and gan-generated facial images using shallow-fakefacenet Appl Soft Comput 105 256-3109

[4]

Sun G(2020)Classification of white blood cells using deep features obtained from convolutional neural network models based on the combination of feature selection methods Appl Soft Comput 97 810-8338

[5]

Wu E(2020)Ai-assisted ct imaging analysis for covid-19 screening: building and deploying a medical ai system Appl Soft Comput 98 897-undefined

[6]

Kong T(2021)S-rpn: sampling-balanced region proposal network for small crop pest detection Comput Electron Agric 187 106,290-undefined

[7]

Sun F(2021)Learning to match anchors for visual object detection IEEE Trans Pattern Anal Mach Intell 44 3096-undefined

[8]

Liu H(2020)Matnet: motion-attentive transition network for zero-shot video object segmentation IEEE Trans Image Process 29 8326-undefined

[9]

Jiang Y(2021)Textmountain: accurate scene text detection via instance segmentation Pattern Recogn 110 107,336-undefined

[10]

Li L(undefined)undefined undefined undefined undefined-undefined

← 1 2 3 4 5 6 →