Two-Layer Attention Feature Pyramid Network for Small Object Detection

被引：1

作者：

Xiang, Sheng ^{[1
]}

Ma, Junhao ^{[1
]}

Shang, Qunli ^{[1
]}

Wang, Xianbao ^{[1
]}

Chen, Defu ^{[1
,2
]}

机构：

[1] Zhejiang Univ Technol, Coll Informat Engn, Hangzhou 310023, Peoples R China

[2] ZJUT, Binjiang Cyberspace Secur Inst, Hangzhou 310056, Peoples R China

来源：

CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES | 2024年 / 141卷 / 01期

关键词：

Small object detection; two-layer attention module; small object detail enhancement module; feature pyramid network;

D O I：

10.32604/cmes.2024.052759

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

Effective small object detection is crucial in various applications including urban intelligent transportation and pedestrian detection. However, small objects are difficult to detect accurately because they contain less information. Many current methods, particularly those based on Feature Pyramid Network (FPN), address this challenge by leveraging multi-scale feature fusion. However, existing FPN-based methods often suffer from inadequate feature fusion due to varying resolutions across different layers, leading to suboptimal small object detection. To address this problem, we propose the Two-layer Attention Feature Pyramid Network (TA-FPN), featuring two key modules: the Two-layer Attention Module (TAM) and the Small Object Detail Enhancement Module (SODEM). TAM uses the attention module to make the network more focused on the semantic information of the object and fuse it to the lower layer, so that each layer contains similar semantic information, to alleviate the problem of small object information being submerged due to semantic gaps between different layers. At the same time, SODEM is introduced to strengthen the local features of the object, suppress background noise, enhance the information details of the small object, and fuse the enhanced features to other feature layers to ensure that each layer is rich in small object information, to improve small object detection accuracy. Our extensive experiments on challenging datasets such as Microsoft Common Objects in Context (MS COCO) and Pattern Analysis Statistical Modelling and Computational Learning, Visual Object Classes (PASCAL VOC) demonstrate the validity of the proposed method. Experimental results show a significant improvement in small object detection accuracy compared to state-of-theart detectors.

引用

页码：713 / 731

页数：19

共 37 条

[1] SOD-MTGAN: Small Object Detection via Multi-Task Generative Adversarial Network [J].

Bai, Yancheng ;

Zhang, Yongqiang ;

Ding, Mingli ;

Ghanem, Bernard .

COMPUTER VISION - ECCV 2018, PT XIII, 2018, 11217 :210-226

[2] A full data augmentation pipeline for small object detection based on generative adversarial networks [J].

Bosquet, Brais ;

Cores, Daniel ;

Seidenari, Lorenzo ;

Brea, Victor M. ;

Mucientes, Manuel ;

Del Bimbo, Alberto .

PATTERN RECOGNITION, 2023, 133

[3] Extended Feature Pyramid Network for Small Object Detection [J].

Deng, Chunfang ;

Wang, Mengmeng ;

Liu, Liang ;

Liu, Yong ;

Jiang, Yunliang .

IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 :1968-1979

[4] Multiple spatial residual network for object detection [J].

Dong, Yongsheng ;

Jiang, Zhiqiang ;

Tao, Fazhan ;

Fu, Zhumu .

COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (02) :1347-1362

[5] Region-Based Convolutional Networks for Accurate Object Detection and Segmentation [J].

Girshick, Ross ;

Donahue, Jeff ;

Darrell, Trevor ;

Malik, Jitendra .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (01) :142-158

[6] AugFPN: Improving Multi-scale Feature Learning for Object Detection [J].

Guo, Chaoxu ;

Fan, Bin ;

Zhang, Qian ;

Xiang, Shiming ;

Pan, Chunhong .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :12592-12601

[7] Save the Tiny, Save the All: Hierarchical Activation Network for Tiny Object Detection [J].

Guo, Guangqian ;

Chen, Pengfei ;

Yu, Xuehui ;

Han, Zhenjun ;

Ye, Qixiang ;

Gao, Shan .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (01) :221-234

[8]

He KM, 2020, IEEE T PATTERN ANAL, V42, P386, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]

[9] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

[10] Small object detection method with shallow feature fusion network for chip surface defect detection [J].

Huang, Haixin ;

Tang, Xueduo ;

Wen, Feng ;

Jin, Xin .

SCIENTIFIC REPORTS, 2022, 12 (01)

← 1 2 3 4 →