Two-Layer Attention Feature Pyramid Network for Small Object Detection

Cited by: 1
Authors
Xiang, Sheng [1]
Ma, Junhao [1]
Shang, Qunli [1]
Wang, Xianbao [1]
Chen, Defu [1, 2]
Affiliations
[1] Zhejiang Univ Technol, Coll Informat Engn, Hangzhou 310023, Peoples R China
[2] ZJUT, Binjiang Cyberspace Secur Inst, Hangzhou 310056, Peoples R China
Source
CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES | 2024 / Vol. 141 / No. 01
Keywords
Small object detection; two-layer attention module; small object detail enhancement module; feature pyramid network
DOI
10.32604/cmes.2024.052759
Chinese Library Classification (CLC) number
T [Industrial Technology]
Discipline classification code
08
Abstract
Effective small object detection is crucial in applications such as urban intelligent transportation and pedestrian detection. However, small objects are difficult to detect accurately because they contain less information. Many current methods, particularly those based on the Feature Pyramid Network (FPN), address this challenge through multi-scale feature fusion. However, existing FPN-based methods often suffer from inadequate feature fusion because resolution varies across layers, which leads to suboptimal small object detection. To address this problem, we propose the Two-layer Attention Feature Pyramid Network (TA-FPN), which features two key modules: the Two-layer Attention Module (TAM) and the Small Object Detail Enhancement Module (SODEM). TAM uses an attention module to focus the network on the semantic information of the object and fuses that information into the lower layer, so that each layer contains similar semantic information; this alleviates the problem of small object information being submerged by semantic gaps between layers. SODEM, in turn, strengthens the local features of the object, suppresses background noise, and enhances the details of small objects, then fuses the enhanced features into the other feature layers so that each layer is rich in small object information, which improves small object detection accuracy. Extensive experiments on challenging datasets such as Microsoft Common Objects in Context (MS COCO) and Pattern Analysis, Statistical Modelling and Computational Learning Visual Object Classes (PASCAL VOC) demonstrate the validity of the proposed method. Experimental results show a significant improvement in small object detection accuracy compared to state-of-the-art detectors.
Pages: 713-731
Number of pages: 19