EBiDA-FPN: enhanced bi-directional attention feature pyramid network for object detection

被引:0
作者
Yang, Xiaobao [1 ,2 ]
He, Yulong [2 ]
Wu, Junsheng [3 ]
Wang, Wentao [4 ]
Sun, Wei [2 ]
Ma, Sugang [2 ]
Hou, Zhiqiang [2 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian, Peoples R China
[2] Xian Univ Posts & Telecommun, Sch Comp Sci & Technol, Xian, Peoples R China
[3] Northwestern Polytech Univ, Sch Software, Xian, Peoples R China
[4] Rizhao Branch China Telecom Corp Ltd, Rizhao, Peoples R China
基金
中国国家自然科学基金;
关键词
object detection; convolutional neural network; self-attention; feature pyramid network;
D O I
10.1117/1.JEI.33.2.023013
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
As a fundamental task in computer vision, object detection has long been a challenging visual task. However, current object detection models lack attention to salient features when fusing the lateral connections and top-down information flows in feature pyramid networks (FPNs). To address this, we propose a method for object detection based on an enhanced bi-directional attention feature pyramid network, which aims to enhance the feature representation capability of lateral connections and top-down links in FPN. This method adopts the triplet module to give attention to salient features in the original multi-scale information in spatial and channel dimensions, establishing an enhanced triplet attention. In addition, it introduces improved top and down attention to fuse contextual information using the correlation of features between adjacent scales. Furthermore, adaptively spatial feature fusion and self-attention are introduced to expand the receptive field and improve the detection performance of deep levels. Extensive experiments conducted on the PASCAL VOC, MS COCO, KITTI, and CrowdHuman datasets demonstrate that our method achieves performance gains of 1.8%, 0.8%, 0.5%, and 0.2%, respectively. These results indicate that our method has significant effects and is competitive compared with advanced detectors. (c) 2024 SPIE and IS&T
引用
收藏
页数:16
相关论文
共 50 条
[31]   Two-Layer Attention Feature Pyramid Network for Small Object Detection [J].
Xiang, Sheng ;
Ma, Junhao ;
Shang, Qunli ;
Wang, Xianbao ;
Chen, Defu .
CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2024, 141 (01) :713-731
[32]   An attention-based feature pyramid network for single-stage small object detection [J].
Lin Jiao ;
Chenrui Kang ;
Shifeng Dong ;
Peng Chen ;
Gaoqiang Li ;
Rujing Wang .
Multimedia Tools and Applications, 2023, 82 :18529-18544
[33]   An attention-based feature pyramid network for single-stage small object detection [J].
Jiao, Lin ;
Kang, Chenrui ;
Dong, Shifeng ;
Chen, Peng ;
Li, Gaoqiang ;
Wang, Rujing .
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (12) :18529-18544
[34]   Stacked Pyramid Attention Network for Object Detection [J].
Shijie Hao ;
Zhonghao Wang ;
Fuming Sun .
Neural Processing Letters, 2022, 54 :2759-2782
[35]   Stacked Pyramid Attention Network for Object Detection [J].
Hao, Shijie ;
Wang, Zhonghao ;
Sun, Fuming .
NEURAL PROCESSING LETTERS, 2022, 54 (04) :2759-2782
[36]   Complementary Feature Pyramid Network for Object Detection [J].
Xie, Jin ;
Pang, Yanwei ;
Pan, Jing ;
Nie, Jing ;
Cao, Jiale ;
Han, Jungong .
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (06)
[37]   Gated Feature Pyramid Network for Object Detection [J].
Xie, Xuemei ;
Liao, Quan ;
Ma, Lihua ;
Jin, Xing .
PATTERN RECOGNITION AND COMPUTER VISION (PRCV 2018), PT IV, 2018, 11259 :199-208
[38]   E-FPN: an enhanced feature pyramid network for UAV scenarios detectionE-FPN: an enhanced feature pyramid network for UAV scenarios detectionZ. Li et al. [J].
Zhongxu Li ;
Qihan He ;
Wenyuan Yang .
The Visual Computer, 2025, 41 (1) :675-693
[39]   Oriented Object Detection in Remote Sensing Using an Enhanced Feature Pyramid Network [J].
Zhu, Xinyu ;
Zhou, Wei ;
Wang, Kun ;
He, Bing ;
Fu, Ying ;
Wu, Xi ;
Zhou, Jiliu .
ELECTRONICS, 2023, 12 (17)
[40]   Hierarchical Focused Feature Pyramid Network for Small Object Detection [J].
Wang, Siwei ;
Chen, Zhiwei ;
Ding, Haoyang ;
Cao, Liujuan .
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT XII, 2024, 14436 :432-444