Enhancement-fusion feature pyramid network for object detection

被引:2
作者
Dong, Shifeng [1 ,2 ]
Wang, Rujing [1 ,2 ]
Du, Jianming [1 ]
Jiao, Lin [1 ]
机构
[1] Chinese Acad Sci, Inst Intelligent Machines, Hefei Inst Phys Sci, Hefei, Peoples R China
[2] Chinac Anhui Univ, Univ Sci & Technol China, Hefei, Peoples R China
关键词
object detection; feature pyramid network; feature fusion;
D O I
10.1117/1.JEI.32.1.013045
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Scale variation is one of the challenges of object detection. Most state-of-the-art object detectors depend on feature pyramid networks (FPN) for multiscale learning to deal with this problem, in which feature fusion is an essential operation. However, feature fusion does not sufficiently address the difficulty of the detection task. This paper presents an enhancement-fusion feature pyramid network (EFPN) to obtain reliable object representations for object detectors. Specifically, it contains a feature enhancement module (FEM) and a bottom-up path module (BPM). The FEM is used to eliminate the negative impact of the uneven distribution of object scales on the model performance. Then, a BPM is proposed to address the fusion inconsistency in the FPN. Additionally, an attention module (A(c)) is added to eliminate the information loss in the bottom-up aggregation process. EFPN is evaluated by combining it with state-of-the-art detection methods. Extensive experimental results on two datasets MS-COCO and VOC2007 demonstrate the effectiveness of the proposed method.
引用
收藏
页数:14
相关论文
共 42 条
[1]  
[Anonymous], 2010, International journal of computer vision, DOI DOI 10.1007/s11263-009-0275-4
[2]   LRP-net: A lightweight recursive pyramid network for single image deraining [J].
Bi Xiaojun ;
Chen Zheng ;
Yue Jianyu ;
Wang Haibo .
NEUROCOMPUTING, 2022, 490 :181-192
[3]   Soft-NMS - Improving Object Detection With One Line of Code [J].
Bodla, Navaneeth ;
Singh, Bharat ;
Chellappa, Rama ;
Davis, Larry S. .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :5562-5570
[4]   Cascade R-CNN: Delving into High Quality Object Detection [J].
Cai, Zhaowei ;
Vasconcelos, Nuno .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :6154-6162
[5]  
Chen K., 2019, arXiv preprint arXiv:1906.07155
[6]   CenterNet: Keypoint Triplets for Object Detection [J].
Duan, Kaiwen ;
Bai, Song ;
Xie, Lingxi ;
Qi, Honggang ;
Huang, Qingming ;
Tian, Qi .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6568-6577
[7]   NAS-FPN: Learning Scalable Feature Pyramid Architecture for Object Detection [J].
Ghiasi, Golnaz ;
Lin, Tsung-Yi ;
Le, Quoc V. .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :7029-7038
[8]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[9]   Effective Fusion Factor in FPN for Tiny Object Detection [J].
Gong, Yuqi ;
Yu, Xuehui ;
Ding, Yao ;
Peng, Xiaoke ;
Zhao, Jian ;
Han, Zhenjun .
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, :1159-1167
[10]   AugFPN: Improving Multi-scale Feature Learning for Object Detection [J].
Guo, Chaoxu ;
Fan, Bin ;
Zhang, Qian ;
Xiang, Shiming ;
Pan, Chunhong .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :12592-12601