Enhanced feature pyramidal network for object detection

被引:4
作者
Shao, Mingwen [1 ]
Zhang, Wei [1 ]
Li, Yunhao [1 ]
Fan, Bingbing [1 ]
机构
[1] China Univ Petr, Coll Comp Sci & Technol, Qingdao, Peoples R China
关键词
object detection; machine learning; computer vision; deep convolutional neural networks;
D O I
10.1117/1.JEI.31.1.013030
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Powerful features, which contain more representative information, have become increasingly important in object detection. We exploit the attention mechanism and dilated convolution to strengthen the features used to construct the original feature pyramid network (FPN) and introduce a network that combines the dilated convolution and attention mechanism based on FPN (DAFPN). Specifically, motivated by the attention mechanism, a level-independent attention module (LIAM) is proposed to make high-level feature maps focus on semantic information and low-level feature maps concentrate on spatial information. Meanwhile, we present a pyramidal dilated convolution module (PDCM) that replaces standard convolution with dilated convolution. Instead of previous works that use the same dilation rate for all scales of feature maps, the PDCM applies dilation convolution with various dilation rates to enlarge the effective receptive field of each level's feature maps suitably. Extensive experiments show that our DAFPN achieves extraordinary performance compared to the state-of-the-art FPN-based detectors on MS COCO benchmark. (C) 2022 SPIE and IS&T
引用
收藏
页数:12
相关论文
共 35 条
[1]  
Chen K., 2019, CoRR abs/1906.07155
[2]   Context Refinement for Object Detection [J].
Chen, Zhe ;
Huang, Shaoli ;
Tao, Dacheng .
COMPUTER VISION - ECCV 2018, PT VIII, 2018, 11212 :74-89
[3]  
Fu C., 2017, DSSD: deconvolutional single shot detector
[4]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[5]   AugFPN: Improving Multi-scale Feature Learning for Object Detection [J].
Guo, Chaoxu ;
Fan, Bin ;
Zhang, Qian ;
Xiang, Shiming ;
Pan, Chunhong .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :12592-12601
[6]  
He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/ICCV.2017.322, 10.1109/TPAMI.2018.2844175]
[7]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[8]   Bounding Box Regression with Uncertainty for Accurate Object Detection [J].
He, Yihui ;
Zhu, Chenchen ;
Wang, Jianren ;
Savvides, Marios ;
Zhang, Xiangyu .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :2883-2892
[9]   CornerNet: Detecting Objects as Paired Keypoints [J].
Law, Hei ;
Deng, Jia .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (03) :642-656
[10]   Scale-Aware Trident Networks for Object Detection [J].
Li, Yanghao ;
Chen, Yuntao ;
Wang, Naiyan ;
Zhang, Zhaoxiang .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6053-6062