Adaptive Feature Pyramid Networks for Object Detection

Cited by: 54
Authors
Wang, Chengyang [1]
Zhong, Caiming [1]
Affiliations
[1] Ningbo Univ, Coll Sci & Technol, Ningbo 315300, Peoples R China
Keywords
Feature extraction; Object detection; Adaptive systems; Prediction algorithms; Location awareness; Interpolation; Semantics; feature pyramid network; adaptive feature pyramid network
DOI
10.1109/ACCESS.2021.3100369
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Scale variation is a persistent challenge in general object detection. Many current methods employ feature pyramid networks to alleviate the problems caused by the large range of object scales: multi-level features extracted from the backbone are upsampled top-down and fused to obtain a set of multi-scale deep image features. However, the feature pyramid network proposed by Lin et al. adopts a simple fusion method that does not consider the context of the features being fused, which makes it difficult to obtain good features. In addition, fusing multi-scale features directly with traditional upsampling is prone to feature misalignment and loss of detail. This paper proposes an adaptive feature pyramid network, built on the feature pyramid network, to alleviate these problems. It comprises two major designs: adaptive feature upsampling and adaptive feature fusion. Adaptive feature upsampling uses a model to predict a group of sampling points for each pixel and forms the pixel's feature representation by combining the features at those sampling points, while adaptive feature fusion uses an attention mechanism to construct pixel-level fusion weights between the features being fused. Experimental results verify the effectiveness of the proposed method. On the public object detection dataset MS-COCO test-dev, the adaptive feature pyramid network improves the Faster R-CNN model by 1.2 AP and the FCOS model by 1.0 AP. The experiments also show that the proposed adaptive feature pyramid network localizes objects more accurately.
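The two designs named in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, tensor layouts, and the use of softmax weights are assumptions made for illustration, and the offsets, mixing logits, and projection `w` are random stand-ins for quantities that a trained network would predict.

```python
import numpy as np

def softmax(z, axis=0):
    """Numerically stable softmax along one axis."""
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def bilinear_sample(x, ys, xs):
    """Sample a (C, H, W) map at fractional coordinates; returns (C, *ys.shape)."""
    _, h, w = x.shape
    ys = np.clip(ys, 0, h - 1)
    xs = np.clip(xs, 0, w - 1)
    y0 = np.floor(ys).astype(int)
    x0 = np.floor(xs).astype(int)
    y1 = np.minimum(y0 + 1, h - 1)
    x1 = np.minimum(x0 + 1, w - 1)
    wy, wx = ys - y0, xs - x0
    top = x[:, y0, x0] * (1 - wx) + x[:, y0, x1] * wx
    bot = x[:, y1, x0] * (1 - wx) + x[:, y1, x1] * wx
    return top * (1 - wy) + bot * wy

def adaptive_upsample(coarse, offsets, mix_logits, factor=2):
    """Upsample (C, h, w) by combining K sampling points per output pixel.

    offsets:    (K, 2, h*factor, w*factor), assumed (dy, dx) in coarse-pixel units
    mix_logits: (K, h*factor, w*factor), assumed per-point combination scores
    """
    _, h, w = coarse.shape
    gy, gx = np.meshgrid(np.arange(h * factor), np.arange(w * factor), indexing="ij")
    base_y = (gy + 0.5) / factor - 0.5   # output pixel centre in coarse coordinates
    base_x = (gx + 0.5) / factor - 0.5
    samples = bilinear_sample(coarse, base_y + offsets[:, 0], base_x + offsets[:, 1])
    mix = softmax(mix_logits, axis=0)    # weights over the K points sum to 1
    return (samples * mix).sum(axis=1)   # (C, h*factor, w*factor)

def adaptive_fuse(lateral, top_down, w):
    """Fuse two (C, H, W) maps with pixel-level weights.

    w: hypothetical (2, 2C) projection standing in for a learned 1x1 convolution
    that scores each of the two inputs at every pixel.
    """
    stacked = np.concatenate([lateral, top_down], axis=0)   # (2C, H, W)
    logits = np.einsum("kc,chw->khw", w, stacked)           # (2, H, W)
    weights = softmax(logits, axis=0)                       # sums to 1 per pixel
    fused = weights[0] * lateral + weights[1] * top_down    # broadcast over channels
    return fused, weights

# Example with random stand-ins for the predicted quantities.
rng = np.random.default_rng(0)
C, K = 4, 3
lateral = rng.standard_normal((C, 8, 8))
coarse = rng.standard_normal((C, 4, 4))
offsets = 0.5 * rng.standard_normal((K, 2, 8, 8))
mix_logits = rng.standard_normal((K, 8, 8))
top_down = adaptive_upsample(coarse, offsets, mix_logits)
fused, weights = adaptive_fuse(lateral, top_down, rng.standard_normal((2, 2 * C)))
```

With all offsets and logits set to zero, `adaptive_upsample` reduces to ordinary bilinear upsampling and `adaptive_fuse` to a plain average of the two maps, which is the fixed behaviour the abstract contrasts the adaptive designs against.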
Pages: 107024-107032
Page count: 9
Related papers
30 in total
[1] [Anonymous], 2018, NEURIPS
[2] Dollar, Piotr; Appel, Ron; Belongie, Serge; Perona, Pietro. Fast Feature Pyramids for Object Detection [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (08): 1532-1545
[3] Duan, Kaiwen; Bai, Song; Xie, Lingxi; Qi, Honggang; Huang, Qingming; Tian, Qi. CenterNet: Keypoint Triplets for Object Detection [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019: 6568-6577
[4] Fu C.Y., 2017, ARXIV
[5] Ghiasi, Golnaz; Lin, Tsung-Yi; Le, Quoc V. NAS-FPN: Learning Scalable Feature Pyramid Architecture for Object Detection [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019: 7029-7038
[6] Girshick, Ross. Fast R-CNN [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015: 1440-1448
[7] Girshick, Ross; Donahue, Jeff; Darrell, Trevor; Malik, Jitendra. Rich feature hierarchies for accurate object detection and semantic segmentation [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014: 580-587
[8] Guo, CuiPing; Bai, QingKe; Zhao, ZhenGuo; Zhang, JianYing. Recombinant Tissue-Type Plasminogen Activator Study of Wake-Up Ischemic Strokes Guided by Rapid MRI [J]. CEREBROVASCULAR DISEASES, 2019, 48 (1-2): 85-90
[9] Guo, Ziyuan; Zhang, Weimin; Liang, Zhenshuo; Shi, Yongliang; Huang, Qiang. Multi-Scale Object Detection Using Feature Fusion Recalibration Network [J]. IEEE ACCESS, 2020, 8: 51664-51673
[10] He KM, 2017, IEEE I CONF COMP VIS, P2980, DOI [10.1109/TPAMI.2018.2844175, 10.1109/ICCV.2017.322]