Feature Pyramid Networks for Object Detection

被引:14861
作者
Lin, Tsung-Yi [1 ,2 ,3 ]
Dollar, Piotr [1 ]
Girshick, Ross [1 ]
He, Kaiming [1 ]
Hariharan, Bharath [1 ]
Belongie, Serge [2 ,3 ]
机构
[1] Facebook AI Res, Menlo Pk, CA USA
[2] Cornell Univ, Ithaca, NY 14853 USA
[3] Cornell Tech, New York, NY 10044 USA
来源
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017) | 2017年
关键词
D O I
10.1109/CVPR.2017.106
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature pyramids are a basic component in recognition systems for detecting objects at different scales. But recent deep learning object detectors have avoided pyramid representations, in part because they are compute and memory intensive. In this paper, we exploit the inherent multi-scale, pyramidal hierarchy of deep convolutional networks to construct feature pyramids with marginal extra cost. A topdown architecture with lateral connections is developed for building high-level semantic feature maps at all scales. This architecture, called a Feature Pyramid Network (FPN), shows significant improvement as a generic feature extractor in several applications. Using FPN in a basic Faster R-CNN system, our method achieves state-of-the-art single-model results on the COCO detection benchmark without bells and whistles, surpassing all existing single-model entries including those from the COCO 2016 challenge winners. In addition, our method can run at 5 FPS on a GPU and thus is a practical and accurate solution to multi-scale object detection. Code will be made publicly available.
引用
收藏
页码:936 / 944
页数:9
相关论文
共 39 条
  • [31] ImageNet Classification with Deep Convolutional Neural Networks
    Krizhevsky, Alex
    Sutskever, Ilya
    Hinton, Geoffrey E.
    [J]. COMMUNICATIONS OF THE ACM, 2017, 60 (06) : 84 - 90
  • [32] Lin TY, 2014, ECCV, P740, DOI DOI 10.1007/978-3-319-10602-1_48
  • [33] Liu W., 2016, ECCV
  • [34] Liu W, 2016, INT WORKS EARTH OB
  • [35] Long J., 2015, P 2015 IEEE C COMPUT
  • [36] U-Net: Convolutional Networks for Biomedical Image Segmentation
    Ronneberger, Olaf
    Fischer, Philipp
    Brox, Thomas
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION, PT III, 2015, 9351 : 234 - 241
  • [37] ImageNet Large Scale Visual Recognition Challenge
    Russakovsky, Olga
    Deng, Jia
    Su, Hao
    Krause, Jonathan
    Satheesh, Sanjeev
    Ma, Sean
    Huang, Zhiheng
    Karpathy, Andrej
    Khosla, Aditya
    Bernstein, Michael
    Berg, Alexander C.
    Fei-Fei, Li
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 115 (03) : 211 - 252
  • [38] Simonyan K., 2014, P 3 INT C LEARN REP, P1
  • [39] Zagoruyko S., 2016, BMVC