Panoptic Feature Pyramid Networks

被引:908
作者
Kirillov, Alexander [1 ]
Girshick, Ross [1 ]
He, Kaiming [1 ]
Dollar, Piotr [1 ]
机构
[1] Facebook AI Res FAIR, Paris, France
来源
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年
关键词
D O I
10.1109/CVPR.2019.00656
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The recently introduced panoptic segmentation task has renewed our community's interest in unifying the tasks of instance segmentation (for thing classes) and semantic segmentation (for stuff classes). However, current state-of-the-art methods for this joint task use separate and dissimilar networks for instance and semantic segmentation, without performing any shared computation. In this work, we aim to unify these methods at the architectural level, designing a single network for both tasks. Our approach is to endow Mask R-CNN, a popular instance segmentation method, with a semantic segmentation branch using a shared Feature Pyramid Network (FPN) backbone. Surprisingly, this simple baseline not only remains effective for instance segmentation, but also yields a lightweight, top-performing method for semantic segmentation. In this work, we perform a detailed study of this minimally extended version of Mask R-CNN with FPN, which we refer to as Panoptic FPN, and show it is a robust and accurate baseline for both tasks. Given its effectiveness and conceptual simplicity, we hope our method can serve as a strong baseline and aid future research in panoptic segmentation.
引用
收藏
页码:6392 / 6401
页数:10
相关论文
共 60 条
  • [41] Seeing through the Human Reporting Bias: Visual Classifiers from Noisy Human-Centric Labels
    Misra, Ishan
    Zitnick, C. Lawrence
    Mitchell, Margaret
    Girshick, Ross
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2930 - 2939
  • [42] Stacked Hourglass Networks for Human Pose Estimation
    Newell, Alejandro
    Yang, Kaiyu
    Deng, Jia
    [J]. COMPUTER VISION - ECCV 2016, PT VIII, 2016, 9912 : 483 - 499
  • [43] Pham V.-Q., 2017, P IEEE INT C FUZZ SY
  • [44] Learning to Refine Object Segments
    Pinheiro, Pedro O.
    Lin, Tsung-Yi
    Collobert, Ronan
    Dollar, Piotr
    [J]. COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 : 75 - 91
  • [45] Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
    Ren, Shaoqing
    He, Kaiming
    Girshick, Ross
    Sun, Jian
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (06) : 1137 - 1149
  • [46] U-Net: Convolutional Networks for Biomedical Image Segmentation
    Ronneberger, Olaf
    Fischer, Philipp
    Brox, Thomas
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION, PT III, 2015, 9351 : 234 - 241
  • [47] ImageNet Large Scale Visual Recognition Challenge
    Russakovsky, Olga
    Deng, Jia
    Su, Hao
    Krause, Jonathan
    Satheesh, Sanjeev
    Ma, Sean
    Huang, Zhiheng
    Karpathy, Andrej
    Khosla, Aditya
    Bernstein, Michael
    Berg, Alexander C.
    Fei-Fei, Li
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 115 (03) : 211 - 252
  • [48] Scene Parsing with Object Instances and Occlusion Ordering
    Tighe, Joseph
    Niethammer, Marc
    Lazebnik, Svetlana
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 3748 - 3755
  • [49] Image parsing: Unifying segmentation, detection, and recognition
    Tu, ZW
    Chen, XG
    Yuille, AL
    Zhu, SC
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2005, 63 (02) : 113 - 140
  • [50] Vijay B., 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence, V39, P2481, DOI DOI 10.1109/TPAMI.2016.2644615