Panoptic Feature Pyramid Networks

被引:908
作者
Kirillov, Alexander [1 ]
Girshick, Ross [1 ]
He, Kaiming [1 ]
Dollar, Piotr [1 ]
机构
[1] Facebook AI Res FAIR, Paris, France
来源
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年
关键词
D O I
10.1109/CVPR.2019.00656
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The recently introduced panoptic segmentation task has renewed our community's interest in unifying the tasks of instance segmentation (for thing classes) and semantic segmentation (for stuff classes). However, current state-of-the-art methods for this joint task use separate and dissimilar networks for instance and semantic segmentation, without performing any shared computation. In this work, we aim to unify these methods at the architectural level, designing a single network for both tasks. Our approach is to endow Mask R-CNN, a popular instance segmentation method, with a semantic segmentation branch using a shared Feature Pyramid Network (FPN) backbone. Surprisingly, this simple baseline not only remains effective for instance segmentation, but also yields a lightweight, top-performing method for semantic segmentation. In this work, we perform a detailed study of this minimally extended version of Mask R-CNN with FPN, which we refer to as Panoptic FPN, and show it is a robust and accurate baseline for both tasks. Given its effectiveness and conceptual simplicity, we hope our method can serve as a strong baseline and aid future research in panoptic segmentation.
引用
收藏
页码:6392 / 6401
页数:10
相关论文
共 60 条
  • [51] Wang XP, 2018, IDEAS HIST MOD CHINA, V19, P1, DOI 10.1163/9789004385580_002
  • [52] Wu Yuxin, 2018, ECCV
  • [53] Aggregated Residual Transformations for Deep Neural Networks
    Xie, Saining
    Girshick, Ross
    Dollar, Piotr
    Tu, Zhuowen
    He, Kaiming
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5987 - 5995
  • [54] Yao J, 2012, PROC CVPR IEEE, P702, DOI 10.1109/CVPR.2012.6247739
  • [55] SyncSpecCNN: Synchronized Spectral CNN for 3D Shape Segmentation
    Yi, Li
    Su, Hao
    Guo, Xingwen
    Guibas, Leonidas
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6584 - 6592
  • [56] Yu F, 2016, ICLR
  • [57] PSANet: Point-wise Spatial Attention Network for Scene Parsing
    Zhao, Hengshuang
    Zhang, Yi
    Liu, Shu
    Shi, Jianping
    Loy, Chen Change
    Lin, Dahua
    Jia, Jiaya
    [J]. COMPUTER VISION - ECCV 2018, PT IX, 2018, 11213 : 270 - 286
  • [58] Pyramid Scene Parsing Network
    Zhao, Hengshuang
    Shi, Jianping
    Qi, Xiaojuan
    Wang, Xiaogang
    Jia, Jiaya
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6230 - 6239
  • [59] Cascade R-CNN: Delving into High Quality Object Detection
    Cai, Zhaowei
    Vasconcelos, Nuno
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6154 - 6162
  • [60] Zhou B., 2017, IEEE C COMP VIS PATT, P633