Panoptic Feature Pyramid Networks

Cited by: 1045

Authors:

Kirillov, Alexander [1]

Girshick, Ross [1]

He, Kaiming [1]

Dollar, Piotr [1]

Affiliations:

[1] Facebook AI Res FAIR, Paris, France

Source:

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019

Keywords:

DOI:

10.1109/CVPR.2019.00656

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

The recently introduced panoptic segmentation task has renewed our community's interest in unifying the tasks of instance segmentation (for thing classes) and semantic segmentation (for stuff classes). However, current state-of-the-art methods for this joint task use separate and dissimilar networks for instance and semantic segmentation, without performing any shared computation. In this work, we aim to unify these methods at the architectural level, designing a single network for both tasks. Our approach is to endow Mask R-CNN, a popular instance segmentation method, with a semantic segmentation branch using a shared Feature Pyramid Network (FPN) backbone. Surprisingly, this simple baseline not only remains effective for instance segmentation, but also yields a lightweight, top-performing method for semantic segmentation. In this work, we perform a detailed study of this minimally extended version of Mask R-CNN with FPN, which we refer to as Panoptic FPN, and show it is a robust and accurate baseline for both tasks. Given its effectiveness and conceptual simplicity, we hope our method can serve as a strong baseline and aid future research in panoptic segmentation.
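The shared-computation idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the backbone and both branches below are hypothetical toy functions on 1-D lists (names such as `toy_fpn_backbone` and `SingleNetwork` are invented for illustration). Only the structure, one feature pyramid computed once and consumed by both an instance branch and a semantic branch, reflects the description above.

```python
from typing import Callable, Dict, List

# Hypothetical type: pyramid level name -> toy 1-D feature map.
Pyramid = Dict[str, List[float]]


def toy_fpn_backbone(image: List[float]) -> Pyramid:
    """Toy stand-in for an FPN backbone: halve the resolution at each level."""
    pyramid: Pyramid = {}
    level = list(image)
    for name in ("p2", "p3", "p4"):
        pyramid[name] = level
        # Average adjacent pairs to mimic a stride-2 downsampling step.
        level = [(level[i] + level[i + 1]) / 2 for i in range(0, len(level) - 1, 2)]
    return pyramid


def toy_instance_head(features: Pyramid) -> List[int]:
    """Toy 'thing' branch: indices of above-average responses at the coarsest level."""
    coarse = features["p4"]
    mean = sum(coarse) / len(coarse)
    return [i for i, value in enumerate(coarse) if value > mean]


def toy_semantic_head(features: Pyramid) -> List[int]:
    """Toy 'stuff' branch: one binary label per position at the finest level."""
    fine = features["p2"]
    mean = sum(fine) / len(fine)
    return [1 if value > mean else 0 for value in fine]


class SingleNetwork:
    """One backbone pass feeds both branches, instead of two separate networks."""

    def __init__(
        self,
        backbone: Callable[[List[float]], Pyramid],
        instance_head: Callable[[Pyramid], List[int]],
        semantic_head: Callable[[Pyramid], List[int]],
    ) -> None:
        self.backbone = backbone
        self.instance_head = instance_head
        self.semantic_head = semantic_head
        self.backbone_calls = 0  # counts how often features are computed

    def __call__(self, image: List[float]) -> Dict[str, List[int]]:
        self.backbone_calls += 1
        features = self.backbone(image)  # shared computation happens here, once
        return {
            "instance": self.instance_head(features),
            "semantic": self.semantic_head(features),
        }


model = SingleNetwork(toy_fpn_backbone, toy_instance_head, toy_semantic_head)
output = model([0, 0, 1, 1, 4, 4, 1, 1])
```

In this toy setup, both outputs come from a single backbone call, which is the contrast the abstract draws with methods that run separate networks for the two tasks.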


Pages: 6392-6401

Page count: 10

